lobo/stt

Live microphone speech-to-text using Vosk

Emil Lerch 049ea33c8d ai did not understand defer in that function		2025-09-15 19:44:21 -07:00
src	ai did not understand defer in that function	2025-09-15 19:44:21 -07:00
.gitignore	AI generated first pass	2025-09-09 10:15:16 -07:00
.mise.toml	working sample without nix needed	2025-09-09 19:02:07 -07:00
.pre-commit-config.yaml	add structural aspects to repo	2025-09-09 14:08:57 -07:00
alsa.conf	switch to alsa default device and ask users to configure through alsa.conf	2025-09-10 18:05:51 -07:00
build.zig	remove demo - we will use. Also add --exec handling	2025-09-10 18:40:53 -07:00
build.zig.zon	support linux arm64, riscv64, and x86_64	2025-09-09 20:14:49 -07:00
LICENSE	AI generated first pass	2025-09-09 10:15:16 -07:00
README.md	switch to alsa default device and ask users to configure through alsa.conf	2025-09-10 18:05:51 -07:00

Real-time Speech Recognition with Vosk and Zig

This project implements a minimal real-time speech-to-text application using Vosk and Zig.

Audio Device Configuration

The application uses ALSA's default device, which is configured in alsa.conf. To use a different audio device, edit alsa.conf and update the pcm.!default section:

pcm.!default {
    type hw
    card 3      # Change to your card number
    device 0    # Change to your device number
}
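
The card and device numbers for a capture device can usually be read from the output of `arecord -l` (part of alsa-utils). As a minimal sketch, the helper below prints a pcm.!default block for a given card and device so it can be pasted into alsa.conf. The function name `alsa_default_block` is hypothetical and is not part of this repository; the values 3 and 0 are just the ones from the example above.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repository): print a pcm.!default
# block for the given card and device numbers.
# Find the numbers for your microphone with `arecord -l`.
alsa_default_block() {
    card="$1"
    device="$2"
    printf 'pcm.!default {\n    type hw\n    card %s\n    device %s\n}\n' "$card" "$device"
}

# Example: card 3, device 0 (the same values as in the block above)
alsa_default_block 3 0
```

Redirecting the output to a file (for example `alsa_default_block 1 0 > alsa.conf`) would overwrite any other settings in that file, so review the result before replacing an existing configuration.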

The application uses the Vosk small English model for speech recognition.

The application will:

Vosk tends to misrecognize "light" as "lake" or "like".