lobo/stt

Live microphone speech to text using vosk

Find a file

Emil Lerch c7a9808052 remove zlib from nix, add to build		2025-09-09 13:58:33 -07:00
src	clean up a bit, move to dependencies, remove unnecessary nix packages	2025-09-09 13:30:20 -07:00
.gitignore	AI generated first pass	2025-09-09 10:15:16 -07:00
.mise.toml	AI generated first pass	2025-09-09 10:15:16 -07:00
build.zig	remove zlib from nix, add to build	2025-09-09 13:58:33 -07:00
build.zig.zon	remove zlib from nix, add to build	2025-09-09 13:58:33 -07:00
flake.lock	AI generated first pass	2025-09-09 10:15:16 -07:00
flake.nix	remove zlib from nix, add to build	2025-09-09 13:58:33 -07:00
LICENSE	AI generated first pass	2025-09-09 10:15:16 -07:00
README.md	AI generated first pass	2025-09-09 10:15:16 -07:00

README.md

Real-time Speech Recognition with Vosk and Zig

This project implements a minimal real-time speech-to-text application using Vosk and Zig.

Setup

Prerequisites

Zig 0.15.1 (configured via mise)
Nix development environment with C compilation tools, ALSA, and audio libraries

Vosk Model Download

The application uses the Vosk small English model for speech recognition:

Source: https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip
Size: ~50MB
Language: English only
Accuracy: Good for simple sentences and commands

Installation Steps

Enter nix development environment: nix develop
Download Vosk model: wget https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip
Extract model: unzip vosk-model-small-en-us-0.15.zip
Build application: zig build
Run: ./zig-out/bin/stt

Usage

The application will:

Initialize audio capture from default microphone
Load the Vosk speech recognition model
Process audio in real-time
Output recognized text to terminal
Exit on Ctrl+C

Dependencies

Vosk C API library
ALSA for audio capture
Standard C libraries for audio processing