Speech To Text
11 articles about speech to text.
download-ggml-model.sh large-v3: How to Download the Full Whisper Large Model
Step-by-step guide to using download-ggml-model.sh large-v3 for whisper.cpp. Covers setup, model size, performance benchmarks on Apple Silicon, large-v3 vs large-v3-turbo, quantization, and troubleshooting.
ggml-large-v3.bin: Complete Guide to Whisper's Largest GGML Model
Everything about ggml-large-v3.bin for whisper.cpp, including download, setup, performance benchmarks, quantization options, and when to choose it over the turbo variant.
ggml-large-v3-turbo.bin: The Fast Whisper Model for Real-Time Transcription
Complete guide to ggml-large-v3-turbo.bin for whisper.cpp. Covers download, setup, quantization, benchmarks, and how the turbo model achieves 6x faster inference with minimal accuracy loss.
whisper.cpp Metal on Apple Silicon: GPU Acceleration for Local Speech-to-Text
How to build and optimize whisper.cpp with Metal GPU acceleration on Apple Silicon Macs. Covers build flags, performance tuning, model selection, and real benchmarks.
download-ggml-model.sh large-v3-turbo: Complete Guide to Downloading Whisper Models
How to use download-ggml-model.sh to get the large-v3-turbo model for whisper.cpp. Covers the script internals, model variants, troubleshooting, and performance on Apple Silicon.
macOS Dictation With Your Own Model - Accessibility API for Text Insertion
How bring-your-own-key dictation apps on macOS use the Accessibility API for text insertion, covering local models, privacy, and real-time transcription.
Voice-Activated AI Desktop Agents - Why Voice Beats Keyboard Shortcuts
Voice activation is more natural than hotkeys for multi-step AI agent tasks. Native, private speech-to-text on Mac makes voice-first workflows practical.
Voice AI Latency Matters More Than Accuracy - On-Device WhisperKit Benchmarks
Why switching from cloud STT to on-device WhisperKit changed everything for our voice desktop agent. Real latency data, interruption handling, and why 0.46s changes user behavior.
Self-Hosted iOS Voice Keyboard for AI Agent Workflows
Voice input is massively underrated for AI workflows. A self-hosted iOS voice keyboard paired with a macOS desktop agent creates a hands-free automation
Running whisper.cpp on Apple Silicon for Local Voice Recognition
The best setup for local voice recognition on Mac: whisper.cpp with large-v3-turbo on Apple Silicon. Here is the model choice, pipeline architecture, and
Native Mac Speech-to-Text That Runs Locally - Privacy, Speed, and No Cloud
Why local speech-to-text on Mac matters for AI desktop agents. No cloud dependency, instant transcription, and complete privacy for voice-controlled automation.