[go: up one dir, main page]

Speech To Text

11 articles about speech to text.

download-ggml-model.sh large-v3: How to Download the Full Whisper Large Model

·10 min read

Step-by-step guide to using download-ggml-model.sh large-v3 for whisper.cpp. Covers setup, model size, performance benchmarks on Apple Silicon, large-v3 vs large-v3-turbo, quantization, and troubleshooting.

whisperggmllarge-v3speech-to-textapple-siliconmacoswhisper-cpp

ggml-large-v3.bin: Complete Guide to Whisper's Largest GGML Model

·9 min read

Everything about ggml-large-v3.bin for whisper.cpp, including download, setup, performance benchmarks, quantization options, and when to choose it over the turbo variant.

whisperggmllarge-v3speech-to-textapple-siliconmacoswhisper-cpp

ggml-large-v3-turbo.bin: The Fast Whisper Model for Real-Time Transcription

·9 min read

Complete guide to ggml-large-v3-turbo.bin for whisper.cpp. Covers download, setup, quantization, benchmarks, and how the turbo model achieves 6x faster inference with minimal accuracy loss.

whisperggmllarge-v3-turbospeech-to-textapple-siliconwhisper-cppreal-time

whisper.cpp Metal on Apple Silicon: GPU Acceleration for Local Speech-to-Text

·11 min read

How to build and optimize whisper.cpp with Metal GPU acceleration on Apple Silicon Macs. Covers build flags, performance tuning, model selection, and real benchmarks.

whisper-cppmetalapple-silicongpu-accelerationspeech-to-textmacos

download-ggml-model.sh large-v3-turbo: Complete Guide to Downloading Whisper Models

·9 min read

How to use download-ggml-model.sh to get the large-v3-turbo model for whisper.cpp. Covers the script internals, model variants, troubleshooting, and performance on Apple Silicon.

whisperggmllarge-v3-turbospeech-to-textapple-siliconmacos

macOS Dictation With Your Own Model - Accessibility API for Text Insertion

·2 min read

How bring-your-own-key dictation apps on macOS use the Accessibility API for text insertion - local models, privacy, and real-time transcription.

dictationbyokaccessibility-apimacosspeech-to-textlocal-models

Voice-Activated AI Desktop Agents - Why Voice Beats Keyboard Shortcuts

·2 min read

Voice activation is more natural than hotkeys for multi-step AI agent tasks. Native private speech-to-text on Mac makes voice-first workflows practical.

voice-controlspeech-to-textkeyboard-shortcutsdesktop-agentmacosmacapps

Voice AI Latency Matters More Than Accuracy - On-Device WhisperKit Benchmarks

·4 min read

Why switching from cloud STT to on-device WhisperKit changed everything for our voice desktop agent. Real latency data, interruption handling, and why 0.46s changes user behavior.

voice-aiwhisperkitspeech-to-textlatencyon-deviceapple-silicondesktop-agent

Self-Hosted iOS Voice Keyboard for AI Agent Workflows

·2 min read

Voice input is massively underrated for AI workflows. A self-hosted iOS voice keyboard paired with a macOS desktop agent creates a hands-free automation

voice-inputios-keyboardself-hostedai-workflowsspeech-to-text

Running whisper.cpp on Apple Silicon for Local Voice Recognition

·2 min read

The best setup for local voice recognition on Mac: whisper.cpp with large-v3-turbo on Apple Silicon. Here is the model choice, pipeline architecture, and

whisperapple-siliconvoice-recognitionlocal-aispeech-to-text

Native Mac Speech-to-Text That Runs Locally - Privacy, Speed, and No Cloud

·3 min read

Why local speech-to-text on Mac matters for AI desktop agents. No cloud dependency, instant transcription, and complete privacy for voice-controlled automation.

speech-to-textlocalprivacymacosvoice-control

Browse by Topic