Offline speech recognition API for Android, iOS, Raspberry Pi
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Audio foundation model excelling in audio understanding
A PyTorch-based Speech Toolkit
kaldi-asr/kaldi is the official location of the Kaldi project
On-device Speech Recognition for Apple Silicon
Captcha solver extension for humans
A free, open source, and extensible speech-to-text application
Multilingual Automatic Speech Recognition with word-level timestamps
Port of OpenAI's Whisper model in C/C++
StreamSpeech is a seamless model for offline speech recognition
Toolkit for conversational AI
Cross-platform AI language practice app
Underthesea - Vietnamese NLP Toolkit
OpenVINO™ Toolkit repository
Run local LLMs like llama, deepseek, kokoro, etc. inside your browser
Repo of Qwen2-Audio chat & pretrained large audio language model
Speech to Text to Speech; sends text as OSC messages
Cross-platform software for text translation and recognition
Training data (data labeling, annotation, workflow) for all data types
Capable of understanding text, audio, vision, video
The behavior guidance framework for customer-facing LLM agents
AzioSpeech Recognition and Translation