Speech-AI-Forge is a project developed around TTS generation models
Code for openai.fm, a demo for the OpenAI Speech API
Robust Speech Recognition via Large-Scale Weak Supervision
Synchronized Translation for Videos
A robust, efficient, low-latency speech-to-text library
StreamSpeech is a seamless model for offline speech recognition
PersonaPlex code
Towards Human-Level Text-to-Speech through Style Diffusion
A high-quality rapid TTS voice cloning model
Qwen3-TTS is an open-source series of TTS models
Use Microsoft Edge's online text-to-speech service from Python
Industrial-level controllable zero-shot text-to-speech system
A lightweight text-to-speech model with zero-shot voice cloning
Open-source text-to-speech tool that supports extra-long text
A TTS that fits in your CPU (and pocket)
Python library and CLI tool to interface with Google Translate
Speakr is a personal, self-hosted web application
End-to-end speech processing toolkit
A fast TTS architecture with conditional flow matching
TTS with kokoro and onnx runtime
Run local LLMs like llama, deepseek, kokoro, etc. inside your browser
An open-source text-to-speech system built by inverting Whisper
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
MARS5 speech model (TTS) from CAMB.AI
Qwen3-ASR is an open-source series of ASR models