Speech-AI-Forge is a project developed around TTS generation models
Code for openai.fm, a demo for the OpenAI Speech API
Robust Speech Recognition via Large-Scale Weak Supervision
StreamSpeech is a seamless model for offline speech recognition
A robust, efficient, low-latency speech-to-text library
PersonaPlex code
Towards Human-Level Text-to-Speech through Style Diffusion
A high-quality rapid TTS voice cloning model
Qwen3-TTS is an open-source series of TTS models
Industrial-level controllable zero-shot text-to-speech system
Use Microsoft Edge's online text-to-speech service from Python
A lightweight text-to-speech model with zero-shot voice cloning
Open-source text-to-speech tool that supports extra-long text
A TTS that fits in your CPU (and pocket)
Python library and CLI tool to interface with Google Translate
Speakr is a personal, self-hosted web application
End-to-end speech processing toolkit
A fast TTS architecture with conditional flow matching
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
TTS with Kokoro and ONNX Runtime
An open-source text-to-speech system built by inverting Whisper
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
MARS5 speech model (TTS) from CAMB.AI
Qwen3-ASR is an open-source series of ASR models
Offline inference engine for art, real-time voice conversations