Speech to Text to Speech, sends text as OSC messages
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Build voice-based LLM agents. Modular + open source
Large Audio Language Model built for natural interactions
The behavior guidance framework for customer-facing LLM agents
Real-time voice interactive digital human
Conversational voice AI agents
A chatbot built on a large model
In-App assistant SDK to build a multimodal conversational UX for websites
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
Repo of Qwen2-Audio chat & pretrained large audio language model
TEN, a voice agent framework to create conversational AI.
Map location picker component for Android
Assistant SDK to build a multimodal conversational UX for Android
A free, open source, and extensible speech-to-text application
In-App assistant SDK to build a multimodal conversational UX for iOS
Build your own AI friend
Video translation and dubbing tool powered by LLMs
Bailing is a voice dialogue robot similar to GPT-4o
Deploy your private Gemini application for free with one click
Transform your voice in real time with the Voxal voice changer
iOS application for Lumo
Textream is a free macOS teleprompter app for streamers, interviewers