In-App assistant SDK to build a multimodal conversational UX websites
Virtual AI anchor that combines state-of-the-art technology
Offline inference engine for art, real-time voice conversations
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
Mobile and Web client for Codex and Claude Code, with realtime voice
Qwen3-TTS is an open-source series of TTS models
Large Audio Language Model built for natural interactions
This is a simple demonstration of more advanced, agentic patterns
Industrial-level controllable zero-shot text-to-speech system
PersonaPlex code
AI suite powered by state-of-the-art models and providing advanced AI
Generate high-definition story short videos with one click using AI
Code for openai.fm, a demo for the OpenAI Speech API
Minimal plugin that lets Claude Code call you on the phone
The python library for real-time communication
Controllable & emotion-expressive zero-shot TTS
Open Source TypeScript AI Agent Framework
Warcraft III Peon voice notifications (+ more!) for Claude Code
Open source text-to-speech tool, supports extra-long text
A Claude skill that automatically posts personalized comments
Qwen3-ASR is an open-source series of ASR models
Free & Easy AI Voice Accounting Software For Blind & Speechless People
Instructions on how to use the Realtime API on Microcontrollers
Workflow and speech recognition app
A Conversational Speech Generation Model