State-of-the-art TTS model under 25MB
Transforming Multimodal Content into Captivating Multilingual Audio
Capable of understanding text, audio, vision, video
High-quality multilingual text-to-speech library by MyShell.ai
GLM-4-Voice | End-to-End Chinese-English Conversational Model
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A Conversational Speech Generation Model
Implementation of NÜWA, an attention network for text-to-video synthesis
PyTorch implementation of convolutional neural networks
C++ class library for sound analysis, synthesis, and morphing
A cross-platform wrapper for common text-to-speech engines in Python
An Incremental Spoken Dialogue Processing Toolkit
Python to eSpeak speech synthesis
A graphical user interface (GUI) for the FluidSynth SoundFont player
SuperCollider Code for Livecoding Experimental Sound