AirPlay audio player
Official repository for LTX-Video
Synchronized Translation for Videos
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A python tool that uses GPT-4, FFmpeg, and OpenCV
HLS.js is a JavaScript library that plays HLS in browsers
Generate blog articles from video or audio
Make videos programmatically with React
The python library for real-time communication
Streaming Real-time Audio-Driven Avatar Generation
The missing YouTube Music macOS app
The core of Membrane Framework, multimedia processing framework
A suite of advanced multi-modal LLMs
FFmpeg for browser, powered by WebAssembly
Large Multimodal Models for Video Understanding and Editing
This package provides an integration with FFmpeg for Laravel
A react-based starter app for using the Live API over websockets
Generate high-definition story short videos with one click using AI
A HTML5 video player with a parser that saves traffic
Build Vision Agents quickly with any model or video provider
Network transparent, client/server audio transport system
Python inference and LoRA trainer package for the LTX-2 audio–video
Open source text-to-speech tool, supports extra-long text