Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

Features

  • YouTube Downloader: You can download YouTube videos and extract the audio (mp3, wav, flac)
  • Vocal Remover: Use MDX-Net supported in UVR5 and the Demucs engine developed by Meta for voice separation
  • STT: Supports speech-to-text conversion with Whisper, Faster-Whisper, and whisper-timestamped
  • Translator: Google Translator. Short text translation, subtitle file translation
  • TTS: Text to Speech. Edge-TTS. E2 and F5-TTS that support zero-shot voice cloning
  • We provide Celeb voices for free. Try creating your own podcast. You can check it in the F5-TTS tab

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

MIT License

Follow Voice-Pro

Voice-Pro Web Site

You Might Also Like
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5

User Reviews

  • Tried Voice-Pro on my RTX 3080 desktop. The quality is truly excellent, and it includes voice cloning capabilities using F5-TTS and CosyVoice. The installation was very simple, and the usage is quite intuitive, so I think it's worth a try. Before installing this project, I checked their YouTube demo video, and I was able to achieve the same results on my desktop as shown in the demo. It offers transcription, translation, Edge-TTS and kokoro through the Gradio WebUI. It's a great tool for youtube creators. I hope you find it helpful.
Read more reviews >

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2024-11-27