[go: up one dir, main page]

Showing 16 open source projects for "google tts nvda"

View related business solutions
  • Award Winning Time and Labor Software Icon
    Award Winning Time and Labor Software

    Synerion offers time and labor, advanced scheduling, absence management, labor allocation, timesheets, coreHR and more.

    Stop wasting time and resources on manual and error-prone paper-based workforce management with Synerion. Synerion offers a comprehensive range of workforce management solutions that goes beyond time and tracking. The platform also offers enhanced scheduling features, labor costing, absence management, and payroll integration.
    Learn More
  • The Leading Social Media Archiving Software Icon
    The Leading Social Media Archiving Software

    The world’s most dependable social media archiving software for records compliance and risk management

    ArchiveSocial connects directly to your social networks to capture and preserve all the content your organization posts and engages with, in-context and in near-real-time. And it all lives in one easy-to-use, secure archive, so you can easily manage your online communications and help your organization stay compliant with public records laws, regulations, and recordkeeping initiatives.
    Learn More
  • 1
    Cookbook (Google Gemini)

    Cookbook (Google Gemini)

    Examples and guides for using the Gemini API

    The Gemini Cookbook is an official repository of examples and guides for using Google’s Gemini API. It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    emily-gr

    emily-gr

    Emily combines NVDA and MARY TTS to help people with disabilities.

    Emily is a multilingual text to speech application based on Mary TTS and NVDA. It supports English, French, German, Italian, Turkish and Greek languages with many voices. It can be used as a NVDA addon or as a standalone application by people with reading disabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    RealtimeTTS

    RealtimeTTS

    Converts text to speech in realtime

    ...It is designed around a streaming model: you can feed it text incrementally (for example, as an LLM responds) and get audio output almost immediately, which keeps end-to-end latency very low. The library is engine-agnostic and plugs into a wide range of cloud and local TTS systems, including OpenAI, ElevenLabs, Azure, Coqui, Piper, StyleTTS2, Edge TTS, Google TTS, system TTS and others, so you can swap providers without rewriting your pipeline. It supports both internet-based engines and fully local engines, which lets you choose between privacy, cost, and quality trade-offs. RealtimeTTS also includes robustness features such as automatic fallbacks when a backend fails, so production systems can stay responsive even if one TTS provider is temporarily unavailable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 45 This Week
    Last Update:
    See Project
  • Smarter Packing Decisions for Retailers and 3PLs Icon
    Smarter Packing Decisions for Retailers and 3PLs

    Paccurate is an API-first cartonization solution.

    Paccurate is the only patented cartonization solution that optimizes for transportation costs directly. So you can have the right boxes, and control how they're packed.
    Learn More
  • 5
    Auto Synced & Translated Dubs

    Auto Synced & Translated Dubs

    Automatically translates the text of a video based on a subtitle file

    Auto-Synced-Translated-Dubs is a toolchain that automatically translates and re-dubs videos using AI voices while keeping the new speech aligned to the original timing via subtitle files. It assumes you have a human-made SRT (or similar) subtitle file; the script then uses translation services such as Google Cloud or DeepL to generate translated subtitle tracks in one or more target languages. Using the timestamps of each subtitle line, it computes the required duration of each spoken segment and synthesizes audio via neural TTS services, producing one audio clip per subtitle entry. The tool then time-stretches or compresses each TTS clip to match the original speech duration exactly, which preserves lip-sync and rhythm as closely as possible without manual editing. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    gTTS

    gTTS

    Python library and CLI tool to interface with Google Translate

    gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    SillyTavern

    SillyTavern

    LLM Frontend for Power Users

    Mobile-friendly, Multi-API (KoboldAI/CPP, Horde, NovelAI, Ooba, OpenAI, OpenRouter, Claude, Scale), VN-like Waifu Mode, Horde SD, System TTS, WorldInfo (lorebooks), customizable UI, auto-translate, and more prompt options than you'd ever want or need. Optional Extras server for more SD/TTS options + ChromaDB/Summarize. SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters...
    Downloads: 243 This Week
    Last Update:
    See Project
  • 8
    AudioLM - Pytorch

    AudioLM - Pytorch

    Implementation of AudioLM audio generation model in Pytorch

    Implementation of AudioLM, a Language Modeling Approach to Audio Generation out of Google Research, in Pytorch It also extends the work for conditioning with classifier free guidance with T5. This allows for one to do text-to-audio or TTS, not offered in the paper. Yes, this means VALL-E can be trained from this repository. It is essentially the same. This repository now also contains a MIT licensed version of SoundStream.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    ChatTTS_colab

    ChatTTS_colab

    One-click deployment (including offline integration package)

    ChatTTS_colab is a wrapper project around the ChatTTS model that focuses on “one-click” deployment, especially in Google Colab. It provides an integrated offline bundle and scripts for Windows and macOS so users can run ChatTTS locally without wrestling with complex environment setup. The repository includes Colab notebooks that launch a Gradio-based web UI and expose streaming TTS, making it possible to listen to generated audio as it is produced.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure User Management, Made Simple | Frontegg Icon
    Secure User Management, Made Simple | Frontegg

    Get 7,500 MAUs, 50 tenants, and 5 SSOs free – integrated into your app with just a few lines of code.

    Frontegg powers modern businesses with a user management platform that’s fast to deploy and built to scale. Embed SSO, multi-tenancy, and a customer-facing admin portal using robust SDKs and APIs – no complex setup required. Designed for the Product-Led Growth era, it simplifies setup, secures your users, and frees your team to innovate. From startups to enterprises, Frontegg delivers enterprise-grade tools at zero cost to start. Kick off today.
    Start for Free
  • 10
    VALL-E X

    VALL-E X

    Open source implementation of Microsoft's VALL-E X zero-shot TTS model

    VALL-E-X is an open-source implementation of Microsoft’s VALL-E X zero-shot text-to-speech model, focused on multilingual, cross-lingual voice cloning. It is capable of synthesizing speech in English, Chinese, and Japanese from text while mimicking the voice characteristics of a speaker given only a short 3–10 second prompt. The model attempts to match not just timbre, but also tone, pitch, emotion, and prosody of the reference audio, resulting in highly personalized output. VALL-E-X...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    flightcombat_chung
    Flightcombat chung is a 3D openGL space / air / sea / ground flight / car simulator written in compiled freebasic with combat against ships DCA and massive air /air /space dogfight.Give orders to your wingmens ,declare war or attack other planes,or fly in formation , refuel at airports or space stations and explore vastes satellite heightmap countries or planets by plane, spacecraft, foot or car. Can run on a small netbook with windows 7. Zipped and unzip with 7zip. (22/09/2015) added...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    guglinatts-en

    guglinatts-en

    Guglina TTS, special edition: in English (guglinatts-en)

    Guglina TTS, special edition: in English (guglinatts-en), is a voice synthesizer originally designed for Brazilian Portuguese. Uses the Google Translate text-to-speech API. Read screens for the visually impaired. Transforms text into audio, allowing blind or low-vision people to access content displayed on the screen. Although the main target audience for text-to-speech conversion systems - such as Guglina TTS EN - is people with visual impairment, this type of program can be used by people with dyslexia and other reading disabilities, people with severe as well as by pre-literate children. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    guglinatts-it

    guglinatts-it

    Guglina TTS, edizione speciale: in italiano (guglinatts-it)

    Guglina TTS, edizione speciale: in italiano (guglinatts-it), è un sintetizzatore vocale originariamente progettato per il portoghese brasiliano. Utilizza l'API di sintesi vocale di Google Traduttore. Leggi gli screenshot per gli ipovedenti. Trasforma il testo in audio, consentendo a persone non vedenti o ipovedenti di accedere al contenuto visualizzato sullo schermo.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    guglinatts

    guglinatts

    sintetizador de voz, em português do Brasil, que usa a API do Google

    Guglina TTS é um sintetizador de voz, em português do Brasil, que usa a API de conversão de texto em fala do Google Tradutor. Lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Guglina TTS – seja formado por pessoas com deficiência visual, esse tipo de programa pode ser usado por pessoas com dislexia e outras dificuldades de leitura, pessoas com deficiência severa de fala, bem como por crianças pré-alfabetizadas. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    The News Book

    A News Program Which Extract Contents from Websites And Share Them...

    This Is A News Program Which Extracts Contents From Several News Websites, And Make Them Available For Share On Social Network Sites.Currently Supports Facebook, Working On More Social Networks.It Contains Some Other Features Like You Tube Client And You Tube Video Sharer On Social Networks And Blogs.A Gmail Client.Also Contains The Feature Of Bookmarks For Both You Tube Videos And News Articles.Bookmarks For News Articles Also Works Offline.The Program Maintains The Article Format According To The Websites.Also Supports NSM(News Speech Manager) Which Read The Articles Using The TTS(Text To Speech) Mechanism, Very Useful For Blind Peoples.Very Secure In Gmail Login Process, The Program Remember Password Only When It Is Running,And To Open A Mail You Have To Login First And Once Only, Automatically Sign Out From Google Account On Application Close.Contains A Smooth And Simple User Interface,Working On User Defined GUI. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Provides Text to speech synthesis systems for Indian Languages. It has festival speech synthesiser integrated with screen readers like NVDA and ORCA for windows and linux based systems respectively.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next