Showing 293 open source projects for "voice to text"

  • 1
    Auto-Commenter

    A Claude skill that automatically posts personalized comments

    Auto-Commenter is a Claude-oriented automation project built to help users write and post comments that sound natural and context-aware in targeted online communities. It centers on learning a user’s writing style from their real comment history, then applying that style to generate responses that feel consistent with the user rather than generic template text. The workflow emphasizes deeper post analysis so the system can respond to what is actually being discussed, instead of replying with...
    Downloads: 1 This Week
  • 2
    Piper TTS

    A fast, local neural text to speech system

    ...Piper is particularly suited for offline voice assistants and embedded systems.
    Downloads: 477 This Week
  • 3
    Node.js Telegram Bot API

    Telegram Bot API for NodeJS

    TelegramBot is an EventEmitter that emits several events. The message event is emitted when a new incoming Message of any kind is received. Depending on the properties of the Message, one of these events may also be emitted: text, audio, document, photo, sticker, video, voice, contact, location, new_chat_members, left_chat_member, new_chat_title, new_chat_photo, delete_chat_photo, group_chat_created, game, pinned_message, poll, dice, migrate_from_chat_id, migrate_to_chat_id, channel_chat_created, supergroup_chat_created, successful_payment, invoice, video_note, etc. ...
    Downloads: 0 This Week
  • 4

    texttalk

    Talk through typing the text

    Speaks typed text with a generated voice. The audio of the spoken text is obtained from the Google Translate web service. Ideal for remote support, as a voice proxy for privacy, etc.
    Downloads: 0 This Week
  • 5
    VALL-E X

    Open source implementation of Microsoft's VALL-E X zero-shot TTS model

    VALL-E-X is an open-source implementation of Microsoft’s VALL-E X zero-shot text-to-speech model, focused on multilingual, cross-lingual voice cloning. It is capable of synthesizing speech in English, Chinese, and Japanese from text while mimicking the voice characteristics of a speaker given only a short 3–10 second prompt. The model attempts to match not just timbre, but also tone, pitch, emotion, and prosody of the reference audio, resulting in highly personalized output. ...
    Downloads: 3 This Week
  • 6
    KoboldCpp

    Run GGUF models easily with a UI or API. One File. Zero Install.

    KoboldCpp is easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds on llama.cpp and adds many powerful features.
    Downloads: 419 This Week
  • 7
    EasyTTS

    Text to Speech Utility

    EasyTTS is a text-to-speech app for 64-bit Windows that offers online and offline text-to-speech with an adjustable voice speed. Languages other than English are supported only when you are connected to the Internet: Spanish, Portuguese, Russian, French, and Mandarin (?) Chinese.
    Downloads: 4 This Week
  • 8
    Simple TTS Reader

    A small clipboard reader

    Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray. Plus, it is free and comes with an intuitive interface that makes it accessible to everyone.
    Downloads: 100 This Week
  • 9
    CSM (Conversational Speech Model)

    A Conversational Speech Generation Model

    The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.
    Downloads: 4 This Week
  • 10
    pdf-to-podcast

    PDF to Podcast transforms any PDF document into a podcast-ready audio

    PDF to Podcast transforms any PDF document into a podcast-ready audio episode using advanced AI text-to-speech (TTS) providers. Upload a PDF, select your preferred voice and provider, and receive an MP3 and a ready-to-use RSS feed for your podcast app.
    Downloads: 0 This Week
  • 11
    SpeakFlow-TTS

    Multilingual Text-to-Speech (TTS)

    Excited to present SpeakFlow - an intuitive desktop application for Text-to-Speech (TTS) conversion! It allows you to easily transform entered text into high-quality audio files, using natural voices in many languages. Key features of SpeakFlow: Multilingual support: Choose from a wide range of languages and voices (Ukrainian, English, German, Russian, Polish, French, Italian, Spanish, Portuguese, and more). Simple and intuitive interface: Designed for quick and convenient audio...
    Downloads: 1 This Week
  • 12
    Feishu ChatGPT

    Voice dialogue, role-playing, multi-topic discussion, picture creation

    Feishu × (GPT-3.5 + DALL·E + Whisper) = flying-like work experience. Voice dialogue, role-playing, multi-topic discussion, picture creation, table analysis, document export. Golang language, it goes without saying! Master the gin framework proficiently, developing the backend is as natural as breathing! Familiar with the SDKs of DingTalk, Feishu, Qiwei and other platforms, and be able to develop and integrate a series of amazing functions! Proficient in platform-based detail thinking, let...
    Downloads: 10 This Week
  • 13
    SpeakLogPSU
    SpeakLogPSU can speak chat messages with an individual voice if one was configured for the NPC or player, or with a default voice otherwise. You will never miss it when someone talks to you. Voice cloning can be accomplished with Coqui in less than five minutes without a GPU. The result is archived and can be reused the next time you are in game. Some TTS projects have already started to add tag support to speak text with emotion or sing it.
    Downloads: 0 This Week
  • 14
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Downloads: 146 This Week
  • 15
    Whisper Batch Transcriber

    Unlimited, private and free Speech-To-Text program

    About: Automatically transcribe all of your voice recordings into clean, organized text files. It is free, fully automated and unlimited, and uses state-of-the-art speech-to-text technology. It works 100% offline on your computer, privately and locally. Use cases: Convert speeches, podcasts, webinars, monologues, storytelling and other spoken audio into a formatted .txt file.
    Downloads: 9 This Week
  • 16
    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface that lets a remote machine trigger TTS output on the server.
    Downloads: 0 This Week
  • 17
    Softwares For Blind, Deaf, Handicap

    Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People

    Download the zip file above, extract it, and then open the index.html file in a web browser such as Firefox (preferred) or Google Chrome. Keep NumLock on while using the numeric keypad of the keyboard. You can also attach an external USB keyboard with a separate numeric keypad if required. I have added some general guidelines for students using this software on the Wiki page of this website. Please refer to them for more instructions.
    Downloads: 1 This Week
  • 18
    ViaVoip

    A portable peer to peer voice-chat/walkie-talkie.

    ViaVoip is a simple Voice Over IP application that can be used when you need to talk, chat, or send files through the internet, but you can't or don't want to make use of any third party services. Its peer to peer design allows the two end points to connect directly to each other, without any central server nor account registration. It runs on Windows, Linux, Mac OS X and Android, and is portable, that is you don't need any setup, just get a copy and run it from any storage...
    Downloads: 4 This Week
  • 19
    Gangio Desktop

    Gangio is a full-featured chat and communication platform.

    Gangio is a full-featured chat and communication platform. It provides real-time messaging, server-based communities, direct messaging, voice/video communication, and social features in a modern, responsive interface.
    Downloads: 0 This Week
  • 20
    Pearl MATE Studio 12

    OSX Styled Powerful Audio Workstation

    Pearl MATE Studio 12 runs on an Ubuntu 24.04 base with no snap support. This release does, however, support Flatpak. Text to speech on Pearl is managed with Pied, which downloads and selects natural voice models; the keyboard shortcut <Alt>+s plays back highlighted text. This release of Pearl MATE Studio is a lot lighter on preinstalled software, so users may choose what they want by installing through our software manager, Gdebi, or Synaptic, which are preinstalled. ...
    Downloads: 6 This Week
  • 21
    Besgnulinux
    Based on Debian Stable. Installation tool: Calamares. Besgnulinux is for both new and weak machines and for end users (newbies). Besgnulinux tries to be fast, lightweight, easy to use and stable. It is designed to meet every need with window manager sessions like JWM and Openbox. Instead of background elements, the system is under user control. It does the same things that high-resource desktops like KDE and GNOME do, but with very few resources. Besgnulinux's goal is to...
    Downloads: 550 This Week
  • 22
    slop-index

    Analyzer that spots AI-slop and boilerplate in text and webpages

    ...The goal isn’t to “detect AI” for punishment—it’s to surface low-effort patterns so teams can raise the floor on quality and ship with receipts. Use it to gate drafts before they go live, audit legacy content, compare revisions, and keep brand voice from flattening
    Downloads: 1 This Week
  • 23
    cerberuscms2

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-like model. It is a custom-written Web Application Framework ( W.A.F. ) with a consistent and custom-written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project's aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content Management System in the world. The latest project version is programmed with the Pre-Hyper-Text-Post-Processor P.D.O. // P.H.P. ...
    Downloads: 2 This Week
  • 24

    MARS5-TTS

    MARS5 is a fully open-source, hyper-realistic text-to-speech (TTS).

    CAMB.AI introduces MARS5, a fully open-source (commercially usable) TTS with breakthrough prosody and realism, available on our GitHub: https://www.github.com/camb-ai/mars5-tts MARS5 is able to replicate performances (from 2-3s of audio reference) in 140+ languages, even for extremely tough prosodic scenarios like sports commentary, movies, anime and more; hard prosody that most closed-source and open-source TTS models struggle with today. We're excited for you to try, build on and use...
    Downloads: 0 This Week
  • 25
    Open Camera

    Camera app for Android.

    Open Camera is an Open Source Camera app for Android(TM) phones and tablets.
      * Get it on Google Play: https://play.google.com/store/apps/details?id=net.sourceforge.opencamera
      * Get it on F-Droid: https://f-droid.org/en/packages/net.sourceforge.opencamera/
      * Or you can download the APK/source directly, see "Files" above.
    Android, Google Play and the Google Play logo are trademarks of Google LLC.
    Downloads: 885 This Week