Best Open Source C++ Speech Software

C++ Speech Software

Speech C++ Clear Filters

Browse free open source C++ Speech Software and projects below. Use the toggles on the left to filter open source C++ Speech Software by OS, license, language, programming language, and project status.

Gen AI apps are built with MongoDB Atlas
The database for AI-powered applications.

MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.

Start Free
La version gratuite d'Auth0 s'enrichit !
Gratuit pour 25 000 utilisateurs avec intégration Okta illimitée : concentrez-vous sur le développement de vos applications.

Vous l'avez demandé, nous l'avons fait ! Les versions gratuite et payante d'Auth0 incluent des options qui vous permettent de développer, déployer et faire évoluer vos applications en toute sécurité. Utilisez Auth0 dès maintenant pour découvrir tous ses avantages.

Essayez Auth0 gratuitement
1

eSpeak: speech synthesis

Text to Speech engine for English and many other languages. Compact size with clear but artificial pronunciation. Available as a command-line program with many options, a shared library for Linux, and a Windows SAPI5 version.

">

40 Reviews

Downloads: 2,648 This Week

Last Update: 2021-11-17
See Project
2

RHVoice

Free open source speech synthesizer for Russian and other languages

RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.

Downloads: 40 This Week

Last Update: 2024-07-04
See Project
3

Open JTalk

Open JTalk is a Japanese text-to-speech synthesis system. This software is released under the Modified BSD license.

">

Downloads: 918 This Week

Last Update: 2018-12-25
See Project
4

Mumble

Low-latency, high quality voice chat for gamers

Mumble is an open source, low-latency, high quality voice chat software primarily intended for use while gaming. It includes game linking, so voice from other players comes from the direction of their characters, and has echo cancellation so the sound from your loudspeakers won't be audible to other players.

">

169 Reviews

Downloads: 140 This Week

Last Update: 2022-01-22
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Downloads: 30 This Week

Last Update: 2021-04-08
See Project
6

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

">

16 Reviews

Downloads: 70 This Week

Last Update: 2025-10-03
See Project
7

MMDAgent

MMDAgent is the toolkit for building voice interaction systems. Users can design users own dialog scenario, 3D agents, and voices. This software is released under the Modified BSD license.

">

7 Reviews

Downloads: 85 This Week

Last Update: 2022-01-13
See Project
8

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems in post. With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. With text-to-speech, experience the immediacy of script-to-performance. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs. With Coqui text-to-speech, production times go from months to minutes.

Downloads: 8 This Week

Last Update: 2022-09-03
See Project
9

sourcesinc

Source code from the Research Institute for Signals, Systems and Computational Intelligence http://fich.unl.edu.ar/sinc

">

Downloads: 54 This Week

Last Update: 2023-12-05
See Project
ManageEngine Endpoint Central for IT Professionals
A one-stop Unified Endpoint Management (UEM) solution

ManageEngine's Endpoint Central is a Unified Endpoint Management Solution, that takes care of enterprise mobility management (including all features of mobile application management and mobile device management), as well as client management for a diversified range of endpoints - mobile devices, laptops, computers, tablets, server machines etc. With ManageEngine Endpoint Central, users can automate their regular desktop management routines like distributing software, installing patches, managing IT assets, imaging and deploying OS, and more.

Learn More
10

Sinsy

HMM-based singing voice synthesis system

Sinsy is an HMM-based singing voice synthesis system. This software is released under the Modified BSD license.

">

4 Reviews

Downloads: 15 This Week

Last Update: 2016-03-23
See Project
11

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki/ Domain advanced versions (Polish Language) 1. SkryBot Prawo - for judicial professionals. 2. SkryBot Administracyjny - for civil and government administration. 3. SkryBot Medycyna Rodzinna - for physicians Professional version of SkryBot (commercial) offers you: 1. Audio conversion and cutting sound files into smaller ones. 2. Searching for words or phrases in sound files (recognized by SkryBot). 3. Editing sound files and automatic cutting off long silence parts in audio file.

2 Reviews

Downloads: 11 This Week

Last Update: 2020-03-15
See Project
12

simon

The project provides a ready-to-use interface for the julius CSR engine for a handicapped child which is not able to use the keyboard well. It integrates into X11 and Windows. Find out how you can help: http://simon-listens.org/index.php?support

32 Reviews

Downloads: 5 This Week

Last Update: 2013-09-22
See Project
13

Open VXI VoiceXML Interpreter

The Open VXI VoiceXML interpreter is a portable open source library that interprets the VoiceXML dialog markup language. It is designed to serve as a reference for parties interested in understanding how VoiceXML markup might be executed.

">

Downloads: 24 This Week

Last Update: 2013-06-03
See Project
14

Omilo - a text to speech application

Omilo is a simple text to speech application

Omilo is a simple text to speech application for Windows and Linux using Festival, Flite, Marytts and Piper voices.

">

3 Reviews

Downloads: 12 This Week

Last Update: 2024-09-20
See Project
15

TranscriberAG

TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly GUI for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.

3 Reviews

Downloads: 8 This Week

Last Update: 2013-06-12
See Project
16

BookReader

BookReader is a file converter from txt to mp3. Now your computer can read a text file to obtain an audiobook. No speech engine nor voices included.

Downloads: 7 This Week

Last Update: 2017-10-23
See Project
17

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.

Downloads: 7 This Week

Last Update: 2018-05-11
See Project
18

Cotovía

Text-to-Speech System for Galician and Spanish

Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the avaliable speaker voices has its own license. The speakers available at sourceforge are free for commercial and non-commercial uses. Another speaker, free for non-commercial uses, is avaliable through external links (see the Blog section). Cotovia has been developed by the University de Vigo and the center 'Ramón Piñeiro' for Research in Humanities, both in Galicia, Spain. Its development has involved a research group of linguists and engineers. Cotovía has been developed as a research project, therefore most of the work has been focused on the most interesting aspects from a scientific point of view. Although the performance of the whole TTS system is quite good, there are some parts that could be clearly improved. Cotovia files and installing instructions are available at the Files and Git sections.

Downloads: 6 This Week

Last Update: 2018-01-02
See Project
19

FestLang

Project dedicated to Festival voices development

Downloads: 5 This Week

Last Update: 2014-06-09
See Project
20

Automated Attendance System

Automated Attendance System (AAS) uses 2 modes for authentication - * Voice Identification System (VIS) * Fingerprinting Method The algorithms used for the same has been developed by me. This algo is more efficient and faster.

Downloads: 4 This Week

Last Update: 2016-10-13
See Project
21

SingIt Lyric Displayer

The SingIt Lyric Displayer is an XMMS plugin which displays formatted lyrics, including id3v2xx lyrics. It consists of the displayer and an integrated editor which allows one to easily insert time stamps, edit the text, and export & strip HTML.

Downloads: 4 This Week

Last Update: 2013-04-05
See Project
22

Jampal mp3 library

mp3 library, advanced ID3V1 and ID3V2 tagger, player. Organize a large mp3 library, over 40,000 songs. Speech synthesis and tag backup utilities. Scripts to maintain and organize song files.

Downloads: 3 This Week

Last Update: 2015-07-26
See Project
23

QWave

QWave: Qt-based waveform display and audio playback class library.

Downloads: 2 This Week

Last Update: 2013-05-01
See Project
24

flashcards (granule)

GRANULE is a flashcards program based on Leitner cardfile methodology for learning new words. It features long-term memory training capabilities with scheduling, integrated pictures, sound, and full-screen mode.

Downloads: 2 This Week

Last Update: 2012-08-25
See Project
25

AhoTTS - TTS for Basque and Spanish

Text-to-Speech for Basque and Spanish

Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/

1 Review

Downloads: 1 This Week

Last Update: 2022-05-03
See Project