Best Open Source Mac Speech Software 2026

Speech Software for Mac


Browse free open source Speech software and projects for Mac below.

1

Buzz

Transcribe and translate audio offline on your personal computer

Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API.

Get Linux versions from:
- https://flathub.org/apps/io.github.chidiwilliams.Buzz
- https://snapcraft.io/buzz

Home page of Buzz: https://github.com/chidiwilliams/buzz

Note for Windows: The app is not signed, so you will get a warning when you install it. Select More info -> Run anyway.

Downloads: 8,381 This Week

Last Update: 2026-01-04
See Project
2

NoiseGator (Noise Gate)

A simple noise gate app intended for use with VOIPs like Skype.

Ever wanted to cut out background noise when talking with others on Skype? Now it's possible! NoiseGator is a lightweight noise gate application that routes audio from an audio input to an audio output. The audio level is analysed in real time: if the average level is higher than the threshold, the audio passes through as normal; if the average level goes below the threshold, the gate closes and the audio is cut. When used with a virtual audio cable, it can act as a noise gate for either a sound input (microphone) or a sound output (speakers). It can also be used to gate noise from your own mic or to play your microphone through your speakers.

REQUIREMENTS:
- Java 7 or higher for Windows.
- Java 6 or higher for Mac. Java 7 recommended.
- A virtual audio cable is required for use with VOIPs: for Windows users I recommend the VB-Cable driver (http://vb-audio.pagesperso-orange.fr/Cable/index.htm). Mac users can use SoundFlower.
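The gating rule in the NoiseGator description (pass audio while the average level stays above a threshold, cut it otherwise) can be illustrated with a minimal sketch. This is not NoiseGator's code, which is a Java application; the function name `noise_gate`, the fixed `window` size, and the choice to treat a level exactly equal to the threshold as "open" are all assumptions made for illustration.

```python
def noise_gate(samples, threshold, window=4):
    """Silence every window whose average absolute level is below threshold.

    samples   -- sequence of float amplitudes (hypothetical input format)
    threshold -- average absolute level needed to keep the gate open
    window    -- number of samples averaged per decision (assumed value)
    """
    if window <= 0:
        raise ValueError("window must be a positive integer")

    gated = []
    for start in range(0, len(samples), window):
        block = list(samples[start:start + window])
        # Average level of this block, ignoring the sign of each sample.
        level = sum(abs(s) for s in block) / len(block)
        if level >= threshold:
            gated.extend(block)                # gate open: audio passes unchanged
        else:
            gated.extend([0.0] * len(block))   # gate closed: audio is cut
    return gated


if __name__ == "__main__":
    speech_then_hiss = [0.5, -0.6, 0.4, -0.5, 0.01, -0.02, 0.01, 0.0]
    print(noise_gate(speech_then_hiss, threshold=0.1))
```

A real-time gate would typically also smooth the opening and closing of the gate to avoid clicks; the sketch leaves that out to keep the threshold rule visible.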

7 Reviews

Downloads: 560 This Week

Last Update: 2016-11-08
See Project
3

WaveSurfer

WaveSurfer is an open source tool for sound visualization and manipulation. Typical applications are speech/sound analysis and sound annotation/transcription. WaveSurfer may be extended by plug-ins as well as embedded in other applications.

15 Reviews

Downloads: 235 This Week

Last Update: 2020-05-07
See Project
4

RHVoice

Free open source speech synthesizer for Russian and other languages

RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.

Downloads: 21 This Week

Last Update: 2024-07-04
See Project
5

Mumble

Low-latency, high quality voice chat for gamers

Mumble is an open source, low-latency, high quality voice chat software primarily intended for use while gaming. It includes game linking, so voice from other players comes from the direction of their characters, and has echo cancellation so the sound from your loudspeakers won't be audible to other players.

169 Reviews

Downloads: 92 This Week

Last Update: 2022-01-22
See Project
6

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. It can recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file, show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details), and listen to a microphone in the background, among other recognizer features. The easiest way to install it is with pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+, which is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
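As a rough illustration of the file-transcription and ambient-noise calibration features mentioned for SpeechRecognition, here is a minimal sketch. The helper name `transcribe_file` and its `calibrate_seconds` parameter are hypothetical. It assumes the library is installed (pip install SpeechRecognition) and, because it picks the offline Sphinx engine, that the separate pocketsphinx package is installed as well; other engines supported by the library could be substituted.

```python
def transcribe_file(path, calibrate_seconds=0.5):
    """Transcribe an audio file offline; return None if speech is unintelligible.

    Hypothetical helper, not part of the SpeechRecognition library itself.
    """
    # Imported lazily so that this module loads even without the library.
    import speech_recognition as sr  # third-party: pip install SpeechRecognition

    recognizer = sr.Recognizer()
    with sr.AudioFile(path) as source:
        # Calibrates recognizer.energy_threshold from the start of the audio.
        # Note: this consumes the first calibrate_seconds of the file.
        recognizer.adjust_for_ambient_noise(source, duration=calibrate_seconds)
        audio = recognizer.record(source)  # read the rest of the file

    try:
        # Offline engine; assumes the pocketsphinx package is available.
        return recognizer.recognize_sphinx(audio)
    except sr.UnknownValueError:
        return None
```

Calling `transcribe_file("speech.wav")` (a placeholder file name) would return the recognized text. Microphone input follows the same pattern with `sr.Microphone()` as the source, which is where the PyAudio requirement comes in.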

Downloads: 19 This Week

Last Update: 2025-12-31
See Project
7

Open JTalk

Open JTalk is a Japanese text-to-speech synthesis system. This software is released under the Modified BSD license.

Downloads: 485 This Week

Last Update: 2018-12-25
See Project
8

FreeTTS

FreeTTS is a speech synthesis engine written entirely in the Java(tm) programming language. FreeTTS was written by the Sun Microsystems Laboratories Speech Team and is based on CMU's Flite engine. FreeTTS also includes a partial JSAPI 1.0

5 Reviews

Downloads: 210 This Week

Last Update: 2017-04-11
See Project
9

MMDAgent

MMDAgent is a toolkit for building voice interaction systems. Users can design their own dialog scenarios, 3D agents, and voices. This software is released under the Modified BSD license.

7 Reviews

Downloads: 104 This Week

Last Update: 2022-01-13
See Project
10

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper, and it uses Google's TensorFlow to make the implementation easier. A pre-trained English model for speech-to-text, along with other important inference material, can be downloaded from the DeepSpeech releases page by following the instructions in the usage docs.

Downloads: 14 This Week

Last Update: 2021-04-08
See Project
11

hts_engine

hts_engine is software to synthesize speech waveforms from HMMs trained by the HMM-based speech synthesis system (HTS). This software is released under the Modified BSD license.

Downloads: 231 This Week

Last Update: 2016-12-25
See Project
12

TTS

Deep learning for text to speech

TTS is a library for advanced Text-to-Speech generation. Built on the latest research, it was designed to achieve the best trade-off among ease-of-training, speed, and quality. TTS comes with pre-trained models, tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, Tensorflow and TFLite. Tools to curate Text2Speech datasets under dataset_analysis. Demo server for model testing. Notebooks for extensive model benchmarking. Modular (but not too much) code base enabling easy testing for new ideas. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). If you are only interested in synthesizing speech with the released TTS models, installing from PyPI is the easiest option.

Downloads: 7 This Week

Last Update: 2021-10-18
See Project
13

Speech Signal Processing Toolkit (SPTK)

SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.

9 Reviews

Downloads: 24 This Week

Last Update: 2023-05-10
See Project
14

Voxal voice changer

Transform your voice in real time with Voxal Voice Changer

Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content, we have the perfect voice for you. Audio can be captured from various sources, pre-listening is available, and the most popular audio formats are supported.

1 Review

Downloads: 43 This Week

Last Update: 2025-11-16
See Project
15

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. It offers multiple possible transcripts, each with an associated confidence score. With Coqui text-to-speech, experience the immediacy of script-to-performance: production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems in post. With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs.

Downloads: 2 This Week

Last Update: 2022-09-03
See Project
16

Transcriber

A tool for segmenting, labeling and transcribing speech

3 Reviews

Downloads: 31 This Week

Last Update: 2017-03-01
See Project
17

DonnerLaParole

Virtual keyboard and speech synthesizer for people who can no longer speak and have difficulty using their hands. Available in French and English.

6 Reviews

Downloads: 7 This Week

Last Update: 2013-04-15
See Project
18

OpenOffice.org Export As DAISY

odt2daisy is an OpenOffice.org Writer extension that exports documents to DAISY XML, Full DAISY (XML + audio), and audiobook formats. DAISY is a NISO Z39.86 standard for blind, visually impaired, print-disabled, and learning-disabled people.

3 Reviews

Downloads: 8 This Week

Last Update: 2020-12-07
See Project
19

Moshi

A speech-text foundation model for real time dialogue

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. At inference, the stream from the user is taken from the audio input, and the one for Moshi is sampled from the model's output. Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.
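The Mimi figures quoted for Moshi are consistent with each other, which a few lines of arithmetic make explicit: a 12.5 Hz frame rate implies the stated 80 ms frame, and dividing the bitrate by the frame rate gives the bits carried per frame. The function names below are hypothetical helpers, and the per-frame sample and bit counts are derived here from the quoted numbers rather than stated in the description.

```python
def frame_duration_ms(frame_rate_hz):
    """Length of one codec frame in milliseconds."""
    return 1000 / frame_rate_hz


def samples_per_frame(sample_rate_hz, frame_rate_hz):
    """Number of input audio samples summarized by one frame."""
    return sample_rate_hz / frame_rate_hz


def bits_per_frame(bitrate_bps, frame_rate_hz):
    """Bits available to encode one frame at the given bitrate."""
    return bitrate_bps / frame_rate_hz


if __name__ == "__main__":
    # Mimi, as quoted: 24 kHz audio, 12.5 Hz frames, 1.1 kbps.
    print("Mimi frame length (ms):", frame_duration_ms(12.5))
    print("Mimi samples per frame:", samples_per_frame(24_000, 12.5))
    print("Mimi bits per frame:", bits_per_frame(1_100, 12.5))
    # Codecs named for comparison, using only the quoted rates.
    print("SpeechTokenizer bits per frame:", bits_per_frame(4_000, 50))
    print("SemantiCodec bits per frame:", bits_per_frame(1_300, 50))
```

Read this way, the 80 ms latency in the description is simply one frame: the codec cannot emit a frame until the audio it covers has arrived.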

Downloads: 1 This Week

Last Update: 2024-11-05
See Project
20

yaafe

Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.

1 Review

Downloads: 13 This Week

Last Update: 2016-02-25
See Project
21

sourcesinc

Source code from the Research Institute for Signals, Systems and Computational Intelligence http://fich.unl.edu.ar/sinc

Downloads: 15 This Week

Last Update: 2023-12-05
See Project
22

Java Speech API

Wrapper for vendors to simplify usage of the Java Speech API (JSR 113). Note that the spec is an untested early access and that there may be changes in the API.

2 Reviews

Downloads: 5 This Week

Last Update: 2014-12-12
See Project
23

Jampal mp3 library

mp3 library, advanced ID3V1 and ID3V2 tagger, player. Organize a large mp3 library, over 40,000 songs. Speech synthesis and tag backup utilities. Scripts to maintain and organize song files.

Downloads: 13 This Week

Last Update: 2015-07-26
See Project
24

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for the Julius speech decoder. Its aim is to give a wider community of speech recognition enthusiasts access to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc.). All of the models are based on HTK modelling software and data sets available freely on the Internet.

Downloads: 12 This Week

Last Update: 2018-05-11
See Project
25

Sinsy

HMM-based singing voice synthesis system

Sinsy is an HMM-based singing voice synthesis system. This software is released under the Modified BSD license.

4 Reviews

Downloads: 5 This Week

Last Update: 2016-03-23
See Project