audio synthesis free download

Showing 578 open source projects for "audio synthesis"

1

Sonic Pi

Sonic Pi is your free code-based music creation and performance tool

... Pi is a live-coding music synthesis tool used to create and perform music interactively through code.

Downloads: 22 This Week

Last Update: 2025-06-26
See Project
2

Lobe Chat

An open-source, modern-design AI chat framework

LobeChat, unlock the superpower of your brain. Pioneering the new age of thinking and creating. Built for you, the Super Individual. LobeChat supports file upload and knowledge base functionality. You can upload various types of files including documents, images, audio, and video, as well as create knowledge bases, making it convenient for users to manage and search for files. Additionally, you can utilize files and knowledge base features during conversations, enabling a richer dialogue...

Downloads: 37 This Week

Last Update: 6 hours ago
See Project
3

FluidSynth

Software synthesizer based on the SoundFont 2 specifications

FluidSynth is a real-time software synthesizer based on the SoundFont 2 specifications and has reached widespread distribution. FluidSynth itself does not have a graphical user interface, but due to its powerful API several applications utilize it and it has even found its way onto embedded systems and is used in some mobile apps.

Downloads: 14 This Week

Last Update: 5 days ago
See Project
4

bfxr

Flash + AIR sound effects generator. Based on Sfxr.

The bfxr project by increpare is a sound-effects generator tool originally built using Flash + AIR, based on the earlier Sfxr project. Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom sound...

Downloads: 24 This Week

Last Update: 6 days ago
See Project
5

SuperCollider

Audio server, programming language, and IDE for sound synthesis

SuperCollider is a platform for audio synthesis and algorithmic composition, used by musicians, artists, and researchers working with sound. It is free and open source software available for Windows, macOS, and Linux. scsynth, a real-time audio server, forms the core of the platform. It features 400+ unit generators (“UGens”) for analysis, synthesis, and processing. Its granularity allows the fluid combination of many known and unknown audio techniques, moving between additive and subtractive...

Downloads: 4 This Week

Last Update: 2025-07-26
See Project
6

Kitten TTS

State-of-the-art TTS model under 25MB

KittenTTS is an open-source, ultra-lightweight, and high-quality text-to-speech model featuring just 15 million parameters and a binary size under 25 MB. It is designed for real-time CPU-based deployment across diverse platforms. Ultra-lightweight, model size less than 25MB. CPU-optimized, runs without GPU on any device. High-quality voices, several premium voice options available. Fast inference, optimized for real-time speech synthesis.

Downloads: 17 This Week

Last Update: 2025-08-08
See Project
7

Overtone

Collaborative programmable music

Overtone is an open-source audio environment designed to explore new musical ideas from synthesis and sampling to instrument building, live coding and collaborative jamming. We combine the powerful SuperCollider audio engine with Clojure, a state-of-the-art Lisp, to create an intoxicating interactive sonic experience. Synchronize your visuals and noise with ease. Overtone features seamless integration with both Quil, a Clojure front-end to Processing, and ShaderTone, a Clojure version...

Downloads: 5 This Week

Last Update: 2024-11-07
See Project
8

Faust

Functional programming language for signal processing

Faust (Functional Audio Stream) is a functional programming language for sound synthesis and audio processing with a strong focus on the design of synthesizers, musical instruments, audio effects, etc. Faust targets high-performance signal processing applications and audio plug-ins for a variety of platforms and standards. The core component of Faust is its compiler. It can "translate" any Faust digital signal processing (DSP) specification to a wide range of non-domain specific languages...

Downloads: 3 This Week

Last Update: 6 days ago
See Project
9

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, and YouTube videos, as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization...

Downloads: 2 This Week

Last Update: 2024-11-16
See Project
10

AudioKit

Swift audio synthesis, processing, & analysis platform

AudioKit is an entire audio development ecosystem of code repositories, packages, libraries, algorithms, applications, playgrounds, tests, and scripts, built and used by a community of audio programmers, app developers, engineers, researchers, scientists, musicians, gamers, and people new to programming. An important goal for AudioKit is to allow it to grow and be maintainable by a handful of volunteers. For this reason we have extensive tests that are run whenever changes are made to any...

Downloads: 3 This Week

Last Update: 2025-03-11
See Project
11

elevenlabs-api

elevenlabs-api is an open source Java wrapper around the ElevenLabs

Elevenlabs-api is an open-source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API. Compiled JARs are available via the Releases tab. To access your ElevenLabs API key, go to the official website, where you can view your xi-API-key under the 'Profile' tab. To set up your ElevenLabs API key, you must register it with the ElevenLabsAPI Java API. To keep public repositories secure, you should store your API key in an environment variable, or external from your...

Downloads: 2 This Week

Last Update: 2023-12-25
See Project
12

Qwen2.5-Omni

Capable of understanding text, audio, vision, video

...-of-the-art performance in many multimodal benchmarks, particularly spoken language understanding, audio reasoning, image/video understanding, etc. Very strong benchmark performance across modalities (audio understanding, speech recognition, image/video reasoning) and often outperforming or matching single-modality models at a similar scale. Real-time streaming responses, including natural speech synthesis (text-to-speech) and chunked inputs for low latency interaction.

Downloads: 1 This Week

Last Update: 2025-09-23
See Project
13

GLM-4-Voice

GLM-4-Voice | End-to-End Chinese-English Conversational Model

GLM-4-Voice is an open-source speech-enabled model from ZhipuAI, extending the GLM-4 family into the audio domain. It integrates advanced voice recognition and generation with the multimodal reasoning capabilities of GLM-4, enabling smooth natural interaction via spoken input and output. The model supports real-time speech-to-text transcription, spoken dialogue understanding, and text-to-speech synthesis, making it suitable for conversational AI, virtual assistants, and accessibility...

Downloads: 2 This Week

Last Update: 6 days ago
See Project
14

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai

MeloTTS is an open-source text-to-speech (TTS) system that generates natural-sounding speech from text input. It utilizes advanced machine-learning models to produce high-quality audio outputs.

Downloads: 0 This Week

Last Update: 2025-01-06
See Project
15

Seamless Communication

Foundational Models for State-of-the-Art Speech and Text Translation

Seamless Communication is a research project focused on building more integrated, low-latency multimodal communication between humans and AI agents. The motivation is to move beyond “text in, text out” and enable direct, live, multi-turn exchange involving language, gesture, gaze, vision, and modality switching without user friction. The system architecture includes a real-time multimodal signal pipeline for audio, video, and sensor data, a dialog manager that can decide when to act (speak...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
16

Audio Satanifier 666

Easily apply cool gnarly voice filters to your audio files

Transform pure, innocent audio files (speech, music, etc.) into unholy demonic abominations. Audio Satanifier 666 is a fun, easy-to-use browser-based tool forged in the pits of hell, for voice actors, musicians, and sound designers, for memes and creative projects, or for anyone else who wants to twist their sound into something absolutely diabolical! Layperson friendly: you'll be able to apply cool effects to your audio file even if you know nothing about audio engineering. There's also a Mini...

Downloads: 0 This Week

Last Update: 2025-07-27
See Project
17

Hydrogen

An advanced drum machine with pattern-based programming

Hydrogen is an advanced drum machine for GNU/Linux, Windows and Mac OS X. Its main goal is to provide professional yet simple and intuitive pattern-based drum programming.

65 Reviews

Downloads: 1,145 This Week

Last Update: 2025-07-29
See Project
18

VALL-E

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems. VALL...

Downloads: 7 This Week

Last Update: 2023-04-14
See Project
19

UTAU

Singing voice synthesis tool

UTAU is singing voice synthesis software created for Windows. It can assemble a song from a sampled audio library, and you can create your own audio library (original sound file set) in addition to the included audio. A large number of audio libraries created for UTAU are currently available on the internet, and by downloading and using them, you can create songs sung in your favorite voices.

1 Review

Downloads: 165 This Week

Last Update: 2023-10-09
See Project
20

Munt

A multi-platform software synthesiser emulating (somewhat inaccurately) pre-GM MIDI devices such as the Roland MT-32, CM-32L, CM-64 and LAPC-I. In no way endorsed by or affiliated with Roland Corp. Source hosted at: http://www.github.com/munt/

12 Reviews

Downloads: 221 This Week

Last Update: 2025-09-22
See Project
21

Simple TTS Reader

Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. You have full control over the voice output: simply select your preferred speech engine from those installed on your system, such as Microsoft Zira. The application can also be minimized to the system tray.

4 Reviews

Downloads: 92 This Week

Last Update: 5 days ago
See Project
22

Pure Data Computer Music System

The Pure Data (Pd for short) project on SourceForge unifies the extensions (or externals) written for the Pure Data Graphical Computer Music System, which was written by Miller Puckette with contributions from many others.

37 Reviews

Downloads: 88 This Week

Last Update: 2025-07-30
See Project
23

Qsynth

A fluidsynth Qt GUI Interface

Qsynth is a fluidsynth GUI front-end application, written in C++ around the Qt framework, using Qt Designer. Eventually it may evolve into a softsynth management application allowing the user to control and manage a variety of command line softsynths.

3 Reviews

Downloads: 75 This Week

Last Update: 2025-10-02
See Project
24

Snd

Snd is a sound editor

2 Reviews

Downloads: 44 This Week

Last Update: 4 hours ago
See Project
25

eGuideDog free software for the blind

The eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 65 This Week

Last Update: 2025-10-03
See Project