
Showing 293 open source projects for "voice to text"

  • 1
    React Native Voice

    React Native Voice Recognition library for iOS and Android

    A speech-to-text library for React Native. Manually or automatically link the NativeModule. Drag the Voice.xcodeproj from the @react-native-voice/voice/ios folder to the Libraries group on Xcode in your project. Click on your main project file (the one that represents the .xcodeproj) select Build Phases and drag the static library, lib.Voice.a, from the Libraries/Voice.xcodeproj/Products folder to Link Binary With Libraries.
    Downloads: 0 This Week
  • 2
    VoiceSmith

    [WIP] VoiceSmith makes training text to speech models easy

    VoiceSmith makes it possible to train and infer on both single and multispeaker models without any coding experience. It fine-tunes a pretty solid text-to-speech pipeline, based on a modified version of DelightfulTTS and UnivNet, on your dataset. Both models were pretrained on a proprietary 5000 speaker dataset. It also provides some tools for dataset preprocessing, such as automatic text normalization. It runs on Windows (only CPU supported currently) or any Linux-based operating system. If you want to run...
    Downloads: 0 This Week
  • 3
    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both...
    Downloads: 32 This Week
  • 4
    Quillnote

    Take beautiful markdown notes and stay organized with task lists

    Take beautiful markdown notes whenever you feel inspired. Place them in notebooks and tag them accordingly. Stay organized by making task lists, setting reminders, and keeping everything in one place by attaching related files. Quillnote is fully free and open-source. It will never show you ads, ask you for unnecessary permissions, or upload your notes anywhere without your knowledge.
    Downloads: 4 This Week
  • 5
    qTox

    qTox is a chat, voice, video, and file transfer IM client

    qTox is a cross-platform, secure messaging and video calling client for the Tox peer-to-peer protocol. It offers end-to-end encryption, zero metadata leakage, and no central servers, making it a privacy-focused alternative to mainstream messaging apps. qTox includes rich features like file transfers, screen sharing, and group chats, all built with a user-friendly interface powered by Qt. As a decentralized app, qTox ensures that user communication is secure, private, and independent of...
    Downloads: 27 This Week
  • 6
    Read Aloud

    An awesome browser extension that reads aloud webpage content

    ...Read Aloud lets users choose from multiple voices: built-in browser voices, plus premium cloud voices from providers such as Google Wavenet, Amazon Polly, IBM Watson, and Microsoft. It offers simple controls through a toolbar button, right-click menu, and keyboard shortcuts for play, pause, stop, rewind, and fast forward. Users can customize voice, speed, pitch, and text highlighting behavior directly from the extension’s settings panel.
    Downloads: 0 This Week
  • 7
    Mycroft

    Mycroft Core, the Mycroft Artificial Intelligence platform

    Mycroft is the world’s leading open source voice assistant. It is private by default and completely customizable. Our software runs on many platforms, on desktop, our reference hardware, a Raspberry Pi, or your own custom hardware. Our open-source, modular system can be ported to your device or environment, at any price point. Whether you make voice-assistants, televisions, or microwaves. Whether you have a 5-room BnB or a 1000-room hotel. Your customers will get access to all the...
    Downloads: 53 This Week
  • 8
    Project Alice

    Main repository of Project Alice, contains main unit source code

    Project Alice is a smart voice home assistant that is completely modular and extensible. It was first built around Snips and therefore runs entirely offline and never sends or shares your voice interactions with anyone. Project Alice guarantees your privacy in your home or wherever you’re using it. However, because it is built on top of Snips, Project Alice can optionally be configured to use some online alternatives and fallbacks (for example, Amazon’s or Google’s Text to Speech engines), just like Snips. ...
    Downloads: 0 This Week
  • 9
    Mocking Bird

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    MockingBird is an open-source voice cloning and real-time speech generation toolkit that lets you clone a speaker’s voice from a short audio sample (reportedly as little as 5 seconds) and then synthesize arbitrary speech in that voice. It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English....
    Downloads: 0 This Week
  • 10
    Parakeet

    PAddle PARAllel text-to-speech toolKIT

    PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN). Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-speech toolkit for the open-source community. It is built on PaddlePaddle dynamic graph and includes many influential TTS models. To make it easier to use the existing TTS models directly and to develop new ones, Parakeet selects typical models and provides...
    Downloads: 6 This Week
  • 11
    VoiceFixer

    General Speech Restoration

    VoiceFixer is a machine-learning framework for “speech restoration”: given a degraded or distorted audio recording — with noise, clipping, low sampling rate, reverberation, or other artifacts — it attempts to recover high-fidelity, clean speech. The architecture works in two stages: first an analysis stage that tries to extract “clean” intermediate features from the noisy audio (e.g. removing noise, denoising, dereverberation, upsampling), and then a neural vocoder-based synthesis stage that...
    Downloads: 11 This Week
  • 12

    Free Screen Recorder for Windows 10

    Effortlessly record and capture pc screen with this screen recorder

    Many people use screen recording to produce tutorials, record gameplay, or capture a video call. There are numerous solutions available, each with its features and benefits. And you're in luck if you're browsing for a free and trustworthy screen recorder for Windows 10 because this software is one of Windows's best free screen recorders. Indeed, professional streamers, YouTubers, and gamers worldwide use this powerful software. It allows you to seamlessly record your screen, webcam, and...
    Downloads: 12 This Week
  • 13
    TensorFlowTTS

    Real-Time State-of-the-art Speech Synthesis for Tensorflow 2

    ...With integrated vocoder + mel-spectrogram generation pipelines, pre-trained models, and fairly flexible architecture, TensorFlowTTS is a great off-the-shelf and extensible TTS engine for applications ranging from voice assistants to content generation or accessibility tools.
    Downloads: 0 This Week
  • 14
    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models....
    Downloads: 0 This Week
  • 15

    Flatpak - Discord

    Discord's flatpak as an RPM package.

    Discord is a voice and text chat for gamers. Text chat supports inline images and videos. Voice chat includes a jitter buffer, automatic gain control, noise suppression, and echo cancellation. Server-to-client communications are encrypted.
    Downloads: 0 This Week
  • 16
    QKSMS

    A beautiful SMS messenger for Android

    Fall in love with texting all over again with QKSMS! QKSMS is a beautiful, simple and customizable open source replacement for the stock messaging app on Android. It's got a clean, clutter-free, intuitive design that lets you focus on what matters, plus millions of colors to choose from to make the theme of the entire app truly your own. QKSMS has some powerful features, including photo sharing, group chat, conversation blocking, message backup and restore, and more. It's got everything you...
    Downloads: 4 This Week
  • 17
    Trigger Rally

    A fast-paced free software rally racing game

    A 3D rally simulation with a great physics engine for drifting, over 200 maps, different terrain materials like dirt, asphalt, sand, ice etc. and various weather, light and fog conditions give this rally simulation the edge over many other free games. You need to make it through the maps in often tight time limits and can further improve by beating the recorded high scores. All attached single races must be finished in time in order to win an event, unlocking additional events and cars. Most...
    Downloads: 59 This Week
  • 18
    PaddlePaddle models

    Pre-trained and Reproduced Deep Learning Models

    Pre-trained and reproduced deep learning models: the official "Flying Paddle" model library, covering deep learning models from the academic frontier and models verified in industrial scenarios. Flying Paddle's industrial-level model library includes a large number of mainstream models that have been refined through long industrial practice, as well as models that have won championships in international competitions; it provides many scenarios for semantic understanding, image classification,...
    Downloads: 0 This Week
  • 19
    English2Cards

    English Learning software

    English2Cards is a smart English learning program designed to improve your listening and speaking skills without an Internet connection, using great learning files. The program repeatedly reviews the educational cards at different times so that you can remember the new words and phrases you have learned and use them in real conversations easily and without thinking. During learning you can remember the words in the text and also get the translation, pronunciation and examples of...
    Downloads: 0 This Week
  • 20
    Multilingual Speech Synthesis

    An implementation of Tacotron 2 that supports multilingual experiments

    This repository provides synthesized samples, training and evaluation data, source code, and parameters for the paper "One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech". It contains an implementation of Tacotron 2 that supports multilingual experiments and that implements different approaches to encoder parameter sharing. It presents a model combining ideas from "Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning", "End-to-End Code-Switched TTS with Mix of Monolingual Recordings", and "Contextual Parameter Generation for Universal Neural Machine Translation". ...
    Downloads: 0 This Week
  • 21
    The SARK Voice Pad

    A text editor with narration and speech recognition

    A fully featured advanced text editor
    Downloads: 0 This Week
  • 22
    LibreGrammar (LanguageTool fork)

    Free and open-source style and grammar editor

    This is a free and open-source style and grammar checker forked from LanguageTool. This project intends to be a full-blown software editor for French, English, German, Spanish, Portuguese and many other languages. It enhances LanguageTool with extended rules (with an emphasis on style rules) and enables many rules that are disabled by default in the LanguageTool project. Other changes include lower reliance on online mechanisms and extra languages.
    Downloads: 6 This Week
  • 23
    Olivia

    Your new best friend powered by an artificial neural network

    Olivia is an open-source chatbot built in Golang using Machine Learning technologies. Its goal is to provide a free and open-source alternative to big services like DialogFlow. You can chat with her by speaking (STT) or writing; she replies with a text message, but you can enable her voice (TTS). Olivia starts listening when you say “Hey Olivia” or click the central button. She speaks to reply to you unless you've disabled her voice. Olivia respects your privacy. All the data used by Olivia is saved in your client. Olivia is organized in modules to facilitate the addition of new capabilities. ...
    Downloads: 1 This Week
  • 24
    chatbot_chung
    chatbot chung is a simple entertainment chatbot, based on a keyword probability algorithm, with 3D talking OpenGL avatars, written in FreeBASIC. It can import AIML data (simple question/answer, question/random answers, or single-star/multi-srai) saved from the "AIML_chung" open source application. An online HTML5/JavaScript version with automatic detection of 44 languages is available on the website (source included in the zip file). A SORT gentext text generation algorithm option has been added (desktop version).
    Downloads: 0 This Week
  • 25
    Snips NLU

    Snips Python library to extract meaning from text

    Snips NLU is a Natural Language Understanding Python library that lets you parse sentences written in natural language and extract structured information. It’s the library that powers the NLU engine used in the Snips Console, which you can use to create awesome and private-by-design voice assistants. The exact output is a bit richer; the point here is to give a glimpse of what kind of information can be extracted. Behind every chatbot and voice assistant lies a common piece of technology:...
    Downloads: 2 This Week