[go: up one dir, main page]

Search Results for "converting transcript to audio"

Showing 96 open source projects for "converting transcript to audio"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • The Easy Way To Build A Referral Program Icon
    The Easy Way To Build A Referral Program

    Referral Factory is the #1 referral software used by SMEs and Marketers.

    Referral Factory offers over 1000 pre-built referral program templates you can use as your own, or you can build your own referral program from scratch. You get unlimited referral campaigns on all plans, and brilliant support from their team of referral marketing experts.
    Learn More
  • 1
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 2
    abogen

    abogen

    Generate audiobooks from EPUBs, PDFs and text with captions

    abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on the go, or for users who prefer audio over reading. The repository supports handling common ebook formats and generating outputs that combine audio plus caption metadata. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    MARS5

    MARS5

    MARS5 speech model (TTS) from CAMB.AI

    ...The model is built to handle prosodically challenging content such as sports commentary, anime dialogue, and other high-energy or highly varied speech patterns with realistic rhythm and intonation. To control speaker identity, MARS5 uses a short reference audio clip, typically between 2 and 12 seconds, from which it learns the voice characteristics. It supports two main inference modes: shallow clone, which is faster and only needs the reference audio, and deep clone, which additionally uses the transcript of the reference audio to increase similarity and naturalness at the cost of more computation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Meetily

    Meetily

    Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper

    This project is a privacy-first AI meeting assistant that captures meeting audio, produces real-time transcripts, and generates summaries while keeping processing entirely on your own machine or infrastructure. It’s built for organizations that want meeting intelligence without sending recordings or transcripts to third-party cloud services, which helps address compliance and data sovereignty requirements. The app supports live transcription with local model options (including Whisper- and Parakeet-based workflows) and presents the transcript as the meeting happens, making it useful both for note-taking and accessibility. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • Dragonfly | An In-Memory Data Store without Limits Icon
    Dragonfly | An In-Memory Data Store without Limits

    Dragonfly Cloud is engineered to handle the heaviest data workloads with the strictest security requirements.

    Dragonfly is a drop-in Redis replacement that is designed for heavy data workloads running on modern cloud hardware. Migrate in less than a day and experience up to 25X the performance on half the infrastructure.
    Learn More
  • 5
    EasyVoice

    EasyVoice

    Open source text-to-speech tool, supports extra-long text

    easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    LARA is software for musical analysis using (new) scientific methods for analysis and visualization. LARA is part of the core research: “Interpretation and performance” of the HSLU – Musik (University of Applied Sciences Luzern – Music depart
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    VoxCPM

    VoxCPM

    TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

    ...Trained on a large 1.8-million-hour bilingual corpus, VoxCPM can infer appropriate speaking style from context, dynamically adjusting intonation, rhythm, and emotional tone. It supports zero-shot voice cloning from a short reference audio clip, capturing timbre, accent, and pacing to closely mimic a target speaker without per-speaker fine-tuning.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    CC2.TV / CC2 - Audio- und TV-Datenbank

    CC2.TV / CC2 - Audio- und TV-Datenbank

    Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV

    Dieses Programm stellt eine Meta-Datenbank-Anwendung für die Audio- und Video-Sendungen des CC2.TV für GNU/Linux Systeme zur Verfügung. Es ermöglicht das Durchsuchen, Verwalten und Abspielen der umfangreichen Inhalte des CC2.TV-Audiocasts und -Videocasts. Ziel ist es, die über 3000 Audiocast-Themen und über 1000 Videocast-Themen, die sich auf Computerthemen, Technik und gesellschaftliche Aspekte konzentrieren, komfortabel zugänglich zu machen. Für die volle Funktionalität,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    VidCoder

    VidCoder

    A Blu-ray, DVD and video file transcoder for Windows

    VidCoder is a Windows-based open-source video transcoding and ripping tool that provides a graphical interface built around standard command-line multimedia tools. It lets users convert video files (or rip DVDs/Blu-rays, when supported) into modern formats and codecs, making it useful for people who want to compress, re-encode, or transcode video content without dealing directly with low-level encoder settings. Because VidCoder integrates and automates the invocation of complex backend...
    Downloads: 1 This Week
    Last Update:
    See Project
  • IT Asset Management (ITAM) Software Icon
    IT Asset Management (ITAM) Software

    Supercharge Your IT Assets, the Easy Way

    Drowning in misplaced IT assets, compliance headaches, and shadow IT? Navigate to clarity with an intuitive IT Asset Management solution. Experience crisp visibility, effortless control, and unshakable security – all while freeing up your budget with optimized software licenses. The best part? It’s easy.
    Learn More
  • 10
    MediaCoder

    MediaCoder

    Universal media transcoding software

    MediaCoder is a universal media transcoding software actively developed and maintained since 2005. It puts together most cutting-edge audio/video technologies into an out-of-box transcoding solution with a rich set of adjustable parameters which let you take full control of your transcoding. New features and latest codecs are added or updated constantly. MediaCoder might not be the easiest tool out there, but what matters here is quality and performance. It will be your swiss army knife for...
    Leader badge">
    Downloads: 911 This Week
    Last Update:
    See Project
  • 11
    WAV-PRG
    WAV-PRG is a program for converting Commodore 64 tapes to PC and back. It is designed not to require any custom-built cables: transfers between PC and tape are done by means of a tape player/recorder connected to the PC's soundcard by a plain audio cable
    Leader badge">
    Downloads: 135 This Week
    Last Update:
    See Project
  • 12
    CDBurnerXP x64

    CDBurnerXP x64

    Free disc burning software for CDs, DVDs, Blu-ray, and ISO images.

    ...You can also create bootable discs and add multiple sessions to a single disc. Audio CD Burning: Create audio CDs from various formats like MP3, WAV, OGG, FLAC, and WMA. The software includes integrated support for gapless audio, track arrangement, and CD-Text for track information. ISO Image Support: CDBurnerXP allows you to create ISO images from your files and burn them to a disc. It also supports converting BIN and NRG files to ISO format.
    Downloads: 42 This Week
    Last Update:
    See Project
  • 13
    Vimer

    Vimer

    Adjust or convert audios and videos just by describing what you want

    Vimer (VIdeo transforMER) is an AI powered GUI for FFmpeg, a cross-platfom and multilanguage app with a hassle-free interface for adjusting audio and video. You just need to choose the files you want to change, describe what you want and let the artificial intelligence take care of the rest, automatically generating and executing the necessary FFmpeg commands. Forget complicated codes. Whether converting formats, adjusting quality, adding effects, adjusting audio, or mixing different media, Vimer offers a simple path to media conversion and editing without the need for advanced technical knowledge. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Ainee

    Ainee

    Ainee - AI Notetaking and Learning Companion

    Ainee is your ultimate AI-powered notetaking and learning companion. Capture lecture notes in real-time and effortlessly transform audio, text, files, and YouTube videos into formatted notes, mindmaps, quizzes, flashcards, podcasts, and more. Explore our AI meeting note taker, AI notes, video transcript generator, PDF to AI converter, and AI flashcard maker. Enhance your learning with our AI voice recorder, article summarizer AI, and AI quiz generator. Additionally, share your knowledge base with others to foster the flow of information and help new users benefit from collective insights. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    footswitch2basic

    footswitch2basic

    Audio Transcription software for Linux (Vlc) with a foot pedal

    Footswitch 2 (Basic) is a media player for transcribers on Linux. This version is a stripped down version of Footswitch2, containing only the absolute essentials for transcription. Written in python and using the python bindings for VLC it allows a transcriber to control the audio or video with a footpedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    N_m3u8DL-CLI

    N_m3u8DL-CLI

    Simple CLI tool to download m3u8 streams to MP4/TS with rich options

    N_m3u8DL-CLI is a cross-platform command-line downloader for m3u8 (HLS) playlists. It converts streams to MP4 or TS and offers rich command-line options. While the original CLI uses .NET Framework (Windows), its successor N_m3u8DL-RE adds true cross-platform support for Windows, Linux, and macOS. Common in media workflows for downloading and converting streaming video.
    Downloads: 33 This Week
    Last Update:
    See Project
  • 17
    JAFG - Just Another FFmpeg GUI
    JAFG or Just Another FFmpeg GUI is an interface to FFmpeg. JAFG allows conversion of audio to audio file, conversion of video to video files. JAFG allows changing of the Audio Bitrate, Audio Sampling Rate, Audio Channels, Video Codec, Video Bitrate, Video Size, Aspect, Framerate. JAFG also allows converting to DVD, DV, VCD, SVCD and can be pal, ntsc, film. JAFG allows capture of screenshots and screen recording, and Youtube downloading.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    YouTube To Mp4 Converter

    YouTube To Mp4 Converter

    Turn youtube video into mp4 highest quality

    YouTube To Mp4 Converter is a free PC software through which you can easily and quickly convert YouTube videos to Mp4 HD. Particularly, it allows you to choose the output high quality of your Mp4 videos such as 720p, 1080p, 1440p, 2160p. This software has no limit to the video size. Anybody can easily install YouTube To Mp4 Converter online free. Using the right software for converting YouTube to Mp4 HD will save you bandwidth as you don’t have to stream the same video. You can keep the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19

    ffmpeg-coder

    A small CLI tool which will help in producing different types of video

    A small cross platform CLI tool which will help in producing different types video converting codes quickly for new ffmpeg CLI users. The tool will make it easy to generate ffmpeg commands for them. If you don't use or know about FFMPEG then this tool might not be for you. This tool is created for those who find it difficult to understand or find the appropriate commands for converting a video as they want using the FFMPEG CLI tool. Also those who are new to Command Lines they can also...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Perl Audio Converter

    Perl Audio Converter

    Linux Audio Converter / Tagger / CD Ripper

    A Linux CLI tool for converting multiple audio types from one format to another. It supports the following audio formats: 3G2, 3GP, 8SVX, AAC, AC3, ADTS, AIFF, AL, AMB, AMR, APE, AU, AVR, BONK, CAF, CDR, CVU, DAT, DTS, DVMS, F32, F64, FAP, FLA, FLAC, FSSD, GSRT, HCOM, IMA, IRCAM, LA, MAT, AUD, MAT4, MAT5, M4A, M4R, MP2, MP3, MP4, MP4A, MPC, MPP, NIST, OFF, OFR, OFS, OPUS, OGA,OGG, PAF, PRC, PVF, RA, RAM, RAW, RF64, SD2, SF, SHN, SMP, SND,SOU, SPX, SRN, TAK, TTA, TXW, VOC, VMS, VQF, W64, WAV, WMA, and WV. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 21
    Linux-Intelligent-Ocr-Solution

    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial :...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 22
    BatchMp4Converter

    BatchMp4Converter

    A batch MPEG-4 video converter powered by FFmpeg

    A simple and easy to use video converter for converting video files to MP4. Conversion is done using FFmpeg. `libx264` and `aac` are used as output video and audio converter to achieve lower output filesize.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    GoodByeCatpcha

    GoodByeCatpcha

    Solver ReCaptcha v2 Free

    An async Python library to automate solving ReCAPTCHA v2 by images/audio using Mozilla's DeepSpeech, PocketSphinx, Microsoft Azure’s, Google Speech and Amazon's Transcribe Speech-to-Text API. Also image recognition to detect the object suggested in the captcha. Built with Pyppeteer for Chrome automation framework and similarities to Puppeteer, PyDub for easily converting MP3 files into WAV, aiohttp for async minimalistic web-server, and Python’s built-in AsyncIO for convenience.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Subsonic
    Subsonic is a web-based media streamer, providing ubiquitous access to your music and video collection. More than 20 apps are available for Android, iPhone, Windows Phone, BlackBerry, Roku, Chumby, Sonos etc. Supports virtually all media formats, converting files on the fly. Also includes a Podcast receiver and jukebox feature allowing you to control what's playing on your computer from your mobile phone.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 25
    CUETools.NET
    Utility for converting between different cue sheet styles and lossless audio formats, with AccurateRip verification. Supports .wav, .ape, .flac and .wv formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next