[go: up one dir, main page]

Showing 51 open source projects for "transcribe mp4 to text"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Yeastar: Business Phone System and Unified Communications Icon
    Yeastar: Business Phone System and Unified Communications

    Go beyond just a PBX with all communications integrated as one.

    User-friendly, optimized, and scalable, the Yeastar P-Series Phone System redefines business connectivity by bringing together calling, meetings, omnichannel messaging, and integrations in one simple platform—removing the limitations of distance, platforms, and systems.
    Learn More
  • 1
    Concordia

    Concordia

    Crowdsourcing platform for full text transcription and tagging

    Concordia is a platform for crowdsourcing transcription and tagging of text in digitized images. It was developed by the Library of Congress so that volunteers of all backgrounds could transcribe and tag digitized images of manuscripts and typed materials from the Library’s collections that could not otherwise be done by optical character recognition.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Whishper

    Whishper

    Transcribe any audio to text, translate and edit subtitles 100% locall

    Open-source, local-first audio transcription and subtitling suite with a simple web UI. Thanks to open-source technologies, Whishper can run 100% offline. Your data never leaves your computer. Whishper allows you to translate your transcriptions to and from more than 60 languages thanks to Argos Translate and LibreTranslate. Download the transcriptions in many formats (json, txt, vtt, srt). Easily edit your subtitles right in the Web-UI.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 3
    Transcripciones con Whisper Esta aplicación de escritorio basada en web permite transcribir (o transcribir y traducir al ingles), archivos de audio o video utilizando el modelo Whisper de OpenAI. Transcriptions with Whisper This web-based desktop application allows you to transcribe—or both transcribe and translate into English—audio or video files using OpenAI's Whisper model.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Memvid

    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 7 This Week
    Last Update:
    See Project
  • All-in-One Inspection Software Icon
    All-in-One Inspection Software

    flowdit is a connected worker platform tailored for industry needs in commissioning, quality, maintenance, and EHS management.

    Optimize Frontline Operations: Elevate Equipment Uptime, Operational Excellence, and Safety with Connected Teams and Data, Including Issue Capture and Corrective Action.
    Learn More
  • 5
    Handy STT

    Handy STT

    A free, open source, and extensible speech-to-text application

    Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. ...
    Downloads: 48 This Week
    Last Update:
    See Project
  • 6
    Bootleg Text Slicer

    Bootleg Text Slicer

    Text transcription & slicing tool with visual timeline and WAV output.

    ... - Record the timeline position, along with the global and per‑word timing offsets for each exported word, into a cutTemplate.txt file so that the individual words can later be played using only the source audio file. GitHub repository: https://github.com/Northstrix/bootleg-text-slicer Successfully tested with English and Italian audio files. Both scripts work, but I wouldn’t advise you to use Bootleg Text Slicer V2.py to transcribe more than 60–90 seconds at a time. Otherwise, its UI might become laggy. You can easily adjust the transcription duration by moving the start and end sliders below the timeline.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Vibe

    Vibe

    Transcribe on your own

    Vibe is an open-source project by thewh1teagle designed to deliver a collaborative and interactive social application experience, though its specifics depend on its evolving community scope; its development often focuses on connecting users through dynamic features that can include chat, shared spaces, and immersive interactions. The repository typically includes backend logic, frontend integration, and real-time communication stacks to support live user engagement, performance...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 8
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 9
    Screenity

    Screenity

    The most powerful screen recorder & annotation tool for Chrome

    ...Annotate your screen to give feedback, emphasize your clicks, edit your recording, and much more. Make unlimited recordings of your tab, desktop, any application, and camera. Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show controls only on hover, and many other customization options. Export as mp4, gif, and webm, or save the video directly to Google Drive. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • The all-in-one Omnichannel Experience Management Platform Icon
    The all-in-one Omnichannel Experience Management Platform

    Do more than just Surveys.

    Build conversational surveys of any type, for any purpose, in any language. Get 40% more responses.
    Learn More
  • 10
    Translate-Subtitle-File

    Translate-Subtitle-File

    Subtitle Creation Assistant

    Subtitle group machine translation assistant - [Function 1: Translate subtitle file] .srt .ass .vtt [Function 2: Voice to text] (Drag in video or audio to recognize subtitles) (The latest version v4.1.0 Update time 2021 2 May 23) 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. (6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon ) Advantages: 1. You can use multiple service...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    Google AI Edge Gallery

    Google AI Edge Gallery

    A gallery that showcases on-device ML/GenAI use cases

    Gallery is a curated collection of on-device machine learning examples, demo apps, and model artifacts designed to help developers experiment with and deploy ML at the edge. The project bundles runnable samples that show how to run TensorFlow Lite/Edge TPU models (and similar lightweight runtimes) on mobile and embedded platforms, demonstrating common tasks like image classification, object detection, audio recognition, and pose estimation. Each sample is intended to be both a learning aid...
    Downloads: 56 This Week
    Last Update:
    See Project
  • 12
    Buzz

    Buzz

    Transcribe and translate audio offline on your personal computer

    Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for...
    Leader badge">
    Downloads: 3,922 This Week
    Last Update:
    See Project
  • 13
    Amical

    Amical

    Open Source AI Dictation App

    Amical is an open source, AI-powered desktop dictation and note-taking application that enables users to dictate hands-free, transcribe meetings, and capture notes effortlessly with unmatched speed, accuracy, and privacy. It leverages both local and cloud-based AI models, letting users seamlessly switch between providers for the ideal balance of speed, precision, and control, and understands the context of each app in use to automatically format text in a tone and style appropriate to the platform. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Whisper Batch Transcriber

    Whisper Batch Transcriber

    Unlimited, private and free Speech-To-Text program

    ## About: Automatically transcribe all of your voice recordings into clean, organized, neat text files. It's free, fully automated, unlimited, using state-of-the-art speech-to-text technology. Works 100% offline on your computer, privately and locally. ## Usecases: Convert speeches, podcasts, webinars, monologues, storytellings and other audio speech into a formatted .txt file.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    DeepSeek AIO

    DeepSeek AIO

    Access and use all DeepSeek AI models in one program.

    DeepSeek AIO is a simple program that allows you to interact with all DeepSeek large language models in one place. It supports text-based chats, data analysis, code generation, language translation, and more. The program is designed to make it easy for users to use DeepSeek's AI tools for different purposes without switching between multiple platforms.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    Morse Key Express

    Morse Key Express

    Converts text to Morse sounds

    This is a simple application that converts text to Morse code and audio. It has minimal dependencies and is very lightweight. It does not yet offer the ability to transcribe Morse code and save the output to an audio file. It's a useful tool for amateur radio operators and radio operators. https://github.com/shampuan/morse-key-express
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Whisper-Transcriber-Tool

    Desktop application that converts video files into accurate text

    WhisperTranscriber is a free, powerful desktop application that converts video files into accurate text using OpenAI's Whisper AI model. Perfect for journalists, researchers, students, content creators, and anyone who needs reliable transcription. KEY FEATURES: - High-accuracy AI transcription with 99+ language support - Works completely offline - no internet required, total privacy - Supports all common video formats (mp4) - Batch processing for multiple files - Automatic language detection - Drag & drop interface - Export as SRT formats - No file size limits PORTABLE VERSION: - No installation needed - Run from USB or any folder - FFmpeg and AI models included - Lightweight and fast WHY CHOOSE WHISPERTRANSCRIBER: ✓ 100% free forever - no subscriptions or hidden costs ✓ Complete privacy - all processing happens on your computer ✓ No account or registration required ✓ Professional-grade accuracy ✓ Works offline
    Downloads: 10 This Week
    Last Update:
    See Project
  • 18
    BWR Ai watermark remover

    BWR Ai watermark remover

    AI-powered tool to quickly remove watermarks from videos flawlessly

    Blue Wave Remover is an advanced AI-driven video watermark removal software designed to effortlessly eliminate logos, text, timestamps, and watermarks from video content. Utilizing cutting-edge computer vision and generative AI algorithms, it accurately detects and removes both static and moving watermarks while preserving the original video's quality, colors, and clarity. The program supports popular video formats and offers batch processing for fast and efficient removal on multiple files....
    Leader badge">
    Downloads: 11 This Week
    Last Update:
    See Project
  • 19
    VATSG

    VATSG

    Video automatic transcribe and translated subtitle generator

    It generates srt format subtitle from videofile which can be any source language that whisper support , and then make translated subtitle file of your target language which deepl support. This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    OpenAI Web Application

    OpenAI Web Application

    A web application that allows users to interact with OpenAI's models

    ...Take advantage of DALL·E models to generate AI images. Utilize Whisper Model to transcribe audio into text.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    NECCO-EDIT
    ...This app currently supports only Japanese. (English support will be available soon) This app is a video editing application that runs on Windows. You can easily trim or combine unnecessary parts of MP4 files shot with smartphones or digital cameras, synthesize audio files and image files, and synthesize text characters. The completed project can be output as an MP4 file, so it can be posted on video sites such as YouTube. Since it is still under development, the functions are extremely scarce, but we aim to make it possible for anyone to easily enjoy full-fledged video editing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    sorry

    sorry

    Make gifs online

    sorry is a minimal interactive web app designed to generate playful “sorry” animations, messages or digital apology cards. Users can choose from themed styles (fonts, backgrounds, effects) and enter their own message, and the app renders a stylized apology dynamically. Some versions include animation or transitions—letters fading in, shaking, or dissolving—to make the message more expressive. The interface emphasizes simplicity: a user types the text, previews it live, and can share or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Tengwar Editor

    Tengwar Editor

    Create, save, copy and edit tengwar texts with this application.

    Tengwar is the alphabet of elfish languages which was invented by J.R.R.Tolkien. The application will be write sentences in different languages by simbols of tengwar.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    GoodByeCatpcha

    GoodByeCatpcha

    Solver ReCaptcha v2 Free

    An async Python library to automate solving ReCAPTCHA v2 by images/audio using Mozilla's DeepSpeech, PocketSphinx, Microsoft Azure’s, Google Speech and Amazon's Transcribe Speech-to-Text API. Also image recognition to detect the object suggested in the captcha. Built with Pyppeteer for Chrome automation framework and similarities to Puppeteer, PyDub for easily converting MP3 files into WAV, aiohttp for async minimalistic web-server, and Python’s built-in AsyncIO for convenience.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next