[go: up one dir, main page]

Showing 61 open source projects for "whisper"

View related business solutions
  • TelemetryTV content management and device management Icon
    TelemetryTV content management and device management

    Simple and intuitive digital signage software.

    <section class="row"> <div class="small-12 columns"> <p class="description">TelemetryTV is a powerful digital signage platform built for the modern communicator who needs to engage audiences, generate awareness, or give their community a voice. TelemetryTV allows users to broadcast dynamic content easily by streaming video, images, social feeds, turnkey apps, and data-driven dashboards to all of your displays wherever they are. TelemetryTV powers marketing and internal communications at Starbucks, New York Public Library, Stanford University, and more.</p> </div> </section>
    Learn More
  • BidJS enables online timed and webcast bidding on your own website. Icon
    BidJS enables online timed and webcast bidding on your own website.

    For companies wishing to sell by auction

    Bidlogix provide auction software to auction houses around the globe. We offer timed auction software, webcast auction software along with an optional invoicing management system. Our software is embedded on your site with fully customisable styling. Based in Brighton, UK Bidlogix started providing auction software in 2013. Our 2 in-house development teams are constantly evolving the product with in excess of 10 auctions per day being run using our software. Our software can cater for the largest of auctions, all in real-time and can support multiple languages.
    Get Started Now
  • 1
    Whisper

    Whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection.
    Downloads: 69 This Week
    Last Update:
    See Project
  • 2
    whisper.cpp

    whisper.cpp

    Port of OpenAI's Whisper model in C/C++

    whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. ...
    Downloads: 540 This Week
    Last Update:
    See Project
  • 3
    whisper-timestamped

    whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence. Whisper is a set of multi-lingual, robust speech recognition models trained by OpenAI that achieve state-of-the-art results in many languages. Whisper models were trained to predict approximate timestamps on speech segments (most of the time with 1-second accuracy), but they cannot originally predict word timestamps. This repository proposes an implementation to predict word timestamps and provide a more accurate estimation of speech segments when transcribing with Whisper models. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Whisper Turbo

    Whisper Turbo

    Cross-Platform, GPU Accelerated Whisper

    Whisper Turbo is a fast, cross-platform Whisper implementation, designed to run entirely client-side in your browser/electron app.
    Downloads: 4 This Week
    Last Update:
    See Project
  • The only CRM built for B2C Icon
    The only CRM built for B2C

    Stop chasing transactions. Klaviyo turns customers into diehard fans—obsessed with your products, devoted to your brand, fueling your growth.

    Klaviyo unifies your customer profiles by capturing every event, and then lets you orchestrate your email marketing, SMS marketing, push notifications, WhatsApp, and RCS campaigns in one place. Klaviyo AI helps you build audiences, write copy, and optimize — so you can always send the right message at the right time, automatically. With real-time attribution and insights, you'll be able to make smarter, faster decisions that drive ROI.
    Learn More
  • 5
    WhisperLive

    WhisperLive

    A nearly-live implementation of OpenAI's Whisper

    WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Go OpenAI

    Go OpenAI

    OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go

    This library provides Go clients for OpenAI API. OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 45 This Week
    Last Update:
    See Project
  • 8
    WhisperKit

    WhisperKit

    On-device Speech Recognition for Apple Silicon

    WhisperKit is a Swift package that integrates OpenAI's popular Whisper speech recognition model with Apple's CoreML framework for efficient, local inference on Apple devices. Whisper has pulled the future forward when fast, free and virtually error-free translation and transcription will be ubiquitous. It inspired numerous developers to improve and deploy it with minimal friction and maximum performance.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Leader badge">
    Downloads: 53 This Week
    Last Update:
    See Project
  • deskbird is the most intuitive desk booking app for your hybrid office. Icon
    deskbird is the most intuitive desk booking app for your hybrid office.

    With deskbird, creating an efficient workplace has never been easier.

    For companies in need of a people-centric workplace management solution so employees can see who is in the office, schedule their office and work-from-home days, and book resources for office days.
    Learn More
  • 10
    WhisperSpeech

    WhisperSpeech

    An Open Source text-to-speech system built by inverting Whisper

    WhisperSpeech is an open-source text-to-speech system created by “inverting” OpenAI’s Whisper, reusing its strengths as a semantic audio model to generate speech instead of only transcribing it. The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS: Whisper is used to produce semantic tokens, EnCodec compresses the waveform into acoustic tokens, and Vocos reconstructs high-fidelity audio from those tokens. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Handy STT

    Handy STT

    A free, open source, and extensible speech-to-text application

    ...Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. Its backend leverages OpenAI’s Whisper models for GPU-accelerated speech recognition and Parakeet V3 for efficient CPU-only transcription with automatic language detection. To further refine accuracy and responsiveness, Handy integrates Silero’s Voice Activity Detection (VAD) for silence filtering, ensuring only speech segments are processed.
    Downloads: 52 This Week
    Last Update:
    See Project
  • 12
    Hyprnote

    Hyprnote

    Local-first AI Notepad for Private Meetings

    Hyprnote is an open-source, privacy-first AI notepad app designed for taking notes during meetings—transcribing audio (microphone and system) and generating context-rich summaries using on-device AI models like Whisper and HyprLLM, all without any data leaving your machine.(turn0search7, turn0search1). Listens to your meetings while you write. Crafts smart summaries based on your quick notes. Runs completely offline using open-source models like Whisper or HyprLLM. Use approved third-party APIs like Gemini, Claude, or Azure-hosted GPT.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 13
    Buzz

    Buzz

    Transcribe and translate audio offline on your personal computer

    Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for Windows: App is not signed, you will get a warning when you install it. ...
    Leader badge">
    Downloads: 8,381 This Week
    Last Update:
    See Project
  • 14
    Note67

    Note67

    A private, local meeting notes assistant

    ...Built with a cross-platform architecture using Rust (via Tauri) for backend logic and a TypeScript/React frontend, it prioritizes privacy by performing audio transcription locally with Whisper models and generating summaries with locally-hosted AI, eliminating the need to send sensitive meeting content to external servers. Users can record meetings directly from their microphone, view live transcriptions, filter by speaker, and export structured summaries, making it useful for professionals who need searchable, organized records of discussions. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    ChatGPT Telegram Bot

    ChatGPT Telegram Bot

    A Telegram bot that integrates with OpenAI's official ChatGPT APIs

    A Telegram bot that integrates with OpenAI's official ChatGPT, DALL·E and Whisper APIs to provide answers. Ready to use with minimal configuration required.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    OpenAI .NET

    OpenAI .NET

    The official .NET library for the OpenAI API

    ...Every synchronous method has an async counterpart, and the library offers convenient streaming primitives for chat completions so you can process tokens as they arrive. It supports tool/function calling, structured outputs via JSON schema, audio input/output, image generation, embeddings, Whisper transcription, and assistants with retrieval augmented generation.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Speech-AI-Forge

    Speech-AI-Forge

    Speech-AI-Forge is a project developed around TTS generation model

    ...The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it in Docker containers, or set it up locally with its environment preparation scripts. It is model-agnostic and advertises support for a variety of TTS and speech models such as ChatTTS, CosyVoice, Fish-Speech, FireredTTS and others, as well as Whisper-based ASR, giving you a flexible playground for experimenting with different speech stacks. The project also integrates with general-purpose LLMs (for example GPT- or LLaMA-style models), which can be used to pre-process text, manage conversations.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    Whisper Batch Transcriber

    Whisper Batch Transcriber

    Unlimited, private and free Speech-To-Text program

    ## About: Automatically transcribe all of your voice recordings into clean, organized, neat text files. It's free, fully automated, unlimited, using state-of-the-art speech-to-text technology. Works 100% offline on your computer, privately and locally. ## Usecases: Convert speeches, podcasts, webinars, monologues, storytellings and other audio speech into a formatted .txt file. One sentence per new line. ## Notes: - Its 2GB in size and requires 2-6GB of GPU VRAM too. (basically...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 19
    Speech Note

    Speech Note

    Speech Note Linux app. Note taking, reading and translating

    ...All processing is done locally, which means audio, text, and translations never leave the device, emphasizing strong privacy guarantees. The application supports multiple STT engines such as Coqui STT (DeepSpeech fork), Vosk, whisper.cpp, Faster Whisper, and april-asr, giving users flexibility in accuracy, speed, and hardware requirements. For text-to-speech, it can plug into a wide range of engines including espeak-ng, MBROLA, Piper, RHVoice, Coqui TTS, Mimic 3, WhisperSpeech, Kokoro, Parler-TTS, F5-TTS, and even classic S.A.M., making it highly customizable in terms of voices and languages.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    web3.js

    web3.js

    Ethereum JavaScript API

    web3.js is the Ethereum JavaScript API that connects to the Generic JSON-RPC spec. It is composed of a selection of libraries that make it possible to interact with a local or remote ethereum node, using a HTTP or IPC connection. The node may be local, hosted by the DApp provider, or a public gateway such as Infura, which operates free Ethereum access points. It is necessary to run a local or remote Ethereum node to be able to use this library. web3.js is directly usable on web technology...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 21
    LocalAI

    LocalAI

    Self-hosted, community-driven, local OpenAI compatible API

    ...Drop-in replacement for OpenAI running LLMs on consumer-grade hardware. Free Open Source OpenAI alternative. No GPU is required. Runs ggml, GPTQ, onnx, TF compatible models: llama, gpt4all, rwkv, whisper, vicuna, koala, gpt4all-j, cerebras, falcon, dolly, starcoder, and many others. LocalAI is a drop-in replacement REST API that’s compatible with OpenAI API specifications for local inferencing. It allows you to run LLMs (and not only) locally or on-prem with consumer-grade hardware, supporting multiple model families that are compatible with the ggml format. ...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 22
    Whishper

    Whishper

    Transcribe any audio to text, translate and edit subtitles 100% locall

    Open-source, local-first audio transcription and subtitling suite with a simple web UI. Thanks to open-source technologies, Whishper can run 100% offline. Your data never leaves your computer. Whishper allows you to translate your transcriptions to and from more than 60 languages thanks to Argos Translate and LibreTranslate. Download the transcriptions in many formats (json, txt, vtt, srt). Easily edit your subtitles right in the Web-UI.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 23
    Lepton AI

    Lepton AI

    A Pythonic framework to simplify AI service building

    A Pythonic framework to simplify AI service building. Cutting-edge AI inference and training, unmatched cloud-native experience, and top-tier GPU infrastructure. Ensure 99.9% uptime with comprehensive health checks and automatic repairs.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24

    Whisper-Transcriber-Tool

    Desktop application that converts video files into accurate text

    WhisperTranscriber is a free, powerful desktop application that converts video files into accurate text using OpenAI's Whisper AI model. Perfect for journalists, researchers, students, content creators, and anyone who needs reliable transcription. KEY FEATURES: - High-accuracy AI transcription with 99+ language support - Works completely offline - no internet required, total privacy - Supports all common video formats (mp4) - Batch processing for multiple files - Automatic language detection - Drag & drop interface - Export as SRT formats - No file size limits PORTABLE VERSION: - No installation needed - Run from USB or any folder - FFmpeg and AI models included - Lightweight and fast WHY CHOOSE WHISPERTRANSCRIBER: ✓ 100% free forever - no subscriptions or hidden costs ✓ Complete privacy - all processing happens on your computer ✓ No account or registration required ✓ Professional-grade accuracy ✓ Works offline
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    Audiobookshelf-Whisper-transcriptor
    # Audiobookshelf Windows - Complete Distribution with Whisper Integration **Custom Enhanced Audiobookshelf Distribution for Windows with Full Whisper Integration** This project is a custom and enhanced distribution of Audiobookshelf that includes: - ✅ **Native Windows application** with system tray - ✅ **Complete server** with all dependencies - ✅ **Full Whisper integration** with GPU detection and pre-installed models - ✅ **Android mobile app** with complete subtitle functionality - ✅ **Single Windows installer** with no external dependencies - ✅ **Portable Node.js** included - ✅ **Auto-updates disabled** by default --- ## 📜 Credits and Licenses This project is built upon and integrates multiple open-source projects: ### **Core Projects** - **[Audiobookshelf Server](https://github.com/advplyr/audiobookshelf)** by [@advplyr](https://github.com/advplyr) - License: **GPL v3.0** - Main audiobook and podcast server functionality and more....
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next