[go: up one dir, main page]

Showing 43 open source projects for "voice browser"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Budgyt Is The Highest Rated Business Budgeting Software In The Market. Icon
    Budgyt Is The Highest Rated Business Budgeting Software In The Market.

    Affordable budgeting software for companies with multiple users and multiple departments.

    Budgyt is an easy to use, intuitive platform with a clean simple interface that makes budgeting multiple P&L’s easy to do without needing Excel.
    Book a Demo
  • 1
    clone-voice

    clone-voice

    A sound cloning tool with a web interface, using your voice

    Clone-voice is a local voice-cloning tool that lets you synthesize speech in any target voice or convert one recording into another voice using the same timbre. It is built around Coqui’s XTTS-v2 model, so it inherits multilingual support and modern neural TTS quality while wrapping it in a user-friendly desktop workflow. The app is designed to be very easy to use: you download a precompiled package, double-click app.exe, and it launches a browser-based web interface where you control cloning and synthesis. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 2
    TTS Voice Wizard

    TTS Voice Wizard

    Speech to Text to Speech, sends text as OSC messages

    ...The app can translate your speech from one language to over 20 other support languages. There are 100+ different voices with various customization options so you can pick a voice that best suits you. Display the current song you are listening to on Spotify or via your browser. Display tracker and controller battery life in conjunction with XSOverlay. Use in conjunction with HRtoVRChat_OSC to enable you to display your heartrate in VRChat's Chatbox.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 3
    Applio

    Applio

    A simple, high-quality voice conversion tool focused on ease of use

    Applio is a high-quality voice conversion toolkit designed to make modern RVC/VITS-based voice cloning accessible to non-experts. It focuses strongly on ease of use: installation scripts for Windows, Linux, and macOS set up dependencies and then launch a browser-based Gradio interface. Within that interface, users can train and run voice conversion models for tasks like singing conversion, speech-to-speech transformation, and voice cloning.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 4
    OpenAI.fm

    OpenAI.fm

    Code for openai.fm, a demo for the OpenAI Speech API

    OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. Users can experiment with different input text and voice options directly in their browser, gaining a sense of how high-fidelity AI audio can be integrated into applications ranging from podcasts and narration to accessibility tools and interactive agents. ...
    Downloads: 185 This Week
    Last Update:
    See Project
  • The fastest way to host, scale and get paid on WordPress Icon
    The fastest way to host, scale and get paid on WordPress

    For developers searching for a web hosting solution

    Lightning-fast hosting, AI-assisted site management, and enterprise payments all in one platform designed for agencies and growth-focused businesses.
    Learn More
  • 5
    Happy Coder

    Happy Coder

    Mobile and Web client for Codex and Claude Code, with realtime voice

    Happy is an open-source, cross-platform mobile and web client designed to bring powerful AI coding agents such as Claude Code and Codex to your fingertips no matter where you are. At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. The project includes components like a dedicated backend server for encrypted sync, a rich front-end experience across web and native apps, and support for push notifications when your coding agent encounters permission requests or errors. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 6
    OpenAI Realtime Agents

    OpenAI Realtime Agents

    This is a simple demonstration of more advanced, agentic patterns

    This repository demonstrates how to build low-latency, streaming “voice + chat” agents using OpenAI’s Realtime API combined with the OpenAI Agents SDK. The demo shows patterns for connecting a realtime voice stream (audio in/out) with agents that can use tools, maintain state, and orchestrate multi-agent workflows. The SDK offers abstractions such as agent orchestration, event handling, handoffs, state management, and guardrails, tailored to support realtime, conversational systems. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Manyfold

    Manyfold

    A self-hosted digital asset manager for 3d print files

    Manyfold is an open-source 3D collaboration platform that reimagines how distributed teams and communities can meet, create, and interact in immersive spatial environments through the web. Instead of forcing users to download native apps or create accounts on closed metaverse services, Manyfold runs entirely in the browser, letting people join 3D spaces with simple links and participate in real time using avatars, voice chat, and object interaction. Users can build or import shared 3D worlds, arrange media, embed content, and design interactive layouts that support presentations, workshops, social events, games, and team gatherings without heavy software installations. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    OpenClaw

    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such...
    Downloads: 873 This Week
    Last Update:
    See Project
  • 9
    edge-tts

    edge-tts

    Use Microsoft Edge's online text-to-speech service from Python

    edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common formats like MP3 or WAV. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • GoAnywhere Managed File Transfer (MFT) Icon
    GoAnywhere Managed File Transfer (MFT)

    Secure and simplify your file transfers

    GoAnywhere MFT provides secure managed file transfer for enterprises. Deployable on-premise, in the cloud, or in hybrid environments, GoAnywhere MFT software enables organizations to exchange data among employees, customers, and trading partners, as well as between systems, securely. GoAnywhere MFT was a recipient of the Cybersecurity Excellence Award for Secure File Transfer.
    Learn More
  • 10
    Telegram Web A

    Telegram Web A

    Telegram Web A, GPL v3

    ...The project achieved recognition (winning first prize in the Telegram Lightweight Client Contest) and serves as the code base behind the official web client available at web.telegram.org/a. The architecture takes advantage of advanced browser capabilities: WebSockets for real-time messaging, Web Workers and WebAssembly for performance-critical tasks, multi-level caching and PWA features for offline or near-offline usability, voice recording and media streaming, raw binary data handling and cryptographic operations. It also handles rich UI/UX elements such as CSS/Canvas/SVG animations, reactive data streams, etc.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11
    Abaddon

    Abaddon

    An alternative Discord client with voice support made with C++ and GTK

    Alternative Discord client made in C++ with GTK. Abaddon tries its best (though is not perfect) to make Discord think it's a legitimate web client. Some of the things done to do this include: using a browser user agent, sending the same IDENTIFY message that the official web client does, using API v9 endpoints in all cases, and not using endpoints the web client does not normally use. There are still a few smaller inconsistencies, however. For example the web client sends lots of telemetry...
    Downloads: 65 This Week
    Last Update:
    See Project
  • 12
    ChatTTS webUI & API

    ChatTTS webUI & API

    A simple native web interface that uses ChatTTS to synthesize text

    ...From version 0.96 onward, ffmpeg installation is required for deployment, and previous CSV/PT voice tables are no longer valid, so users instead work with updated “voice value” parameters. For convenience, there is a prepackaged Windows build: you download a release archive, extract it, and double-click app.exe to start the web UI, which opens on localhost:9966.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    WhisperLive

    WhisperLive

    A nearly-live implementation of OpenAI's Whisper

    WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ​Supports microphone recording and real-time processing in most of the implemented getUserMediamobile and PC browsers, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView) , uni-app (App, H5), and most Android phones updated after 2021 have their own browsers; do not support: UC-based kernel (typical Alipay), most of the old domestic mobile phones that have not been updated have their own browsers and any other form of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Amica

    Amica

    Amica is an open source interface for interactive communication

    ...Under the hood, Amica leverages modern web and desktop technologies: three.js and three-vrm for 3D rendering, Transformers.js for running models in the browser, Whisper and Silero VAD for speech recognition and voice-activity detection, and a variety of LLM backends such as llama.cpp servers, ChatGPT-compatible APIs, Ollama, KoboldCpp, and others. It also integrates multiple text-to-speech providers, including ElevenLabs, OpenAI, Coqui, RVC, and AllTalkTTS.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Order Voice  PHP Order Number Caller

    Order Voice PHP Order Number Caller

    is a simple yet powerful web-based application designed to streamline

    OrderVoice is a simple yet powerful web-based application designed to streamline the customer experience in businesses like restaurants, retail stores, and service centers. With OrderVoice, you can easily announce when an order is ready using a clear and customizable voice prompt
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Audio Satanifier 666

    Audio Satanifier 666

    Easily apply cool gnarly voice filters to your audio files

    Transform pure innocent audio files, speech, music, etc into unholy demonic abominations. Audio Satanifier 666 is a fun easy-to-use browser-based tool forged in the pits of hell, for voice actors, musicians, sound designers, for memes, for creative projects or anyone else who want to twist their sound into something absolutely diabolical! Layperson friendly - you'll be able to apply cool effects to your audio file even if you know nothing about audio engineering. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18

    Leagueloch

    Free Portable Database For Organizing Your Legal Paper Work

    LeagueLoch, is a portable database, which has been written for Window XP. Its basic design, makes it very easy to use, specially if your dyslexic !!!! Its main purpose is for the organization of your legal paper work !!!! It also incorporates, it's own chat browser, for peer 2 peer video chats, none of which are monitored, making it ideal for legal chats !!!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Xiaomi.eu Multilang MIUI ROMs

    Xiaomi.eu Multilang MIUI ROMs

    OFFICIAL mirror of MIUI Multilang ROM by https://xiaomi.eu/

    Included languages: English (en.miui.com, xiaomi.eu) Chinese (miui.com) Polish (Acid, miuipolska.pl) Slovak (ingbrzy, https://miuios.cz) Czech (VMach, https://miuios.cz) Bulgarian (Global) Hungarian (vagyula, miui.hu) French (Global) Swedish (Global) Russian (Global) Ukrainian (Global) Belorussian (Global) Croatian (Masva, MIUI Adria) Vietnamese (Belmont-Gabriel) Italian (miui.it) Greek (Global) Norwegian (Global) Dutch (cueie, mannyribo, redmaner) Spanish (nosijf982)...
    Leader badge">
    Downloads: 16,446 This Week
    Last Update:
    See Project
  • 20
    cerberuscms2

    cerberuscms2

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Cerberus Content Management System 6

    Cerberus Content Management System 6

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model complete with a Unix-Like Kernel File named: Cerberus. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text,...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 22
    CerberusCMS5

    CerberusCMS5

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 23
    Read Aloud

    Read Aloud

    An awesome browser extension that reads aloud webpage content

    ...Read Aloud lets users choose from multiple voices: built-in browser voices, plus premium cloud voices from providers such as Google Wavenet, Amazon Polly, IBM Watson, and Microsoft. It offers simple controls through a toolbar button, right-click menu, and keyboard shortcuts for play, pause, stop, rewind, and fast forward. Users can customize voice, speed, pitch, and text highlighting behavior directly from the extension’s settings panel.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Rhasspy

    Rhasspy

    Offline private voice assistant for many human languages

    Rhasspy (ˈɹæspi) is an open-source, fully offline set of voice assistant services for many human languages that works well with Hermes protocol-compatible services (Snips.AI), Home Assistant and Hass.io, Node-RED, Jeedom, OpenHAB. Rhasspy will produce JSON events that can trigger action in home automation software, such as a Node-RED flow. Rhasspy comes with a snazzy web interface that lets you configure, program, and test your voice assistant remotely from your web browser. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Polarity Browser

    Polarity Browser

    A fast, secure, stable web browser powered by Chromium and Trident.

    Polarity is a dual engine browser powered by both Chromium and Trident that focuses on system efficiency by bringing low RAM and low CPU usage. This browser was also programmed to be optimized for Windows with an Android version available. Browse the web with an uncomplicated UI that is highly customizable with themes, apps, and extensions from the Polarity Store, GreasyFork, OpenUserJS, and UserStyles. Polarity also comes with a built in password manager which safely encrypts all data to...
    Downloads: 68 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next