[go: up one dir, main page]

Showing 88 open source projects for "mega voice command"

View related business solutions
  • Cybersecurity Starts With Password Security. Icon
    Cybersecurity Starts With Password Security.

    Keeper is the top-rated password manager for protecting you, your family and your business from password-related data breaches and cyberthreats.

    Research shows that a whopping 81% of data breaches are due to weak or stolen passwords. Business password managers provide an affordable and simple way for companies to solve the single biggest root cause of most data breaches. By implementing Keeper, your business is significantly reducing the risk of a data breach.
    Get Started
  • Smarter safety. Simplified compliance. Stronger teams. Icon
    Smarter safety. Simplified compliance. Stronger teams.

    EHS Hero works with companies ranging from emerging to enterprise-level to provide a scalable, integrated EHS management solution

    From compliance confidence to operational control, EHS Hero® equips EHS professionals with the tools they need to stay ahead of risk, meet regulatory demands, and build safer, smarter workplaces. Whether you’re managing incidents, maintaining audit readiness, delivering training, or analyzing performance, EHS Hero brings everything together in one powerful, easy-to-use platform. Join thousands of safety leaders who trust EHS Hero to simplify compliance and elevate safety performance—every day.
    Learn More
  • 1
    Real-Time Voice Cloning

    Real-Time Voice Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    ...In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that captures voice characteristics; this embedding is then used by a Tacotron-style synthesizer to generate spectrograms from text, which a WaveRNN-based vocoder finally turns into audio. The repo includes both a command-line demo and a graphical “toolbox” application where you can load reference voices, type text, and hear the synthesized results interactively.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    chatgpt-on-wechat

    chatgpt-on-wechat

    A chatbot built based on a large model

    chatgpt-on-wechat turns your WeChat client (including personal accounts) into an intelligent chatbot powered by large language models like ChatGPT, enabling automated replies, context-aware conversations, and media handling directly inside chats. It receives text and voice messages from private and group chats, forwards them to an AI model using official APIs, and returns replies that feel natural and contextually relevant, creating more engaging interactions without manual typing. Beyond simple text, the bot supports voice recognition and automatic voice or text responses, image generation based on descriptions, and independent memory of multi-turn conversations per user or group. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    FFsubsync

    FFsubsync

    Automagically synchronize subtitles with video

    Language-agnostic automatic synchronization of subtitles with video, so that subtitles are aligned to the correct starting point within the video. First, make sure ffmpeg is installed. Make sure ffmpeg is on your path and can be referenced from the command line! Next, grab the script. It should work with both Python 2 and Python 3. There may be occasions where you have a correctly synchronized srt file in a language you are unfamiliar with, as well as an unsynchronized srt file in your native language. In this case, you can use the correctly synchronized srt file directly as a reference for synchronization, instead of using the video as the reference. ffsubsync uses the file extension to decide whether to perform voice activity detection on the audio or to directly extract speech from an srt file. ffsubsync usually finishes in 20 to 30 seconds, depending on the length of the video.
    Downloads: 32 This Week
    Last Update:
    See Project
  • 4
    OpenAI-Compatible Edge-TTS API

    OpenAI-Compatible Edge-TTS API

    Free, high-quality text-to-speech API endpoint to replace OpenAI

    ...A Docker image is provided for one-command deployment, and environment variables can be used to configure default voice, language, response format, authentication, and logging options.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Eurekos LMS - Build a Smarter Customer Icon
    Eurekos LMS - Build a Smarter Customer

    The Eurekos customer training LMS makes it easy to deliver product training that retains more customers and transforms partners into advocates.

    Eurekos is a purpose-built LMS that engages customers throughout the entire learning journey from pre-sales, to onboarding, and everything after.
    Learn More
  • 5
    edge-tts

    edge-tts

    Use Microsoft Edge's online text-to-speech service from Python

    edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 6
    Audiblez

    Audiblez

    Generate audiobooks from e-books

    Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    MLX-Audio

    MLX-Audio

    A text-to-speech, speech-to-text and speech-to-speech library

    MLX-Audio is a speech library built on Apple’s MLX framework and optimized for Apple Silicon machines (M-series Macs). It focuses on text-to-speech and speech-to-speech workflows, with APIs and a command-line interface that make it easy to generate high-quality audio from text. Because it uses MLX and targets Apple Silicon, inference is fast and can take advantage of hardware acceleration and quantization for efficient on-device performance. The project provides a straightforward CLI (mlx_audio.tts.generate) as well as a Python API for programmatic generation of audio, including parameters for voice choice, speed, language hints, output format, and sample rate. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Remove Windows Ai

    Remove Windows Ai

    Strip Windows 11 of built-in AI features for control and privacy

    RemoveWindowsAI is an open source PowerShell-based tool created to help users regain control over their Windows 11 experience by disabling or removing AI-related features that Microsoft has increasingly integrated into the OS. It’s designed to work with currently released, stable versions of Windows 11 and continuously updated to match newly added AI components, especially since the 25H2 major update. The script covers a wide variety of AI surfaces (from core features like Copilot and Recall...
    Downloads: 132 This Week
    Last Update:
    See Project
  • 9
    MiniMax-MCP

    MiniMax-MCP

    Official MiniMax Model Context Protocol (MCP) server

    MiniMax-MCP is the official Model Context Protocol (MCP) server for accessing MiniMax’s multimodal generative APIs from MCP-compatible clients. It acts as a bridge between tools like Claude Desktop, Cursor, Windsurf, OpenAI Agents, and the MiniMax platform, exposing capabilities such as text-to-speech, voice cloning, image generation, text-to-image, video generation, image-to-video, text-to-video, and music generation. The server is written in Python and distributed under the MIT license,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Workload Automation for Global Enterprises Icon
    Workload Automation for Global Enterprises

    Orchestrate Your Entire Tech Stack with Redwood RunMyJobs

    Redwood lets you orchestrate securely and reliably across any application, service or server, in the cloud or on-premise, all inside a single platform.
    Learn More
  • 10
    Eris

    Eris

    A NodeJS Discord library

    A Node.js wrapper for interfacing with Discord. You will need NodeJS 10.4+. If you need voice support you will also need Python 2.7 and a C++ compiler. Create a directory for your bot, and change to that directory in your command line. If you want to be more updated (at the expense of stability), you can install the beta builds instead. Eris supports a few optional libraries that could potentially improve bot performance but may require additional dependencies.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    UV_Assistant

    Fast and secure voice assistant for your windows pc!

    UV Assistant can open applications using voice commands, UV Assistant can open websites with voice commands and etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Languine

    Languine

    Translate your application with Languine CLI powered by AI.

    ​Languine is an AI-powered localization platform designed to automate and streamline the translation process for applications, ensuring seamless integration within development workflows. It offers intelligent, context-aware translations across over 100 languages, maintaining brand voice and tone consistency. It provides a command-line interface and continuous integration/continuous deployment integration, allowing developers to manage translations directly or automate them within existing pipelines. Languine supports various file formats, including JSON, YAML, Markdown, and more, catering to diverse project requirements. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Open Interpreter

    Open Interpreter

    A natural language interface for computers

    Open Interpreter is an open-source tool that provides a natural-language interface for interacting with your computer. It lets large language models (LLMs) run code locally (Python, JavaScript, shell, etc.), enabling you to ask your computer to do tasks like data analysis, file manipulation, browsing, etc. in human terms (“chat with your computer”), with safeguards. Runs locally or via configured remote LLM servers/inference backends, giving flexibility to use models you trust or have...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 14
    BlackBelt WASTE - ipv4/Tor/i2p +AI+Voice

    BlackBelt WASTE - ipv4/Tor/i2p +AI+Voice

    Modern, AI-Smart, WASTE p2p for ipv4, Tor and i2p + Voice Conference.

    Open Source - GPLv3 inc images. A WASTE client. Download and create your own WASTE networks. Move 1000's of GB's at 100MB+ per sec. (800 Mbits per sec) FULL pause and resume capable. Voice Conference, Chat, Transfer files and Participate in Forums in a secure environment. For Windows XP 32/64, Vista 32/64, Win7 32/64, Win8 32/64, Win 10, Win 11, Linux (WINE). *** User Based Access Control - for voice, chats, file transfers and uploads. (useful in NULLNETS) *** Distributed...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    Scribe

    Scribe

    Free, open-source, and offline speech-to-text & voice control app.

    > Scribe is a free and open-source desktop assistant that brings powerful speech-to-text and voice control capabilities directly to your PC. It allows you to dictate text into any application, create custom voice commands, launch programs, and automate your workflow with text replacements. > Designed with privacy as a top priority, Scribe works completely offline. Your voice data never leaves your computer. Powered by the Vosk engine, it supports multiple languages and provides...
    Leader badge">
    Downloads: 88 This Week
    Last Update:
    See Project
  • 16
    VoiceClip

    VoiceClip

    VoiceClip es una aplicación de asistencia a usuarios

    VoiceClip es una aplicación de asistencia a usuarios diseñada para integrarse de manera fluida en su entorno de trabajo, proporcionando un acceso rápido y eficiente a diversas funcionalidades mediante comandos de voz y texto. Presentada como una barra de herramientas que permanece siempre visible en primer plano, VoiceClip busca simplificar tareas comunes, mejorar la productividad y facilitar la interacción con su sistema operativo y con tecnologías avanzadas de inteligencia artificial
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Maia
    MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    KoboldCpp

    KoboldCpp

    Run GGUF models easily with a UI or API. One File. Zero Install.

    KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.
    Downloads: 400 This Week
    Last Update:
    See Project
  • 19
    Besgnulinux
    Based on Debian Stable Installation tool: Calamares Besgnulinux is for both new and weak machines and end users (newbies). Besgnulinux tries to be fast, lightweight, easy to use and stable. It is designed to meet every need with window manager sessions like JWM, Openbox. Instead of background elements, the system is under user control. It does the same things that high-resource desktops like KDE and Gnome do. But it does the job with very little resources. Besgnulinux's goal is to...
    Leader badge">
    Downloads: 1,398 This Week
    Last Update:
    See Project
  • 20
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge">
    Downloads: 261 This Week
    Last Update:
    See Project
  • 21
    TURGEN

    TURGEN

    Create your own Atari tapes

    With TURGEN, you can record your own tapes with software for Atari 8-bit computers, and transfer data from your PC or Mac to your Atari 8-bit computer using your data recorder and a cassette adapter. TURGEN converts digital images of Atari software to the original standard Atari tape records and also various "turbo" systems. You can create WAVE files, tape images, or send the signal directly to the sound card. Tools such as Wizard for files, Tape Side Creator, Dual Track Tape creator...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 22
    cerberuscms2

    cerberuscms2

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Cerberus Content Management System 6

    Cerberus Content Management System 6

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model complete with a Unix-Like Kernel File named: Cerberus. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text,...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 24
    CerberusCMS5

    CerberusCMS5

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-Like model. It is a custom written Web Application Framework ( W.A.F. ) with a consistent and custom written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project' aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    Audio Webui

    Audio Webui

    A webui for different audio related Neural Networks

    ...For more advanced users, it exposes a rich set of command-line flags to control behavior such as skipping installation, disabling venv, changing model cache directories, sharing Gradio links, setting passwords, and specifying themes or ports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next