[go: up one dir, main page]

Showing 81 open source projects for "sound recognition"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    Qwen2-Audio

    Qwen2-Audio

    Repo of Qwen2-Audio chat & pretrained large audio language model

    Qwen2-Audio is a large audio-language model by Alibaba Cloud, part of the Qwen series. It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models. It is evaluated on many benchmarks (speech recognition, translation, sound...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ... of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    OpenaiBot

    OpenaiBot

    Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant

    If you don't have the instant messaging platform you need or you want to develop a new application, you are welcome to contribute to this repository. You can develop a new Controller by using Event.py. Compatibility with multiple LLMs and integration with GPT and third-party systems is handled by our llm-kira project on GitHub. It can accurately limit billing, with limits and ID binding. Supports asynchronous operations and can handle multiple requests simultaneously. Allows for private and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 5
    Voxal voice changer

    Voxal voice changer

    Transform your voice in real-time voxal voice changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content, we...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 6
    Puzzle Patch

    Puzzle Patch

    Entertaining and interactive game of solving 50+ puzzle pieces

    Welcome to Puzzle Patch, the ultimate desktop puzzle-solving experience that promises fun and engaging entertainment for players of all ages including kids! Kids will love to play this game as it helps in their cognitive development, spatial awareness as they arrange these scrambled puzzle pieces into 1 perfect picture. Visual recognition will be improved as kids try to identify shapes, patterns. Puzzle Patch has a wide array of features and functionalities designed to immerse you in a world...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Vosk Desktop

    Vosk Desktop

    Desktop software for controlling the Vosk Speech Recognition Toolkit

    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    VoodooHDA
    VoodooHDA is an open source audio driver for devices compliant with the Intel High Definition Audio specification. It is intended as a replacement for AppleHDA on Mac OS X with support for a wide range of audio controllers and codecs.
    Leader badge">
    Downloads: 341 This Week
    Last Update:
    See Project
  • 9
    VoodooHDA Installer
    VoodooHDA Installer is an open source audio driver Installer for devices compliant with the Intel High Definition Audio specification. It is intended as a replacement for AppleHDA on Mac OS X with support for a wide range of audio controllers and codecs. Taken from projects https://sourceforge.net/projects/voodoohda/ https://github.com/MuntashirAkon/DPCIManager https://github.com/sveinbjornt/Platypus
    Leader badge">
    Downloads: 38 This Week
    Last Update:
    See Project
  • Deliver trusted data with dbt Icon
    Deliver trusted data with dbt

    dbt Labs empowers data teams to build reliable, governed data pipelines—accelerating analytics and AI initiatives with speed and confidence.

    Data teams use dbt to codify business logic and make it accessible to the entire organization—for use in reporting, ML modeling, and operational workflows.
    Learn More
  • 10
    General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Speech Recognition in English & Polish

    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    chatbot_chung
    chatbot chung is a keywords based probabilities algorythm simple entertainment chatbot with 3D talking openGL avatars written in freebasic. Can import aiml simple question/answer or question/random/answers or single star/ multi srai data saved from "AIML_chung" open source application . Online html5 javascript version with 44 languages multilingual auto detection available on the website (source included in the zip file). SORT gentext text generation algorythm option added (desktop version) .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    JNIZ music notation audio to midi

    JNIZ music notation audio to midi

    music composition and notation software, audio to midi converter

    Jniz is a piece of software designed for musicians as a support tool to the musical composition. It allows you to build and to harmonize several voices according to the rules of classical harmony. Sound/audio-to-Midi converter: real-time conversion of any monophonic sound (voice, instrument etc.) into notes / tones. Jniz is a free proprietary piece of software. You do not have the right to sell, distribute Jniz or use its sources under penalty of law. You will infringes on the Jniz staff...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 14
    AudioCuesheetEditor

    AudioCuesheetEditor

    Write audio cuesheets fast and efficient

    Annoyed of writing your audio cuesheets via text editor? Here comes the solution! AudioCuesheetEditor helps you editing your audio cuesheet file. With AudioCuesheetEditor you can import textfiles, edit the imported data and afterwards export to a working audio cuesheet. There are also other export posibilities like csv or xml. Feel free to try it ;).
    Leader badge">
    Downloads: 13 This Week
    Last Update:
    See Project
  • 15
    H.B.S.N

    H.B.S.N

    Speech Recognition System

    H.B.S.N is a simple speech recognition software which programmed using Java. This software is a package of many sub applications.And those are as listed below , Audio Player Video Player Email Client Weather Application Mp3 Tag Editor Picture Viewer Home Automation Application Alarm / Timer Folder Locker Message Encrypt Application Income & Expenses Logging Application Apart from that we can do many thing from this software...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    annyang!

    annyang!

    Speech recognition for your site

    ... nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. Grab the latest version of annyang.min.js, drop it in your html, and start adding commands. You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    JuliusModels

    JuliusModels

    Open source speech models for Julius in English and other languages.

    Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    FM2TXT

    FM2TXT

    RtlSdr listen to radio, recognize audio, and writes text file log

    Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt
    Downloads: 7 This Week
    Last Update:
    See Project
  • 20

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Speaker Recognition System

    Speaker Recognition System - Matlab source code

    ... is correlated with the physiological and behavioral characteristics of the speaker. These characteristics exist both in the spectral envelope (vocal tract characteristics) and in the supra-segmental features (voice source characteristics and dynamic features spanning several segments). Index Terms: speaker, recognition, verification, sound, words.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Speech Recognition System

    Speech Recognition System - Matlab source code

    ... has improved. For example, recognition systems optimized for telephone applications can often supply information about the confidence of a particular recognition, and if the confidence is low, it can trigger the application to prompt callers to confirm or repeat their request. Index Terms: speech, recognition, verification, sound, isolated, words.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    My Music Recognition

    My Music Recognition

    This application can help you quickly identify the name of any song.

    My Music Recognition uses a powerful audio recognition engine in order to help you get the name of the song you are listening to. It can capture sound from radio streams, the installed music player or any other source and display the name of the song in seconds.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25

    Tygamusic

    A pygame music lib.

    This lib was produced while I was programming an other program/game. I was tired of pygame's bad system of handling playlists and the management of music in general. With this lib I want to create an layer that allows you to interact with the music, how you would expect it. Currently featuring: -Playlist -Normal pausing and resuming (played time isn’t lost when new song is loaded) -Automatic recognition of songs and adding them to a separate list
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next