[go: up one dir, main page]

Showing 64 open source projects for "tamil speech recognition"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Accounting Software Built for Owners, and Their Clients Icon
    Accounting Software Built for Owners, and Their Clients

    Make invoicing and billing painless for your small business with FreshBooks.

    Balancing your books, client relationships, and business isn’t easy. FreshBooks gives you the info and time you need to focus on your big picture—your business, team, and clients.
    Learn More
  • 1
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details).
    Downloads: 20 This Week
    Last Update:
    See Project
  • 2
    Textream

    Textream

    Textream is a free macOS teleprompter app for streamers, interviewers

    Textream is an open-source, free macOS teleprompter application designed for streamers, podcasters, presenters, and interviewers who want a smooth, distraction-free way to stay on script. It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    SCAIL

    SCAIL

    Towards Studio-Grade Character Animation via In-Context Learning of 3D

    ...While specific documentation about SCAIL’s exact goals and implementation is limited from the repository context alone, the project appears to be part of a collection of machine learning and AI research tools that facilitate scalable model development, evaluation, or application workflows. Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL likely emphasizes scalable, composable AI learning frameworks that support researchers and practitioners in experimenting with learning algorithms, datasets, and model components. The repository structure suggests a focus on flexibility and extensibility, with potential integration into other ZAI tooling for training or analysis.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ​Supports microphone recording and real-time processing in most of the implemented getUserMediamobile and PC browsers, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView) , uni-app (App, H5), and most Android phones updated after 2021 have their own browsers; do not support: UC-based kernel (typical Alipay), most of the old domestic mobile phones that have not been updated have their own browsers and any other form of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Create and manage the email signature you need Icon
    Create and manage the email signature you need

    For companies and organizations that need an email signature solution

    With WiseStamp it’s easy to unify your brand and turn your emails into a powerful marketing tool. Get the most out of your emails with a professionally designed custom email signature.
    Learn More
  • 5
    Google2SRT

    Google2SRT

    Download, save and convert multiple subtitles from YouTube videos

    Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.
    Downloads: 61 This Week
    Last Update:
    See Project
  • 6
    Voxal voice changer

    Voxal voice changer

    Transform your voice in real-time voxal voice changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content,...
    Leader badge">
    Downloads: 19 This Week
    Last Update:
    See Project
  • 7
    Vosk Desktop

    Vosk Desktop

    Desktop software for controlling the Vosk Speech Recognition Toolkit

    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    VATSG

    VATSG

    Video automatic transcribe and translated subtitle generator

    It generates srt format subtitle from videofile which can be any source language that whisper support , and then make translated subtitle file of your target language which deepl support. This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Speech Recognition in English & Polish

    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki/ Domain advanced versions (Polish Language) 1. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Queue Management System for Busy Service Providers | WaitWell Icon
    Queue Management System for Busy Service Providers | WaitWell

    The queue management system that perfectly adapts to your workflows

    The queue management system that perfectly adapts to your workflows. Improve operational efficiency in weeks with the most configurable enterprise queue system.
    Learn More
  • 10
    chatbot_chung
    chatbot chung is a keywords based probabilities algorythm simple entertainment chatbot with 3D talking openGL avatars written in freebasic. Can import aiml simple question/answer or question/random/answers or single star/ multi srai data saved from "AIML_chung" open source application . Online html5 javascript version with 44 languages multilingual auto detection available on the website (source included in the zip file). SORT gentext text generation algorythm option added (desktop version) .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Defox text to speech and downloader

    Defox text to speech and downloader

    Written or imported text offline read or online download.

    This software design to convert text to speech and download the converted speech. Description : • Installation setup with two languages (English, French) • Two areas called text reading and speech downloading • Many languages supported to download center Note 1: I'm a student yet and I'm not in the software designing industry. Therefore maybe I haven't software making skills. I'm worried about that. ! Note 2 : When you double click on the software maybe it will get some seconds...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    NASH OS

    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    H.B.S.N

    H.B.S.N

    Speech Recognition System

    H.B.S.N is a simple speech recognition software which programmed using Java. This software is a package of many sub applications.And those are as listed below , Audio Player Video Player Email Client Weather Application Mp3 Tag Editor Picture Viewer Home Automation Application Alarm / Timer Folder Locker Message Encrypt Application Income & Expenses Logging Application Apart from that we can do many thing from this software by using voice commands , such as , Open & close applications which are installed in the computer Open web sites Open folders which are in the HDD Control built-in audio & video player Control the home automation system Reading mails Reading selected text Speaking clock ( Time & Date) Speaking weather report There are system commands for the tasks which this application does.And we can replace the system default commands with custom commands.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ...It is designed to integrate with your home enviroment and for example build up your own, free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    annyang!

    annyang!

    Speech recognition for your site

    ...Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. Grab the latest version of annyang.min.js, drop it in your html, and start adding commands. You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    JuliusModels

    JuliusModels

    Open source speech models for Julius in English and other languages.

    Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    FM2TXT

    FM2TXT

    RtlSdr listen to radio, recognize audio, and writes text file log

    Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    JAVT - Just Another Voice Transformer

    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    cbrTekStraktor

    an application to automatically extract text from comic books.

    cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can however also be used for scanlation or similar purposes. The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    ...These toolkits are meant for facilitating research and development of automatic distant speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SpecimenPhoto controls a Canon Powershot camera for specimen archival photography. Each photograph is assigned a case number, labeled and stored. Identification is manual or "hands free" using separately available barcode and speech recognition modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    High-order HMM in Matlab

    Implementation of duration high-order hidden Markov model in Matlab.

    Implementation of duration high-order hidden Markov model (DHO-HMM) in Matlab with application in speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    jaivox

    jaivox

    Speech recognition application builder and library

    Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Responding Partner

    Responding Partner

    Control your PC computer with voice commands

    ...It also enables voice commands and controls to your computer for small task like open media files, open and close program, shutdown and restart computer,open website, type in editor, text to speech,etc. You can extend the ability by installing new plugin which available at files tab. We will continuous to update new plugin and animation character. Engine inside: - Speech Recognition - Text to Speech Requirements - Microphone
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next