Open Source OCR Engine
Offline speech recognition API for Android, iOS, Raspberry Pi
Face recognition with deep neural networks
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
State-of-the-art 2D and 3D Face Analysis Project
A Lightweight Face Recognition and Facial Attribute Analysis
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Port of OpenAI's Whisper model in C/C++
OCR software, free and offline
Captcha solver extension for humans
On-device Speech Recognition for Apple Silicon
Contexts Optical Compression
A PyTorch-based Speech Toolkit
Audio foundation model excelling in audio understanding
kaldi-asr/kaldi is the official location of the Kaldi project
A pure Javascript Multilingual OCR
OpenVINO™ Toolkit repository
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Underthesea - Vietnamese NLP Toolkit
Multilingual Automatic Speech Recognition with word-level timestamps
Training data (data labeling, annotation, workflow) for all data types
Library for OCR-related tasks powered by Deep Learning