Open Source OCR Engine
Face recognition with deep neural networks
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
State-of-the-art 2D and 3D Face Analysis Project
A Lightweight Face Recognition and Facial Attribute Analysis
Speech-to-text, text-to-speech, and speaker recognition
Port of OpenAI's Whisper model in C/C++
OCR software, free and offline
Captcha solver extension for humans
Contexts Optical Compression
A PyTorch-based Speech Toolkit
Audio foundation model excelling in audio understanding
kaldi-asr/kaldi is the official location of the Kaldi project
A pure Javascript Multilingual OCR
OpenVINO™ Toolkit repository
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Underthesea - Vietnamese NLP Toolkit
Multilingual Automatic Speech Recognition with word-level timestamps
Training data (data labeling, annotation, workflow) for all data types
Library for OCR-related tasks powered by Deep Learning
Open Source Computer Vision Library
A full spaCy pipeline and models for scientific/biomedical documents
Build your own AI friend