Open Source OCR Engine
PDF to Markdown with vision models
Formula recognition based on LaTeX-OCR and ONNXRuntime
Free OCR Software: No internet required, easy to use.
OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Awesome multilingual OCR toolkits based on PaddlePaddle
Ready-to-use OCR with 80+ supported languages
A pure JavaScript Multilingual OCR
PDF scientific paper translation with preserved formats
Web application that allows you to perform operations on PDF files
Library for OCR-related tasks powered by Deep Learning
Open Source Document Management System for Digital Archives
A community-supported supercharged version of paperless
A framework to enable multimodal models to operate a computer
A high-quality tool for converting PDF to Markdown and JSON
Scanner
Convert AI papers to GUI
Math OCR model that outputs LaTeX and markdown
Free Open Source Enterprise Grade RPA
Deep Learning API and Server in C++14 with support for Caffe, PyTorch
Qwen3-omni is a natively end-to-end, omni-modal LLM
Open source clipboard management tools for Windows, macOS and Linux
A Repo For Document AI