ContextGem: Effortless LLM extraction from documents
dude uncomplicated data extraction: A simple framework
lightweight Go package to parse, analyze and extract metadata
File Parser optimised for LLM Ingestion with no loss
A distributed job server
A versatile toolkit for PDF manipulation
A machine learning software for extracting information
A high-quality tool for convert PDF to Markdown and JSON
Assist in organizing your piles of documents
Python & command-line tool to gather text on the Web
AI video agents framework for next-gen video interactions
Python binding to the Apache Tika™ REST services
A Model Context Protocol server for searching and analyzing arXiv
Uncommon Objects in 3D dataset
A Repo For Document AI
Streaming downloads using Net::HTTP, http.rb or HTTPX
A youtube-dl fork with additional features and fixes
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Extracts Data Types Like Email Addresses From All Kinds Of Files
YoungerSibling: Cross-platform OSINT tool for quick data gathering.
Award-winning modern data processing SDK in C++20
Fast and lightweight image viewer
A powerful, free and open-source tool for extracting frames and animat
Editor with scripting language, security features & system interfaces.
Automatic enrichment, enhancement, and explanation of your data