dude uncomplicated data extraction: A simple framework
Turn entire websites into LLM-ready markdown or structured data
ExtractThinker is a Document Intelligence library for LLMs
CLI tool to extract (meta)data from PDF and manipulate PDF files
Lightweight library for scraping web-sites with LLMs
MD/.JSON Document OCR and structured data extraction API
To extract main article from given URL with Node.js
A high-quality tool for convert PDF to Markdown and JSON
Tools to build web AI agents that can authenticate
Flexible Node.js AI-assisted crawler library
Clean network diagrams, One-time setup, zero upkeep
ContextGem: Effortless LLM extraction from documents
Unreal Engine Archives Explorer
Automatic extraction of relevant features from time series
Automate browser-based workflows with LLMs and Computer Vision
Model Context Protocol server that integrates AgentQL's data
Make websites accessible for AI agents
A library for audio and music analysis, feature extraction
A chrome extension for automating your browser by connecting blocks
Zero-copy PDF text extraction library written in Zig
Declarative web scraping
Extract internal monitoring data from application logs
Alternative to Google Analytics that gives you full control over data
A Python tool to help extracting information from structured PDFs