A fast, high-level web crawling and web scraping framework
Elegant Scraper and Crawler Framework for Golang
Zero-copy PDF text extraction library written in Zig
Shower HTML presentation engine
Simple drawings using vector graphics; Cairo "for tourists!"
LaTeX CV generator from a YAML/JSON input file
CircuiTikZ TeX/LaTeX package for drawing circuits
An ebook reader application supporting PDF, DjVu, EPUB, FB2, etc.
Node canvas is a Cairo backed Canvas implementation for NodeJS
Converts books written in Markdown to HTML, LaTeX/PDF and EPUB
MD/.JSON Document OCR and structured data extraction API
Self-hosted collection of powerful web-based tools for everyday tasks
Open-source platform for extracting structured data from documents
Convert Python notebook to web app and share with non-technical users
Parse files for optimal RAG
Split and merge PDF files on any platform
Open-Source Python3 tool for recognizing layouts, tables, and math
Machine learning software to solve data mining problems
An automated tool to fetch data from income tax websites
Video-based AI memory library. Store millions of text chunks in MP4
Declarative vector graphics
Improved Lecture Notes in Computer Science (LNCS) template
The Ray Tracing in One Weekend series of books
LaTeX template for USTC thesis