24 Oct 25
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing. The pdf-to-markdown GitHub repository hosts a tool designed to convert PDF files into Markdown format for easier text extraction and reformatting, with the process running locally on the user’s machine.
🪄 Create rich visualizations with AI. Data-Formulator is a Microsoft-developed Python library available on GitHub designed for simple and efficient data generation and transformation, facilitating tasks like creating synthetic data and preparing datasets for analysis.
Orbidium is an open-source application demo that displays asteroid orbits using data parsed from the NASA Minor Planet Center (MPC) database, featuring basic 2D rendering and parsing of the MPC data file.