Showing 32 open source projects for "pdf meta"

  • 1
    pdfly

    CLI tool to extract (meta)data from PDF and manipulate PDF files

    A Python library designed for manipulating PDF files with functionalities for extraction, transformation, and document generation.
    Downloads: 7 This Week
  • 2
    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a pure Python library to read, write, and manipulate PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries, and primitives (numbers, strings, booleans, etc.). This is currently a one-man project, so the focus will always be on supporting common use cases over rare ones.
    Downloads: 4 This Week
  • 3
    psychmeta

    Psychometric meta-analysis toolkit

    ...Documentation for psychmeta’s functions is available in the package’s PDF manual. Includes tools for converting effect sizes, computing sporadic artifact corrections, reshaping meta-analytic databases, computing multivariate corrections for range variation, and more.
    Downloads: 0 This Week
  • 4
    PyMuPDF

    Python bindings for MuPDF's rendering library.

    MuPDF is a lightweight PDF, XPS, and E-book viewer. MuPDF consists of a software library, command line tools, and viewers for various platforms. The renderer in MuPDF is tailored for high-quality anti-aliased graphics. It renders text with metrics and spacing accurate to within fractions of a pixel for the highest fidelity in reproducing the look of a printed page on the screen. The viewer is small, fast, yet complete. It supports many document formats, such as PDF, XPS, OpenXPS, CBZ, EPUB,...
    Downloads: 11 This Week
  • 5
    ExifTool

    ExifTool meta information reader/writer

    ExifTool is a battle-tested Perl application for reading, writing, and batch-editing metadata in thousands of file types—images, videos, audio, documents, and more. It understands major standards like EXIF, IPTC, and XMP as well as an enormous range of camera maker notes and container formats (for example, QuickTime/MP4, PDF, TIFF). Typical workflows include extracting metadata to JSON/CSV/XML, renaming files from timestamps or tags, shifting capture times, copying tags between files, and...
    Downloads: 87 This Week
  • 6
    exiftool

    Read, write and edit metadata

    Perl library and command-line application for reading, writing and editing meta information (EXIF, IPTC, XMP, and more) in a wide variety of file formats (JPEG, TIFF, PNG, PDF, RAW, and more).
    Downloads: 35,811 This Week
  • 7
    XeroArch

    XeroArch is a Linux distro made for ease of use and simplicity

    Well configured, with just the necessary software for things like audio, Bluetooth, Wi-Fi, and Nvidia or AMD drivers. It includes custom repositories and a few extra useful utilities such as a PDF viewer, timeshift, AUR helpers, supergfxctl (switch graphics card modes), and switcheroo-control (run specific programs with the discrete GPU). XeroArch is an Arch Linux distro made for simplicity and everyday ease of use. If you want to play video games, you just have to install our custom package with all the Wine dependencies and gaming applications for a proper gaming experience by running "sudo pacman -S xeroarch-gaming-meta" in your terminal. ...
    Downloads: 3 This Week
  • 8
    CERCA

    CERCA – Citation Extraction & Reference Checking Assistant

    CERCA is an open-source research tool that supports the verification of bibliographic references in scientific manuscripts. It extracts references from PDF files and checks their existence and consistency against authoritative metadata sources, producing explainable diagnostics, audit logs, and reproducible reports. It is intended for researchers performing final manuscript checks, reviewers assessing reference consistency, editors supporting editorial quality control, and meta-research and reproducibility workflows. CERCA is an experimental tool. ...
    Downloads: 4 This Week
  • 9
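    fast-full-text-search (快速全文搜索): 1. No index required. 2. Universal search: supports compressed files, docx, pdf, image/video metadata, and all text formats. 3. Supports multi-keyword and regular-expression search. 4. Supports file-type filtering to further speed up searches.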
    Downloads: 0 This Week
  • 10
    WA2L/WinTools

    End User Tools for Windows.

    Some end-user utilities for the Windows operating system. The utilities can be called through the "Send To" context menu when right-clicking a file or directory in Explorer, or through the Windows "Start Menu". The package can be installed as a portable application and does not need admin rights. ◆ UTILITIES - https://sourceforge.net/projects/wa2l-wintools/files/ → README ◆ FEATURES - https://wa2l-wintools.sourceforge.net/man1/wintools.1.html -...
    Downloads: 5 This Week
  • 11
    Kiwix

    Wikipedia offline & more

    Kiwix is an offline reader for web content. It is especially intended to make Wikipedia available offline. With Kiwix, you can enjoy Wikipedia on a boat, in the middle of nowhere... or in jail. Kiwix does this by reading ZIM files, a highly compressed open format with additional metadata.
    Downloads: 133 This Week
  • 12
    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    ...Now you can add -tags ocr to any go command when building/fetching/testing docconv to include support for processing images. Documents can be sent as a multipart POST request and the plain text (body) and meta information are then returned as a JSON object.
    Downloads: 2 This Week
  • 13
    BlueSpice free (Support archive)

    Our support forum has moved: community.bluespice.com

    This freely available open-source software turns Wikipedia’s popular software engine MediaWiki into a fully-fledged enterprise wiki solution. Companies can continue to enjoy MediaWiki’s numerous advantages and automation capabilities; with BlueSpice, they can now work more comfortably, safely, and effectively. Compared with basic MediaWiki, BlueSpice provides, among others, the following enhancements: comfortable and sophisticated rights management capabilities, a visual editor...
    Downloads: 3 This Week
  • 14
    LPub3D

    LDraw™ editor for LEGO® style digital building instructions.

    LPub3D is an Open Source WYSIWYG editing application for creating LEGO® style digital building instructions. LPub3D is developed and maintained by Trevor SANDY. It uses the LDraw™ parts library, the most comprehensive library of digital Open Source LEGO® bricks available (www.ldraw.org/ ) and reads the LDraw LDR and MPD model file formats. LPub3D is available for free under the GNU Public License v3 and runs on Windows, Linux and macOS Operating Systems. LPub3D is also...
    Downloads: 26 This Week
  • 15
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives, and other difficult-to-access resources. It is a command-line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies, and web resources. CRGREP will search resources within resources in any combination and to any depth, for example text within a document within a zip archive. Here you will find binary downloads and discussion (https://sourceforge.net/p/crgrep/discussion/). ...
    Downloads: 2 This Week
  • 16
    TemaTres: controlled vocabulary server

    Manage, Publish and Share Ontologies, Taxonomies, Thesauri, Glossaries

    Web application for managing formal representations of knowledge, thesauri, taxonomies, and multilingual vocabularies. For the latest version of the code: https://github.com/tematres/TemaTres-Vocabulary-Server
    Downloads: 4 This Week
  • 17
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data quality includes profiling, filtering, governance, similarity checks, data enrichment and alteration, real-time alerting, basket analysis, bubble charts, warehouse validation, single customer view, etc., as defined by strategy. This tool is developing a high-performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 7 This Week
  • 18
    i-Map - Plot Geolocation from Images

    Automatically plots latitude, longitude from images on Google maps.

    i-Map is a photo metadata forensic tool for geolocation analysis of images taken with GPS-enabled devices. In this tool, you can load hundreds of images from a suspect's device and analyze them to find the various locations where the photos were taken on a mobile phone or tablet. After loading the images, with a single click, i-Map plots them all on a world map so you can visually check where they were captured, generate a timeline of the suspect's activity, and match them with CDR (Call Detail...
    Downloads: 0 This Week
  • 19
    OpenSearchServer Extractor

    A RESTful/JSON web service for text and metadata extraction

    An open source RESTful web service for text and metadata extraction and analysis. oss-text-extractor supports various binary formats: word processor (doc, docx, odt, rtf), spreadsheet (xls, xlsx, ods), presentation (ppt, pptx, odp), publishing (pdf, pub), web (rss, html/xhtml), media (audio, images), and others (vsd, text).
    Downloads: 0 This Week
  • 20
    Crème Fraiche

    eml2pdf converter

    I no longer claim platform independence for Crème Fraiche; this program runs on Linux. Crème Fraiche transforms EML files, as created by email clients, to PDF. Please see the rubygems.org site for updates, or use the gem tool right away to install Crème Fraiche: ~$ gem install cremefraiche
    Downloads: 0 This Week
  • 21

    SearchServer2

    Local and Remote Meta Search Engine

    SearchServer2 creates an index of your local filesystem, indexes RSS feeds and text files, and sends remote requests to other search engines such as Google or YouTube. SearchServer2 is a standalone web server written in .NET and contains a web page for searching. Content such as 3D parts, images, movies (YouTube and local), and audio plays directly in the browser window. Movies and PDFs are shown as thumbnails.
    Downloads: 0 This Week
  • 22

    SE Auditor

    Free SEO audit software.

    ...SE Auditor is an application that you can use to view statistical data about your website in order to improve its position in web search results. SE Auditor is aimed at SEO professionals, website designers, developers, website testers, and owners. SE Auditor enables you to check the meta description, keywords, sitemap, the number of links and keyword consistency, the text/HTML ratio, and many more ranking, usability, and social factors. Unlike other programs of this type, SE Auditor is free; is fast, so it does not waste your time (the audit process runs smoothly); has a simple interface that you can understand at a glance; shows the data that matters for optimizing your site; can generate a PDF report with hints based on the collected data; and is accurate (for example, in counting words and characters and calculating the text/HTML ratio).
    Downloads: 0 This Week
  • 23

    ScientificPdfParser

    Parses scientific articles from PDF and marks the meta data.

    Parses PDF files of scientific articles using naive Bayes and sophisticated heuristics. The output is an XML file that contains the parsed data. Metadata is detected and marked as such. The metadata contains the following elements: title, authors, abstract, text, headlines, enumerations, and references (literature). In the first step, the text elements are divided into blocks (similar to paragraphs); after that, predictions are made for each element. ...
    Downloads: 0 This Week
  • 24
    This program can protect PDF documents and change the metadata of PDF files. The program uses the iText library.
    Downloads: 0 This Week
  • 25
    Paper collection is a database system for research papers (PDF). It stores PDF files and meta information, such as author and publication date, to help with finding and managing papers.
    Downloads: 0 This Week