docconv

docconv is a Go wrapper library that converts PDF, DOC, DOCX, XML, HTML, RTF, ODT, and Pages documents, and optionally images, to plain text. It ships with docd, a tool that can also run as a service. After installing, make sure the full path to the docd executable is in your PATH environment variable; see go help install for details on where Go places installed executables. Image support is optional: first install and build gosseract, then add -tags ocr to any go command when building, fetching, or testing docconv. When docd runs as a service, documents can be sent as a multipart POST request, and the plain text (body) and meta information are returned as a JSON object.

Features

Converts PDF, DOC, DOCX, XML, HTML, RTF, ODT, and Pages documents to plain text
Optional image support once gosseract is installed and built
Add -tags ocr to any go command when building, fetching, or testing docconv to include support for processing images
The docd tool runs as a service on port 8888
docd can be run locally
docd can accept requests over the network
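
The multipart POST workflow described above can be sketched as a small Go client. This is an illustration under stated assumptions, not the project's own client: the /convert path, the form field name "input", and the JSON key names "body" and "meta" are not given on this page and should be checked against the docconv documentation. Only port 8888 and the multipart POST with a JSON reply come from the description.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"mime/multipart"
	"net/http"
	"os"
	"path/filepath"
)

// convertResult holds the two pieces of the reply the description mentions.
// The JSON key names "body" and "meta" are assumptions.
type convertResult struct {
	Body string                 `json:"body"`
	Meta map[string]interface{} `json:"meta"`
}

// buildUpload wraps a file's bytes in a multipart form body and returns
// the body together with the Content-Type header value to send with it.
// The form field name "input" is an assumption.
func buildUpload(filename string, data []byte) (*bytes.Buffer, string, error) {
	var buf bytes.Buffer
	w := multipart.NewWriter(&buf)
	part, err := w.CreateFormFile("input", filepath.Base(filename))
	if err != nil {
		return nil, "", err
	}
	if _, err := part.Write(data); err != nil {
		return nil, "", err
	}
	// Close writes the trailing boundary; the body is incomplete without it.
	if err := w.Close(); err != nil {
		return nil, "", err
	}
	return &buf, w.FormDataContentType(), nil
}

// parseResult decodes the JSON reply into a convertResult.
func parseResult(raw []byte) (convertResult, error) {
	var r convertResult
	err := json.Unmarshal(raw, &r)
	return r, err
}

// convert reads the file at path, posts it to endpoint, and decodes the reply.
func convert(endpoint, path string) (convertResult, error) {
	data, err := os.ReadFile(path)
	if err != nil {
		return convertResult{}, err
	}
	body, contentType, err := buildUpload(path, data)
	if err != nil {
		return convertResult{}, err
	}
	resp, err := http.Post(endpoint, contentType, body)
	if err != nil {
		return convertResult{}, err
	}
	defer resp.Body.Close()
	raw, err := io.ReadAll(resp.Body)
	if err != nil {
		return convertResult{}, err
	}
	if resp.StatusCode != http.StatusOK {
		return convertResult{}, fmt.Errorf("docd returned %s", resp.Status)
	}
	return parseResult(raw)
}

func main() {
	if len(os.Args) < 2 {
		fmt.Println("usage: client <file>")
		return
	}
	// Assumed URL: docd listening on localhost:8888 with a /convert path.
	res, err := convert("http://localhost:8888/convert", os.Args[1])
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Println(res.Body)
}
```

With docd running locally, the program would be invoked with the path of the document to convert as its only argument. Keeping buildUpload and parseResult separate from the network call makes both halves easy to check without a running service.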

License

MIT License

Additional Project Details

Operating Systems

Windows

Programming Language

Go

Related Categories

Go HTML XHTML, Go PDF Editors

Registered

2023-04-27