pdf2xml convertor based on Xpdf library (http://www.foolabs.com/xpdf/home.html). It converts information contained in a PDF file into XML. First, you need to install xpdf and libxml2 (see documentation).
Hervé Déjean
Xerox Research Centre Europe

http://www.xrce.xerox.com/About-XRCE/People/Herve-Dejean

Features

  • pdf to xml conversion
  • text extraction
  • vectorial instruction extraction

Project Activity

See All Activity >

Categories

XML, Topic, Cataloguing

License

GNU General Public License version 2.0 (GPLv2)

Follow pdf2xml

pdf2xml Web Site

Other Useful Business Software
Auth for GenAI | Auth0 Icon
Auth for GenAI | Auth0

Enable AI agents to securely access tools, workflows, and data with fine-grained control and just a few lines of code.

Easily implement secure login experiences for AI Agents - from interactive chatbots to background workers with Auth0. Auth for GenAI is now available in Developer Preview
Try free now
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
10
1
0
0
1
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5

User Reviews

  • The link for the SVN code is not working i want to integrate this functionality in my java project , please provide valid link
  • Thanks very good project! +
  • Used on the irs f1040.pdf to produce f1040.xml; however, when viewed in firefox, firefox indicated it had no styling; hence, it didn't look anything like the pdf file when viewed by adobe reader.
  • Very useful, a must-have program. Great job!
  • Simple, no fuss. works for all types
Read more reviews >

Additional Project Details

Operating Systems

MinGW/MSYS2, Linux, BSD, Windows

Intended Audience

Information Technology, Developers, End Users/Desktop

User Interface

Command-line

Programming Language

C++

Related Categories

C++ XML Software, C++ Topic Software, C++ Cataloguing Software

Registered

2007-07-11