I need to convert a PDF file in to XML format programmatically in Java. Can any one please provide any pointers?
Thanks in advance.
Thanks Jasmin for a prompt response. I want to extract the data and meta data (font size etc), images in the same format. We basically have to convert the PDF to our own properitory XML format for further processing. Is that possible? I would hope that many people would have similar needs in the past and must have done it successfully.
If conversion to direct XML is not possible, then one way I can think of is to first convert it in to HTML format and then from HTML, convert it in to XML format. But then the same question remains - how to convert PDF to HTML ?
Take a look at the ExportPDF( ) operations in GeneratePDF service.
Input Document is the PDF. You should be able to choose the format type to be XML.
Is there any solution to convert PDF to XML without server side Java/.Net?
I need pure Actionscript solution with help of some library either open source or licensed.
Any help will be appreciated very much.
-- Maksym Melnishyn.