extracting data from a pdf form

Level 2

Hi,

LiveCycle ES2, Workbench 9.0

I'm new to Workbench and have a problem extracting data from a PDF form submitted to a short-lived process.

I have set up the following very simple process:

default startpoint > ProcessForm > exportData > set value > set value > Write Document

The intention is to update the document and write it to disk. So far, each step works except for exportData, where I cannot get the PDF data extracted to XML.

The input to the exportData step is a variable (myDoc), Data Type: Document, created from the incoming PDF form.

If I write out myDoc it is an exact copy of the incoming document, so I guess the start and finish steps of the process are OK.

The incoming PDF form I was given had no data schema, but I thought I could access the form data by exporting it to an xml variable:

Service: FormDataIntegration / exportData

  input (PDF Document)       variable: myDoc

  output (Data extracted)    variable: myXMLData

Then, in the next step (set value), I access the XML element I am after:

Mappings

Location:  /process_data/@groupId      Expression: /process_data/myXMLData/xdp/datasets/data/form1/mainPage/groupId
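
One thing that may be worth checking with a path like this is XML namespaces. The sketch below is only an illustration in Python, outside of Workbench: the sample document is a hypothetical guess at what exported XDP data could look like (the value 12345 and the exact nesting are assumptions, not real output from exportData). It shows that an unprefixed path does not match namespaced elements such as xfa:datasets, while a namespace-aware path does.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample of exported form data. The real output of the
# exportData step may be structured differently.
SAMPLE = """<xdp:xdp xmlns:xdp="http://ns.adobe.com/xdp/">
  <xfa:datasets xmlns:xfa="http://www.xfa.org/schema/xfa-data/1.0/">
    <xfa:data>
      <form1>
        <mainPage>
          <groupId>12345</groupId>
        </mainPage>
      </form1>
    </xfa:data>
  </xfa:datasets>
</xdp:xdp>"""

# Prefix-to-URI map used only for the namespace-aware lookup.
NS = {
    "xdp": "http://ns.adobe.com/xdp/",
    "xfa": "http://www.xfa.org/schema/xfa-data/1.0/",
}


def find_text(xml_text, path, namespaces=None):
    """Return the text of the first element matching path, or None."""
    root = ET.fromstring(xml_text)  # root element is xdp:xdp
    node = root.find(path, namespaces)
    return None if node is None else node.text


# Unprefixed steps do not match elements that live in a namespace.
unprefixed = find_text(SAMPLE, "datasets/data/form1/mainPage/groupId")

# Prefixed steps resolve through NS and reach the element.
prefixed = find_text(
    SAMPLE, "xfa:datasets/xfa:data/form1/mainPage/groupId", NS
)

print(unprefixed, prefixed)
```

If the exported data turns out not to be wrapped in xdp and datasets elements at all, the same check applies in reverse: the path has to mirror whatever structure the variable actually contains.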

This did not work, so I took the incoming form, exported the form data to an XML file, and created a schema using Stylus Studio. I then imported that into the myXMLData definition. (BTW, do I need to specify the root node after importing it?)

Still not working!

Extra info: the XML view of my incoming form shows I have a minimal dataset definition. Is this OK?


<connectionSet xmlns="http://www.xfa.org/schema/xfa-connection-set/2.8/">
   <?originalXFAVersion http://www.xfa.org/schema/xfa-connection-set/2.4/?></connectionSet>
<xfa:datasets xmlns:xfa="http://www.xfa.org/schema/xfa-data/1.0/">
   <xfa:data xfa:dataNode="dataGroup"/>
</xfa:datasets>

The schema created by Stylus Studio has none of the XFDF or XFA settings I have seen in other schemas. Is this OK?

Any help getting this fixed would be greatly appreciated.

thanks

steve

2 Replies

Level 4

Could you please post the PDF and the XML extract from the exportData step? I ran into an issue like this earlier today and want to check whether it is the same one. If it is, I may be able to help you!

Level 2

Hey, thanks for the offer, but I am now sorted after finding a simple working example online.

It covers a process similar to the one I am working on, and it is clearly described and easy to follow:

http://eslifeline.wordpress.com/2009/04/25/extracting-data-from-signed-pdf-using-livecycle-server/

Girish Bedekar, thank you!