SOLVED

Adobe Accessibility - Tagged PDFs - Accessing PDF form data


Level 1

Hi,

I am an Adobe newbie and am trying to understand the purpose of tagged PDFs. I understand that to ensure accessibility a PDF file must contain embedded PDF tags, e.g. so that screen readers can identify and describe content. I have a couple of questions about accessing the data contained within a PDF business document.

For example, we generate invoices from our SAP system using SAP Adobe LiveCycle Designer. Some customers receive the invoice electronically, e.g. via FTP or email, and some of them wish to extract the data from it. Could the PDF tag structure embedded in the form be used to extract data from the invoice? Is this a potential use of PDF tags? I have also read about XML embedded in a PDF. Is this different from the embedded PDF tags? I assume it is. If so, would using the embedded XML be the preferred method of extracting form data?

Any advice would be much appreciated.

Rob F


6 Replies


Former Community Member

Assuming that you are generating a form out of SAP, there are tools within Acrobat/Reader that allow you to export the data from the form. It will come out in an XML format that follows the schema defined by your SAP system. In Acrobat/Reader 10 the menu item is located under Tools > Forms > More Form Options > Manage Form Data.
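Once the data has been exported, a customer could read it with any XML parser. Below is a minimal Python sketch of that idea. The element names (INVOICE, NUMBER, CUSTOMER, ITEMS, ITEM, DESCRIPTION, AMOUNT) and the sample values are hypothetical; the real names depend on the schema defined by your SAP form, so treat this as an illustration only.

```python
import xml.etree.ElementTree as ET
from decimal import Decimal

# Hypothetical exported form data. Real element names and nesting
# come from the schema defined by the SAP form, not from this sketch.
EXPORTED_XML = """
<data>
  <INVOICE>
    <NUMBER>4711</NUMBER>
    <CUSTOMER>ACME Ltd</CUSTOMER>
    <ITEMS>
      <ITEM><DESCRIPTION>Widget</DESCRIPTION><AMOUNT>19.99</AMOUNT></ITEM>
      <ITEM><DESCRIPTION>Gadget</DESCRIPTION><AMOUNT>5.01</AMOUNT></ITEM>
    </ITEMS>
  </INVOICE>
</data>
"""


def read_invoice(xml_text):
    """Return the invoice header, line items, and total from exported XML."""
    root = ET.fromstring(xml_text.strip())
    invoice = root.find("INVOICE")
    if invoice is None:
        raise ValueError("no INVOICE element found in exported data")

    # Collect each line item as (description, amount).
    items = [
        (item.findtext("DESCRIPTION"), Decimal(item.findtext("AMOUNT")))
        for item in invoice.iterfind("ITEMS/ITEM")
    ]
    return {
        "number": invoice.findtext("NUMBER"),
        "customer": invoice.findtext("CUSTOMER"),
        "items": items,
        "total": sum((amount for _, amount in items), Decimal("0")),
    }


if __name__ == "__main__":
    print(read_invoice(EXPORTED_XML))
```

Note that nothing here touches the PDF tags; it only works with the exported data file.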

Paul


Level 1

Thanks for your reply, Paul. Yes, I have read about the embedded XML within the SAP-generated forms. So would this be the recommended method of accessing the data, rather than using the PDF tag structure?


Former Community Member

Yes. The tag structure is used by screen readers when reading the form and does not affect the data structure. The data structure is controlled by the binding expressions on the fields, which can in turn be driven by schema definitions or sample XML data files.

Paul


Level 1

Hi Paul,

Thanks for your reply. Just another, hopefully quick, question: with a form designed using LCD, could you switch off the PDF tags but retain the embedded XML? I.e. is there a dependency between the two?

Thanks and regards

Rob


Correct answer by
Former Community Member

There is no dependency between the two, so yes, you can disable accessibility and still get the data out.

Paul


Level 1

Thanks Paul. That's of great help!
