Search for text in a PDF

Is there a LiveCycle component that will help me find the pages on which a phrase appears? If so, what is it, and can you provide an example?

1 Reply

Former Community Member

Charles,

Take a look at the DocumentText element in DDX.

<DDX ...>
  <DocumentText mode="WithQuads" result="text.xml">
    <PDF source="inDoc"/>
  </DocumentText>
</DDX>

This example returns text.xml, which is a word-list XML document. By default the output lists the words on each page; setting mode to "WithQuads" makes it output the location of each word instead.
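Once you have text.xml, finding the pages that contain a phrase is a post-processing step outside the DDX. Here is a minimal sketch in Python. It assumes the word list holds one Page element per page with a pageNum attribute; those names, the namespace in the sample, and the pages_containing helper are assumptions for illustration, so check them against the text.xml you actually get back.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample standing in for text.xml; the element names,
# attribute name, and namespace are assumptions, not a confirmed schema.
SAMPLE = """<DocText xmlns="http://ns.adobe.com/DDX/DocText/1.0/">
  <TextPerPage>
    <Page pageNum="1">Annual report for the fiscal year</Page>
    <Page pageNum="2">Nothing relevant here</Page>
    <Page pageNum="3">See the annual
report appendix</Page>
  </TextPerPage>
</DocText>"""


def pages_containing(xml_text, phrase):
    """Return the pageNum values of pages whose text contains the phrase."""
    root = ET.fromstring(xml_text)
    # Normalize case and whitespace so line breaks do not hide a match.
    needle = " ".join(phrase.lower().split())
    hits = []
    for elem in root.iter():
        # Drop any namespace prefix: "{uri}Page" becomes "Page".
        if elem.tag.rsplit("}", 1)[-1] != "Page":
            continue
        # Join all nested text with spaces, then collapse whitespace.
        text = " ".join(" ".join(elem.itertext()).lower().split())
        if needle in text:
            hits.append(elem.get("pageNum"))
    return hits


if __name__ == "__main__":
    print(pages_containing(SAMPLE, "annual report"))
```

The match is case-insensitive and ignores line breaks, so a phrase that wraps across lines on a page is still found. If your output nests each word in its own child element (as it may in "WithQuads" mode), the same function should still work as long as the words sit under a per-page element.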

Steve