Your achievements

Level 1

0% to

Level 2

Tip /
Sign in

Sign in to Community

to gain points, level up, and earn exciting badges like the new
Bedrock Mission!

Learn more

View all

Sign in to view all badges

OCR API

Chaos_83
Level 1
Level 1

We have a license for LiveCycle, but I'm trying to find documentation regarding the OCR capabilities that Adobe has.

The goal is as follows: to create a service that receives a file (image or pdf), runs OCR on it and then returns the text within. If it was a PDF passed in then it would be nice to return a PDF with the newly derived text embedded within the PDF.

I guess I want to know:

- Does Adobe have an API for me to use (most likely from C#) that will allow me to OCR a PDF/image?

- If Adobe does OCR, can it embed the derived text?

- Where can I find documentation/examples to do any of this?

Thanks!

0 Replies
Dave_Merchant
Level 2
Level 2

Moved from Acrobat to the LiveCycle forums - you cannot use Acrobat as part of an automated service.

TundraSteve
Level 8
Level 8

You can use Acrobat in an automated service as long as it's licensed properly.  You mention you have a license of LiveCycle but don't mention which components.  If you have PDF/G it does have an API and does provide the ability to OCR an image and can return the resulting text of the scanned document.  The API documentation is located here http://help.adobe.com/en_US/livecycle/10.0/ProgramLC/javadoc/

Chaos_83
Level 1
Level 1

Thanks TundraSteve! We have a licence for LiveCycle PDF Generator (and Policy Server ftr), is that the same as PDF/G?

Chaos_83
Level 1
Level 1

After looking at this page: http://www.adobe.com/products/eula/livecycle/multicorepolicy/ I believe PDFG (that Steve referred to) is the PDF Generator license.

Great news.

abhishekk603607
Level 1
Level 1

Hi Tundra - Could you please guide us to the exact API from PDF Generator that can help us achieve this. Thanks.

Abhishek