OCR API

Level 1

We have a license for LiveCycle, but I'm trying to find documentation regarding the OCR capabilities that Adobe has.

The goal is to create a service that receives a file (image or PDF), runs OCR on it, and returns the extracted text. If the input was a PDF, it would be nice to also return a PDF with the newly derived text embedded in it.
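For illustration only, the service described here could be shaped roughly like the Java sketch below. Everything in it is an assumption made for the example: `OcrBackend`, `OcrResult`, and `process` are hypothetical names, not part of any Adobe or LiveCycle API, and the real OCR engine would sit behind the `OcrBackend` interface. The only hard fact it relies on is that PDF files begin with the bytes `%PDF-`.

```java
import java.nio.charset.StandardCharsets;
import java.util.Optional;

class OcrServiceSketch {

    /** Hypothetical wrapper around whichever OCR engine is licensed; NOT an Adobe API. */
    interface OcrBackend {
        /** Runs OCR on an image or PDF and returns the recognized text. */
        String extractText(byte[] file);

        /** Returns a copy of the PDF with the recognized text embedded. */
        byte[] makeSearchablePdf(byte[] pdf);
    }

    /** Hypothetical result type: text is always set, the PDF only for PDF input. */
    static final class OcrResult {
        final String text;
        final Optional<byte[]> searchablePdf;

        OcrResult(String text, Optional<byte[]> searchablePdf) {
            this.text = text;
            this.searchablePdf = searchablePdf;
        }
    }

    /** Detects a PDF by its leading "%PDF-" signature. */
    static boolean isPdf(byte[] data) {
        byte[] magic = "%PDF-".getBytes(StandardCharsets.US_ASCII);
        if (data == null || data.length < magic.length) {
            return false;
        }
        for (int i = 0; i < magic.length; i++) {
            if (data[i] != magic[i]) {
                return false;
            }
        }
        return true;
    }

    /** Always returns the text; adds a searchable PDF only when the input was a PDF. */
    static OcrResult process(byte[] file, OcrBackend backend) {
        String text = backend.extractText(file);
        Optional<byte[]> pdf = isPdf(file)
                ? Optional.of(backend.makeSearchablePdf(file))
                : Optional.empty();
        return new OcrResult(text, pdf);
    }
}
```

Keeping the engine behind an interface means the same service shape works whether the OCR is done by a server product or something else, and it can be tested with a stub backend.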

I guess I want to know:

- Does Adobe have an API for me to use (most likely from C#) that will allow me to OCR a PDF/image?

- If Adobe does OCR, can it embed the derived text?

- Where can I find documentation/examples to do any of this?

Thanks!

5 Replies

Level 2

Moved from Acrobat to the LiveCycle forums - you cannot use Acrobat as part of an automated service.

Level 8

You can use Acrobat in an automated service as long as it's licensed properly. You mention you have a license for LiveCycle but don't say which components. If you have PDF/G, it does have an API that can OCR an image and return the resulting text of the scanned document. The API documentation is located here: http://help.adobe.com/en_US/livecycle/10.0/ProgramLC/javadoc/

Level 1

Thanks TundraSteve! We have a licence for LiveCycle PDF Generator (and Policy Server ftr). Is that the same as PDF/G?

Level 1

After looking at this page: http://www.adobe.com/products/eula/livecycle/multicorepolicy/, I believe PDFG (which Steve referred to) is the PDF Generator license.

Great news.

Level 1

Hi Tundra - Could you please point us to the exact PDF Generator API that can help us achieve this? Thanks.

Abhishek