OCR API

Avatar

Level 1

We have a license for LiveCycle, and I'm trying to find documentation on Adobe's OCR capabilities.

The goal is to create a service that receives a file (image or PDF), runs OCR on it, and returns the text it contains. If a PDF was passed in, it would be nice to return a PDF with the newly derived text embedded in it.

I guess I want to know:

- Does Adobe have an API for me to use (most likely from C#) that will allow me to OCR a PDF/image?

- If Adobe does OCR, can it embed the derived text?

- Where can I find documentation/examples to do any of this?

Thanks!


Avatar

Level 2

Moved from Acrobat to the LiveCycle forums - you cannot use Acrobat as part of an automated service.

Avatar

Level 8

You can use Acrobat in an automated service as long as it's licensed properly. You mention that you have a LiveCycle license but not which components it covers. If you have PDF/G, it has an API that can OCR an image and return the resulting text of the scanned document. The API documentation is located here: http://help.adobe.com/en_US/livecycle/10.0/ProgramLC/javadoc/
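
For the service described in the question (file in, text out, plus a searchable PDF when the input was a PDF), here is a minimal sketch of the dispatch logic in Java, the language of the Javadoc linked above. To be clear about what is assumed: `OcrEngine`, `OcrResult` and `OcrService` are hypothetical names made up for illustration, and nothing here is the actual PDF Generator API. `OcrEngine` is only a placeholder for whatever client call ends up doing the OCR. The PDF check relies on the `%PDF-` marker at the start of the file, which is a simplification.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical placeholder for the real OCR call; NOT part of any Adobe API.
interface OcrEngine {
    // Returns the text recognized in an image or PDF.
    String extractText(byte[] content);

    // Returns a copy of the PDF with the recognized text embedded.
    byte[] makeSearchablePdf(byte[] pdf, String text);
}

// What the service hands back: text always, searchable PDF only for PDF input.
final class OcrResult {
    final String text;
    final byte[] searchablePdf; // null when the input was an image

    OcrResult(String text, byte[] searchablePdf) {
        this.text = text;
        this.searchablePdf = searchablePdf;
    }
}

final class OcrService {
    private static final byte[] PDF_MAGIC =
            "%PDF-".getBytes(StandardCharsets.US_ASCII);

    private final OcrEngine engine;

    OcrService(OcrEngine engine) {
        this.engine = engine;
    }

    // Treats the input as a PDF if it starts with the ASCII bytes "%PDF-".
    static boolean isPdf(byte[] content) {
        return content.length >= PDF_MAGIC.length
                && Arrays.equals(Arrays.copyOf(content, PDF_MAGIC.length), PDF_MAGIC);
    }

    // Runs OCR on any input; embeds the text only when the input is a PDF.
    OcrResult process(byte[] content) {
        String text = engine.extractText(content);
        byte[] pdf = isPdf(content) ? engine.makeSearchablePdf(content, text) : null;
        return new OcrResult(text, pdf);
    }
}
```

Keeping the OCR call behind an interface like this means the same service code can be exercised with a fake engine before the real PDF Generator client is wired in.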

Avatar

Level 1

Thanks, TundraSteve! We have a license for LiveCycle PDF Generator (and Policy Server, for the record). Is that the same as PDF/G?

Avatar

Level 1

After looking at this page, http://www.adobe.com/products/eula/livecycle/multicorepolicy/, I believe PDFG (which Steve referred to) is the PDF Generator license.

Great news.

Avatar

Level 1

Hi Tundra, could you please point us to the exact PDF Generator API that can help us achieve this? Thanks.

Abhishek