Assets searched with the text in pdf file

Level 3

Hi Team,

I want to search PDF assets by their contents in AEM as a Cloud Service. For example, if my xyz.pdf file contains the text "Avocado Breakfast" and I search in DAM for "Avocado Breakfast", I should get this PDF in the results.

Just FYI, the PDF has no "Avocado Breakfast" tag assigned to it in DAM.

Thanks,

SD

2 Replies

Community Advisor

The OOTB AEM DAM search should return results (including PDFs) based on the text content of the PDF. So if you search for "Avocado Breakfast", xyz.pdf should come up in the search results.

"Behind the scenes, Apache Lucene fetches the documents in the repository and indexes the content based on the metadata and text content. The index update thread wakes up every five seconds looking for content updates. Apache Lucene uses Apache Tika, a content analysis tool, to get the internal detail of documents like metadata and text in the document to create the indexes."

 

https://blogs.perficient.com/2017/05/08/indexing-bogging-aem-down-disable-apache-tika/ 
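
To check whether the text inside the PDF has actually been indexed, it can help to run a full-text query directly instead of going through the Assets UI. Below is a minimal sketch that only builds a QueryBuilder URL using the `path`, `type`, `fulltext` and `p.limit` predicates. The host `http://localhost:4502`, the function name and the default DAM path are assumptions for illustration; on a cloud environment the request would also need authentication, which is not shown here.

```python
from urllib.parse import urlencode


def build_fulltext_query_url(host, text, dam_path="/content/dam", limit=10):
    """Build a QueryBuilder JSON URL that full-text searches DAM assets.

    `host` and `dam_path` are placeholders (assumptions); adjust them
    to match your own environment.
    """
    params = {
        "path": dam_path,       # restrict the search to the DAM tree
        "type": "dam:Asset",    # only return asset nodes
        "fulltext": text,       # match against indexed text, including extracted PDF text
        "p.limit": str(limit),  # cap the number of hits returned
    }
    return f"{host.rstrip('/')}/bin/querybuilder.json?{urlencode(params)}"


# Hypothetical local author instance, used only as an example host.
url = build_fulltext_query_url("http://localhost:4502", "Avocado Breakfast")
print(url)
```

If opening the printed URL in a logged-in browser session returns xyz.pdf, the extracted text is in the index and the issue is more likely on the search UI side. If it returns nothing, the PDF text was probably not extracted or indexed.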

Level 3

I am searching by the text content, but the PDF is not searchable that way by default. How can I resolve this?