


Hi Team,
I want to search PDF assets by their contents in AEM Cloud. For example, if my xyz.pdf file contains the text "Avocado Breakfast" and I search the DAM for "Avocado breakfast", I should get this PDF in the results.
Just FYI, the PDF has no "Avocado breakfast" tag assigned in the DAM.
Thanks,
SD
The OOTB AEM DAM search should return PDFs based on their text content. So if you search for "Avocado Breakfast", xyz.pdf should come up in the results.
"Behind the scenes, Apache Lucene fetches the documents in the repository and indexes the content based on the metadata and text content. The index update thread wakes up every five seconds looking for content updates. Apache Lucene uses Apache Tika, a content analysis tool, to get the internal detail of documents like metadata and text in the document to create the indexes."
https://blogs.perficient.com/2017/05/08/indexing-bogging-aem-down-disable-apache-tika/
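To check whether the PDF text has actually been indexed, it can help to run the same full-text search outside the DAM UI. Below is a minimal sketch that only builds a QueryBuilder JSON URL; it does not call AEM. The host `http://localhost:4502`, the `/content/dam` root, the helper name `build_pdf_fulltext_query`, and the assumption that the PDF's `dc:format` metadata is `application/pdf` are all illustrative and may differ in your environment.

```python
from urllib.parse import urlencode


def build_pdf_fulltext_query(host, text, dam_root="/content/dam", limit=10):
    """Build a QueryBuilder JSON URL that full-text searches PDF assets.

    Hypothetical helper for illustration; host and dam_root are assumptions.
    """
    predicates = {
        "path": dam_root,                 # restrict the search to the DAM tree
        "type": "dam:Asset",              # only asset nodes
        "fulltext": text,                 # full-text term, matched against indexed content
        "property": "jcr:content/metadata/dc:format",
        "property.value": "application/pdf",  # assumed MIME type stored on the asset
        "p.limit": str(limit),            # cap the number of hits returned
    }
    return f"{host}/bin/querybuilder.json?{urlencode(predicates)}"


# Assumed local author instance; replace with your own environment's host.
url = build_pdf_fulltext_query("http://localhost:4502", "Avocado Breakfast")
print(url)
```

Opening the printed URL in a browser session that is already logged in to the author instance should list xyz.pdf if its text was extracted and indexed. If it returns no hits, the problem is more likely on the indexing side than in the search UI.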
I am searching by text content, but the PDF is not searchable by default. How can I resolve this?