Text pre-extraction in AEM is highly recommended when indexing or reindexing Lucene indexes on repositories with large binaries that contain extractable text (e.g. PDFs, Word docs, PPTs, TXT files). Reindexing Lucene indexes directly is very expensive, because text has to be extracted from every binary during the indexing run, and may cause performance issues. After completing this tutorial you will understand:
- Text pre-extraction overview.
- When to use text pre-extraction in AEM.
- When not to use text pre-extraction in AEM.
- Prerequisites for using text pre-extraction.
- How to execute text pre-extraction.
- How to validate Oak index consistency.
Interested in learning how to execute text pre-extraction and what its advantages are?
Visit:
http://www.aemcq5tutorials.com/tutorials/adobe-aem-cq5-tutorials/text-pre-extraction/