Abstract:
Apache Tika is used to detect and extract the text from varying file formats. It uses Detector and Parser for the same, as with the name, former is used to detect the content Type of the file and latter is used to parse the text content. Oak uses default Tika config. (XML file defining the Detector and Parser used).
This post illustrates
Blog content:
https://myaemlearnings.blogspot.com/2020/06/apache-tika-config-in-lucene-index-and.html
Making this as a featured post.
Views
Replies
Total Likes
Thanks Kautuk
Views
Replies
Total Likes