@dwazirl You can explore Apache Tika, for text extraction and indexing.If content search in main feature of site, will recommend you to go for external search engine SOLR or Elastic Search.Solr is an enterprise grade, secure, highly scalable, open-source NoSQL search platform from the Apache Lucene...