I would recommend using Apache Tika to extract the text from the PDFs and using ...

		0bit on Aug 26, 2022 \| parent \| context \| favorite \| on: YaCy – your own search engine I would recommend using Apache Tika to extract the text from the PDFs and using Solr (or Elasticsearch) to index and search them.