Process and analyze multi-page PDF documents to extract and index content.
Learn the concepts
Interact with it live
Build it yourself
See it in action
In this tutorial, we walked through the process of building a Python script that is able to search the contents of PDF files in an Amazon S3 bucket using Apache Tika and OpenSearch.