Apache Tika for PDF

Hi @HO_Tuong_Vinh,

There are a few different files you can put in your repository to specify which Python packages need to be installed. See: App dependencies - Streamlit Docs

The simplest is a requirements.txt file with one line for each Python package (e.g. tika). It looks like that package requires Java, so you might also need a packages.txt file to make sure Java gets installed as well. See this answer: Java_not_found_error - #2 by randyzwitch
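
In case it helps, here is a rough sketch of how the pieces might fit together. The file contents shown in the comments are assumptions on my part: in particular, `default-jre` is just one Debian package that provides Java, so check the linked answer for the exact name to use. The helper names `java_available` and `extract_pdf_text` are made up for illustration.

```python
# requirements.txt (one Python package per line), e.g.:
#   tika
#
# packages.txt (system packages, one per line), e.g. (assumed package name):
#   default-jre

import shutil


def java_available() -> bool:
    """Return True if a `java` executable can be found on the PATH."""
    return shutil.which("java") is not None


def extract_pdf_text(path: str) -> str:
    """Extract text from a PDF with tika, failing early if Java is missing."""
    if not java_available():
        raise RuntimeError(
            "Java was not found on the PATH; tika needs it to run its server."
        )
    # Imported here so the Java check above works even without tika installed.
    from tika import parser

    parsed = parser.from_file(path)
    # "content" can be None when no text could be extracted.
    return parsed.get("content") or ""
```

You could then call something like `extract_pdf_text("my_file.pdf")` from your app (the file name is only a placeholder). The early Java check just gives a clearer error message than the one tika raises when it cannot start.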