User interface similar to ChatGPT for question-answering, designed to handle hundreds of pre-configured PDFs, using Llama 2 available on local machine

I aim to develop a Streamlit application featuring a ChatGPT-like user interface for question-answering. The app will handle hundreds of pre-configured PDFs using the quantized version of Llama 2. However, a challenge is that each time a question is asked, it creates the embedding and loads the model.