Streamlit + LLMs + files = how to avoid re-running the file?

czyzewm · July 27, 2023, 10:16am

Summary

I built a simple app that uploads a PDF file, creates embeddings, connects to an LLM and generates a response.
However, every time I ask a new question, the app reruns everything from the beginning. For large PDF files, creating the embeddings may take several minutes.

I am trying to build logic that creates the embeddings only once and passes the ‘embeddings engine’ to the next function to answer the question. I still haven't found a solution…

Steps to reproduce

Code snippet:

    # upload file
    file = st.file_uploader("Upload your PDF", type="pdf")

    if file is not None:
        if st.button('Generate engine: '):
            with st.spinner('Model generating the response engine...'):
                # ------------
                # LOADING THE FILE AND GENERATING THE EMBEDDINGS
                # ------------
                qa = RetrievalQA.from_chain_type(
                    llm=llm, chain_type="stuff", retriever=retriever, return_source_documents=True
                )

        st.header('Ask your data')
        user_q = st.text_area('Enter your questions here: ')

        if st.button('Get response'):
            try:
                with st.spinner('Model is working on it...'):
                    result = qa({"query": user_q})
                    st.subheader('Response: ')
                    st.write(result['result'])
                    st.subheader('Source pages: ')
                    st.write(result['source_documents']['metadata'])
            except Exception as e:
                st.error(f" An error: {e}")

Expected behavior:

I would like to reuse the ‘qa’ object for each new question without re-creating the embeddings.

Actual behavior:

The code snippet above raises an error: An error: local variable ‘qa’ referenced before assignment
So I guess the qa object does not survive into the run triggered by the second button. How can I solve that?

Goyo · July 27, 2023, 10:47am

You can add qa to session_state to preserve its value across reruns.
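
A minimal sketch of that suggestion, under some assumptions: `build_qa` is a hypothetical stand-in for the elided file-loading, embedding and `RetrievalQA.from_chain_type(...)` step from the question, and `build_engine`, `answer` and `app` are illustrative names, not Streamlit or LangChain APIs. The two helpers accept any dict-like object, which is how `st.session_state` behaves. Streamlit reruns the whole script on every widget interaction, so a plain local `qa` only exists in the run where 'Generate engine' was clicked; storing it in `st.session_state` keeps it for later runs.

```python
from typing import Any, Callable, MutableMapping, Optional


def build_engine(state: MutableMapping, file: Any, build_qa: Callable[[Any], Any]) -> None:
    """Run the expensive build step once and keep the result in session state."""
    state["qa"] = build_qa(file)


def answer(state: MutableMapping, question: str) -> Optional[dict]:
    """Reuse the stored engine; return None if it has not been built yet."""
    qa = state.get("qa")
    if qa is None:
        return None
    # Same call style as in the question: qa({"query": ...})
    return qa({"query": question})


def app(build_qa: Callable[[Any], Any]) -> None:
    """Streamlit page. build_qa(file) is a hypothetical placeholder for the
    embedding + RetrievalQA construction that the question elides."""
    import streamlit as st  # imported here so the helpers above stay dependency-free

    file = st.file_uploader("Upload your PDF", type="pdf")
    if file is None:
        return

    if st.button("Generate engine"):
        with st.spinner("Model generating the response engine..."):
            # Only runs on the rerun triggered by this button click.
            build_engine(st.session_state, file, build_qa)

    st.header("Ask your data")
    user_q = st.text_area("Enter your questions here:")

    if st.button("Get response"):
        with st.spinner("Model is working on it..."):
            result = answer(st.session_state, user_q)
        if result is None:
            st.warning("Generate the engine first.")
            return
        st.subheader("Response:")
        st.write(result["result"])
        st.subheader("Source pages:")
        # Assumption: source_documents is a list of LangChain documents,
        # so metadata is read per document rather than from the list itself.
        st.write([doc.metadata for doc in result["source_documents"]])
```

To use it, call `app(...)` at the bottom of the script with your own build function. Clicking 'Get response' then reads the stored engine instead of rebuilding the embeddings.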

czyzewm · July 28, 2023, 10:19am

Thank you, that works

