Modifying LLM response

I’m working on a chatbot with LlamaIndex and am using Streamlit to build the app. I’m starting from the basic template in the Streamlit repo (llamaindex-chat-with-streamlit-docs/streamlit_app.py at main · streamlit/llamaindex-chat-with-streamlit-docs · GitHub), and the code that displays the LLM response is shown below:

# If last message is not from assistant, generate a new response
if st.session_state.messages[-1]["role"] != "assistant":
    with st.chat_message("assistant"):
        response_stream = st.session_state.chat_engine.stream_chat(prompt)
        st.write_stream(response_stream.response_gen)
        message = {"role": "assistant", "content": response_stream.response}
        # Add response to message history
        st.session_state.messages.append(message)

This might not be the most efficient method, but I am trying to verify some of the text in the LLM response before displaying it to the user. With this approach I assume I cannot use the “st.write_stream(response_stream.response_gen)” line, since the LLM has to finish its entire response before I can verify the text. So, what I want to know is:

  1. I can access the response string using response_stream.response, but it comes back empty, presumably because the answer is still being generated. How do I get the complete response once it has finished generating?
  2. Is there a way to display a loading circle while it generates this response?

Thanks

The complete response could be in:

response_stream.response

You can inspect it before streaming it. Try something like this:

if st.session_state.messages[-1]["role"] != "assistant":
    with st.chat_message("assistant"):
        response_stream = st.session_state.chat_engine.stream_chat(prompt)

        # Inspect the response
        complete_response = response_stream.response
        response_ok = verify_response(complete_response)

        # Stream response if ok.
        if response_ok:

            # Show the response in streams.
            st.write_stream(response_stream.response_gen)

            # Format the response 
            message = {"role": "assistant", "content": response_stream.response}

            # Add response to message history
            st.session_state.messages.append(message)
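
A note on verify_response: it isn’t defined anywhere in this thread, it just stands for whatever check you want to run on the finished text. A trivial, purely hypothetical placeholder, only so the snippet above is runnable, might look like this:

def verify_response(text: str) -> bool:
    # Hypothetical check, only here so the snippet above runs:
    # reject empty text and a couple of flagged phrases.
    # Swap this out for your real verification logic.
    banned_phrases = ["as an ai language model"]
    lowered = text.lower()
    return bool(text.strip()) and not any(phrase in lowered for phrase in banned_phrases)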

Unfortunately, response_stream.response is still empty with this method, and no response is shown when I try out the app. From my tests it seems that you have to run the st.write_stream(response_stream.response_gen) line in order to fill response_stream.response.

So if I do something like:

st.write_stream(response_stream.response_gen)
complete_response = response_stream.response
response_ok = verify_response(complete_response)
if response_ok:
    # Show the response in streams.
    st.write_stream(response_stream.response_gen)
    ...

The app displays the unverified response first and then shows the verified response after it. I’m not sure why running the st.write_stream(response_stream.response_gen) line is necessary before response_stream.response gets filled.
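
One way around this is to exhaust the generator yourself before anything is written to the page. Here is a rough sketch, assuming response_gen yields the text tokens and response is only filled once the generator has been consumed (as observed above), and reusing the hypothetical verify_response check from earlier. st.spinner also gives you the loading circle asked about in question 2:

# If last message is not from assistant, generate a new response
if st.session_state.messages[-1]["role"] != "assistant":
    with st.chat_message("assistant"):
        response_stream = st.session_state.chat_engine.stream_chat(prompt)

        # Consume the token generator ourselves; st.spinner shows a
        # loading indicator while the full text is being produced.
        with st.spinner("Generating response..."):
            complete_response = "".join(response_stream.response_gen)

        # Only display the text once it has passed the check.
        if verify_response(complete_response):
            st.markdown(complete_response)
            message = {"role": "assistant", "content": complete_response}
            # Add response to message history
            st.session_state.messages.append(message)
        else:
            st.markdown("Sorry, I can't show that response.")

The tradeoff is that the reply appears all at once instead of streaming in, which matches the assumption in the original question that st.write_stream can’t be used when the text has to be verified first.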