Optimizing a language model app (GPT-Neo)

Hello community, I have been working on a text generation app using GPT-Neo, but I've observed that the app takes a long time to load and generate text. Does anyone know a way to optimize this app by using caching, so the app doesn't reload the model every time it runs? Here is the link to my code: Click here

Hi @seyirex,

That’s a really good idea! To prevent re-loading your entire model into memory on every widget interaction, you can wrap line 20 in st.cache. Doing so will reuse the same cached model across multiple simultaneous users, rather than loading a separate 500 MB model into RAM for each user.

Here’s an example – replace line 20 with the following:

# Tell st.cache to hash aitextgen objects by identity,
# since they aren't natively hashable
@st.cache(hash_funcs={aitextgen: id})
def load_model():
    # This slow download/load now runs only once per session
    model = aitextgen(model="EleutherAI/gpt-neo-125M")
    return model

ai = load_model()
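To see why this helps, here is a minimal sketch (plain Python, no Streamlit needed) of what the caching does conceptually: the expensive load runs once, and later calls reuse the stored result. The names and the counter here are purely illustrative.

```python
import functools

load_count = 0

@functools.lru_cache(maxsize=None)
def load_model():
    global load_count
    load_count += 1  # stand-in for the slow 500 MB model load
    return "gpt-neo-125M-weights"

m1 = load_model()  # first call: actually "loads" the model
m2 = load_model()  # later calls: served from the cache instantly
assert m1 is m2 and load_count == 1
```

With st.cache the idea is the same, except the cache is shared across reruns and users of your app.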

Happy Streamlit-ing! :balloon:
Snehan

Thanks a lot @snehankekre, this is really helpful.