Word2vec-google-news-300

hschreib · January 20, 2021, 6:00am

Hi! Brand new to Streamlit and trying to figure out how to convert something I built in Google Colab to Streamlit.

I am trying to allow a user to input a set of words, and receive as output a word most similar to the set. To do this, I’m using gensim.downloader to pull in word2vec-google-news-300 and using the “most_similar” function. I am trying to cache this process in a function “load_model()” function so that the user does not need to download the gensim model every time they want to input a new word to try.

However, it is not working (it seems to stall out every time and never reach success). Any suggestions?

@st.cache()
def load_model():
    with st.spinner('Downloading word2vec model... please hold...'):
        model = api.load('word2vec-google-news-300')
    return model


def main():
    model = load_model()
    st.success('Done!')

randyzwitch · January 20, 2021, 1:49pm

Hi @hschreib, welcome to the Streamlit community!

Are you developing this locally on your machine or is this app deployed somewhere? Is there a GitHub repo we can look at to see the setup?

Best,
Randy

hschreib · January 20, 2021, 3:52pm

Hi @randyzwitch,

Thanks for the response! I am developing this locally on my machine for now but hoping to make it easy to deploy and have others use (hence my interest in Streamlit!).

I don’t have a Github repo yet with the local code, but you can see what I originally built for Google Colab here: mcit-hackathon/mcit_hackathon_codenames_word2vec.ipynb at main · hschrei7/mcit-hackathon · GitHub

(it is essentially the same, but the Google Colab uses some other, worse widgets/UI that I’m trying to improve via Streamlit!)

Henry

hschreib · January 21, 2021, 12:38am

I believe I fixed this issue by putting

@st.cache(allow_output_mutation=True)

above the load_model function

Topic		Replies	Views
Optimization of language model app(GPT3) Using Streamlit	3	483	August 15, 2022
Streamlit over Resources after a couple of hours Community Cloud cache	4	683	May 12, 2021
Loading and caching of models and mutable objects Using Streamlit cache , spacy	7	9607	November 19, 2021
Embedding models and LlamaCPP models getting duplicated on load Using Streamlit cache , session-state	5	1747	May 5, 2024
Deploy a deep learning model as a web app Using Streamlit	3	2578	June 8, 2023

Word2vec-google-news-300

Related topics