I built a Streamlit chat-with-your-PDF app and now want to deploy it so that my team members can try it out.
What would be good (cheap) options for hosting such an application? Most tutorials use an OpenAI model, but I am using open-source models, so I need something that gives me access to a GPU in the cloud.
At the moment I am running the app locally on my Mac M2 without any problems, even with the quantized version of Mixtral (33 GB RAM required). Smaller models that need 7 GB of RAM or less would also be an option.
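For sizing a cloud GPU against a quantized model, a back-of-the-envelope estimate can help. The sketch below is only a rough rule of thumb: it multiplies the parameter count by the bits per weight and adds a flat overhead. The function names and the 2 GB overhead default are illustrative assumptions; real usage also depends on context length, KV cache and the inference runtime.

```python
def estimate_weight_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    # billions of parameters * bits per parameter / 8 bits per byte = GB
    return n_params_billion * bits_per_weight / 8


def fits(n_params_billion: float, bits_per_weight: float,
         available_gb: float, overhead_gb: float = 2.0) -> bool:
    """Rough check whether a model fits into the given memory.

    overhead_gb is an assumed allowance for KV cache and runtime
    buffers, not a measured value.
    """
    needed = estimate_weight_gb(n_params_billion, bits_per_weight) + overhead_gb
    return needed <= available_gb


if __name__ == "__main__":
    # A 7B model at 4 bits per weight against a 16 GB GPU
    print(estimate_weight_gb(7, 4), fits(7, 4, available_gb=16))
```

Treat the result as a lower bound when comparing GPU instance types; measure actual memory use before committing to one.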
There’s a tutorial blog post to get you started. It shows how a Streamlit app can use an LLM (Llama 2) served by a hosting platform (Replicate) for response generation.
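The tutorial itself is not reproduced here. As a minimal sketch of the general idea (not the tutorial's code), the app can send the retrieved PDF text and the user's question to a hosted model and show the reply. This assumes the `replicate` Python client with `REPLICATE_API_TOKEN` set in the environment. The model identifier `meta/llama-2-7b-chat`, the prompt template and the helper names `build_prompt` and `generate_answer` are illustrative assumptions.

```python
def build_prompt(context: str, question: str) -> str:
    """Combine retrieved PDF text and the user question into one prompt."""
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


def generate_answer(context: str, question: str, run=None,
                    model: str = "meta/llama-2-7b-chat") -> str:
    """Ask a hosted LLM for an answer and return it as a single string.

    `run` defaults to `replicate.run`; it is injectable so the function
    can be tried without network access. The model name is an assumption.
    """
    if run is None:
        import replicate  # pip install replicate; reads REPLICATE_API_TOKEN
        run = replicate.run
    # Language models on Replicate typically stream back text chunks.
    output = run(model, input={"prompt": build_prompt(context, question)})
    return "".join(output)


# Inside the Streamlit app this could be wired up roughly like:
#   import streamlit as st
#   question = st.chat_input("Ask about the PDF")
#   if question:
#       st.write(generate_answer(pdf_text, question))  # pdf_text: your extracted text
```

With this setup the Streamlit host needs no GPU, since inference runs on the hosting platform and the app only makes an API call.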
Thanks! I also thought about putting the Streamlit app into a container and sharing it with colleagues by sending them the files, so that no cloud service is needed. Do you have any experience with this?