Concurrency Performance Stress Test

Hi, here are the details of my application:

  • Python 3.11 and Streamlit 1.28.1
  • Locally Deployed

I’ve been trying to stress test the application by simulating users hitting the endpoint it calls. From my understanding, a copy of the application is created and launched for each user that connects to the website. Does anyone have experience with a load of around 100 users or ~150 requests per minute? How does Streamlit hold up under that load, especially if each request has to call an external API such as OpenAI's for a chatbot?
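
To make the numbers concrete, here is a rough sketch of the load shape described above: up to 100 concurrent users and about 150 requests per minute (one request every 0.4 s). It is only an illustration. `fake_endpoint_call`, `simulate_load`, and `interval_for_rate` are hypothetical names, and the stand-in call just sleeps instead of contacting a real endpoint or OpenAI. It paces calls to an endpoint only and does not drive a Streamlit browser session.

```python
import asyncio
import time


async def fake_endpoint_call(user_id: int) -> float:
    """Hypothetical stand-in for the real endpoint call; returns latency in seconds."""
    start = time.perf_counter()
    await asyncio.sleep(0.01)  # assumption: pretend the upstream API took ~10 ms
    return time.perf_counter() - start


def interval_for_rate(requests_per_minute: float) -> float:
    """Seconds to wait between request starts to hit the target rate."""
    return 60.0 / requests_per_minute


async def simulate_load(call, users: int, total_requests: int, interval_s: float) -> list:
    """Start total_requests calls spaced interval_s apart, with at most `users` in flight."""
    limit = asyncio.Semaphore(users)

    async def one_request(i: int) -> float:
        async with limit:  # cap concurrency at the number of simulated users
            return await call(i % users)

    tasks = []
    for i in range(total_requests):
        tasks.append(asyncio.create_task(one_request(i)))
        await asyncio.sleep(interval_s)  # pace the request starts
    return await asyncio.gather(*tasks)
```

For a one-minute run at the target rate, something like `asyncio.run(simulate_load(fake_endpoint_call, users=100, total_requests=150, interval_s=interval_for_rate(150)))` returns the per-request latencies, which can then be summarized (mean, max, percentiles). Swapping `fake_endpoint_call` for a real async HTTP call would be the next step.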