Streamlit & processing in a production environment

Hello Streamlit-team,
my team and I are looking in the possibility to use streamlit in a production environment and we want to create our own BI tool. The dataset is currently roughly 12GBs and will grow in the future. It consist of roughly 2 million rows. The challenge is: the user will have a lot of filter possibilities. In total, we will have 22 filters.

I know that the “bottleneck” is not streamlit, I would be more interested if someone is using streamlit with AWS services already in a production environment. Where is the data stored? How do you run queries and what are you using for caching?

It would be super nice if someone could give me a rundown on their architecture - from the data storing to the querying, possible caching and finally passing to streamlit.

Thanks in advance and happy new year