Parquet vs Feather

caiosoter · December 20, 2023, 11:15pm

Hey guys, I am working on a project that has a dataset with more than 200 million rows, and I would like to use it efficiently. My first question is, since I will be using a S3 store from AWS, which file is the best to handle this amount of data? My second question is, since st.connection does not support “feather” files, this means that I should use parquet instead or it doesnt matter?

Thanks in advance.

dataprofessor · December 23, 2023, 12:48pm

Hi @caiosoter

Perhaps these 2 related blogs may provide you some insights on the use of data in the building of a performant app:

Hope this helps!

system · June 20, 2024, 12:48pm

This topic was automatically closed 180 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Read large files from s3 using st.connection Using Streamlit discussion	3	216	June 16, 2024
Slow website architecture Using Streamlit	2	282	July 25, 2024
Streamlit & processing in a production environment Using Streamlit discussion	0	141	January 3, 2025
Uploading large files via Streamlit to S3 Using Streamlit discussion	1	192	March 1, 2025
Reading data quickly Using Streamlit	7	3246	January 12, 2022

Parquet vs Feather

Related topics

Hello there 👋🏻

Cookie settings

Strictly necessary cookies

Performance cookies

Functional cookies

Targeting cookies