Hey guys, I am working on a project with a dataset of more than 200 million rows, and I would like to work with it efficiently. My first question: since I will be storing the data in an S3 bucket on AWS, which file format is best for handling this amount of data? My second question: since st.connection does not support “feather” files, does this mean I should use Parquet instead, or does it not matter?
Thanks in advance.