I am a newbie on Streamlit. Hello to all. I am trying to use cloud by connecting via github. I created this app by taking a short Coursera course. But when i try to figure it out go live. It goes some of errors which I couldn’t handle. I think I figured out requirements and DATA_URL part but maybe I am missing something, I don’t know.
Check your csv file in your GitHub repo, it contains garbage, it is not a valid csv file.
Edit: And now we have also a FileNotFoundError - the filename does not match.
Thank you for your super fast reply. Actually I tried to upload it again but it is bigger than 25mb. I don’t know how could I solve this. Maybe I can directly go to the link of csv file and then get it from there for my app
The download from NYC website is way too slow.
Also, the CSV format is large and slow for big datasets…
I would preprocess the data offline, remove unnecessary columns from the dataframe and save the pandas dataframe in a compact and fast file format in github (e.g. feather).
Another, but much more complex option would be to use the API from this dataset.
However, this only makes sense if you want to make specific queries of subsets of the data.
NYC Open Data provides its own API for this dataset:
I forked your github project and added a preprocessing script that produces a parquet file, which is much smaller than csv and even feather file.
See my idea here:
Your preprocess looks cool. Thank you very much really. I didnt know about parquet file which is also useful.
But one little problem, I want data to be stay online as they updating the data every month or so. I am now searching for how to make that preprocess code repeatedly on a certain time.
It has nothing to do with the app itself, it is a dependency issue of streamlit that popped up yesterday, due to a change in the library click that is used by streamlit.
Thanks for stopping by! We use cookies to help us understand how you interact with our website.
By clicking “Accept all”, you consent to our use of cookies. For more information, please see our privacy policy.
Cookie settings
Strictly necessary cookies
These cookies are necessary for the website to function and cannot be switched off. They are usually only set in response to actions made by you which amount to a request for services, such as setting your privacy preferences, logging in or filling in forms.
Performance cookies
These cookies allow us to count visits and traffic sources so we can measure and improve the performance of our site. They help us understand how visitors move around the site and which pages are most frequently visited.
Functional cookies
These cookies are used to record your choices and settings, maintain your preferences over time and recognize you when you return to our website. These cookies help us to personalize our content for you and remember your preferences.
Targeting cookies
These cookies may be deployed to our site by our advertising partners to build a profile of your interest and provide you with content that is relevant to you, including showing you relevant ads on other websites.