Restart streamlit server after vaex file update

Rafael_Del_Rey · June 4, 2022, 9:44pm

I m using streamlit to create a exploratory data tool. The application is running in a container in ECS Fargate, and this container runs two processes:

python update_service.py, that monitors an S3 bucket for updated vaex hdf5 file. Whenever a new file is found, it is downloaded into the container where it is used by an streamlit app;
steamlit run app.py, which starts streamlit server;

Due to the memory mapped nature of vaex hdf5 files, whenever the file is updated on the file system by update_service.py, Linux delete the original file in use, but keeps it’s content in an orphaned (deleted) state, spending storage. Each update, makes the free disk space smaller. It seems that depending on what you do with the data, a handle is not released, even if you close the data, delete the variable and call gc.collect(). Lsof command shows the opened hdf5 file in a “deleted” state.

I tracked down the issue to the function to_pandas_df(). I call it to transform a
vaex dataframe into a pandas one, before showing it in a graph or aggrid. It looks like when I do this, vaex lose track of the memory mapped file handle. Fixing this permanently might requires some change on vaex core, but in the meantime, I was thinking a temporary workaround would be restarting the streamlit server altogether, whenever the hdf5 is modified. A “Rerun” or a browser Refresh dont do the trick. It has to be a total restart of the streamlit server (like stopping and command “streamlit run app.py” again).

How could I do such server restart properly?

system · June 4, 2023, 9:44pm

This topic was automatically closed 365 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Hot-Reloading Issue Using Streamlit debugging	3	2862	April 17, 2024
Invalidating and rebuilding caches during the night without user interaction Using Streamlit cache	4	581	May 27, 2024
Rerun function not working after deployment Using Streamlit discussion	13	1026	December 16, 2024
Streamlit app consistently restarting on Streamlit Sharing even though it runs fine locally Using Streamlit	9	815	June 18, 2023
Streamlit Cloud : Deployed App crashes without any error after 30-20 minutes Community Cloud cache , session-state , pandas	3	1147	November 30, 2022

Restart streamlit server after vaex file update

Related topics

Hello there 👋🏻

Cookie settings

Strictly necessary cookies

Performance cookies

Functional cookies

Targeting cookies