Caching pandas dataframe

bjornvandijkman · March 11, 2020, 12:41pm

Hello everyone,

I am trying to read in a pandas dataframe using the code down below. However, it is giving me the following error: **UnhashableType** : Cannot hash object of type _io.StringIO

I looked at the documentation but I still cannot figure out why it does not work for me. Any suggestions?

import streamlit as st
import pandas as pd

# Uploader widget
st.sidebar.title("Upload Your File")
filename = st.sidebar.file_uploader("Choose a file", type=['xlsx', 'csv'])
delimiter_choice = st.sidebar.selectbox("In case you uploaded a CSV file, "
                                        "how is your data delimited?", [';', ','])
st.sidebar.markdown("---")


# Function that tries to read file as a csv
# if selected file is not a csv file then it will load as an excel file
@st.cache
def try_read_df(f):
    try:
        return pd.read_csv(f, sep=delimiter_choice)
    except:
        return pd.read_excel(f)


if filename:
    df = try_read_df(filename)

st.write(df)

andfanilo · March 11, 2020, 1:01pm

Hi @bjornvandijkman,

You are probably hitting this issue which comes from this original discussion where you want to cache the results of a Dataframe that is being created from an uploaded file. Streamlit doesn’t know yet how to handle a file stream from its file uploader widget.

Until the issue is being solved natively by Streamlit, you can try to hash part of the uploaded file or use the more consuming solution of reading and hashing the entire file

bjornvandijkman · March 11, 2020, 1:26pm

Thanks! The last solution in the linked issue works for me.

tc1 · March 11, 2020, 3:40pm

Hey @bjornvandijkman & @andfanilo,

Here is the pull request that will natively solve this

Topic		Replies	Views
Hash function error for uploaded text file Using Streamlit cache	10	6928	November 19, 2021
InternalHashError: 1 - DF readin @st.cache bug Using Streamlit	5	914	January 12, 2022
UnhashableType: Cannot hash object of type _thread._local Using Streamlit cache	16	11433	January 4, 2023
Using caching with API calls and messy DataFrames Using Streamlit cache , pandas	5	1260	November 19, 2021
When loading a function using @st.cache i get errors Using Streamlit	7	1131	December 24, 2023

Caching pandas dataframe

Related topics

Hello there 👋🏻

Cookie settings

Strictly necessary cookies

Performance cookies

Functional cookies

Targeting cookies