The issue is that `model.transcribe` expects either a file path string, a NumPy array, or a PyTorch Tensor, and Streamlit's `UploadedFile` is none of these (see `whisper/transcribe.py` in the openai/whisper repository on GitHub for details).
The easiest way to solve this is to save the uploaded file to a temporary file with a known path.
```python
from tempfile import NamedTemporaryFile

import streamlit as st
import whisper

audio = st.file_uploader("Upload an audio file", type=["mp3"])

if audio is not None:
    # Note the leading dot in the suffix, so the temp file gets a proper
    # ".mp3" extension rather than a name ending in "...mp3" with no dot
    with NamedTemporaryFile(suffix=".mp3") as temp:
        temp.write(audio.getvalue())
        temp.seek(0)
        model = whisper.load_model("base")
        result = model.transcribe(temp.name)
        st.write(result["text"])
```
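The temporary-file pattern can also be seen in isolation, independent of Streamlit and Whisper (a minimal stdlib-only sketch; the byte string stands in for `audio.getvalue()`): while the `with` block is open the file has a real path that any library can read, and on exit it is deleted automatically.

```python
import os
from tempfile import NamedTemporaryFile

payload = b"fake mp3 bytes"  # stand-in for the uploaded file's contents

with NamedTemporaryFile(suffix=".mp3") as temp:
    temp.write(payload)
    temp.seek(0)
    # Inside the block, temp.name is a real on-disk path
    assert os.path.exists(temp.name)
    assert temp.name.endswith(".mp3")
    path = temp.name

# On exit the file is removed, so nothing lingers on disk
assert not os.path.exists(path)
```

One caveat: on Windows, a file opened by `NamedTemporaryFile` cannot always be reopened by another process while it is still open, so this pattern is most reliable on Linux and macOS.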
This seems to work well. I can't speak to the legal constraints, but if you are running this app on your local machine, the uploaded file never leaves it. If you are running the app on a remote server, this method will (at least temporarily) write the file to the server's disk.