I have uploaded a PDF file using st.file_uploader() from streamlit and then trying to parse it using LlamaParse() currently running on localhost. Issue is I am getting blank list as output[] when I am using this uploaded_file object from the below code. I am not sure if a returned uploaded object …

Uploaded pdf not parsing (Returns blank output) through llama in streamlit

Goyo September 20, 2024, 6:38pm 14

I was unable to test this until right now.

When I pass a file-like object (the uploaded file)

st.session_state.doc_parsed = LlamaParse(
    result_type="markdown", api_key=key_input
).load_data(st.session_state.uploaded_file)

I get an empty list as well, along with a message in the terminal:

Error while parsing the file '<bytes/buffer>': file_name must be provided in extra_info when passing bytes

Passing extra_info={"file_name": "_"} to load_data as sugested by the message fixed it for me. As far as I can tell any non-empty string works. This is absent from the examples and I have no idea why it is needed.

2 Likes

Topic		Replies	Views
Unable to use uploaded pdf file for pdftotext parsing on streamlit Using Streamlit debugging	4	42	October 12, 2024
How to upload a pdf file in streamlit Using Streamlit file-upload	14	27101	May 28, 2024
httpx.ReadTimeout: timed out when running Llama3 model that queries PDF files on Streamlit Using Streamlit llms , debugging	1	720	November 19, 2024
Make chatbot to read and answer from pdf files Community Cloud	23	7478	June 12, 2024
Expected str, bytes or os.PathLike object, not UploadedFile for PDF file Community Cloud	3	2972	May 13, 2022

Uploaded pdf not parsing (Returns blank output) through llama in streamlit

Related topics

Hello there 👋🏻

Cookie settings

Strictly necessary cookies

Performance cookies

Functional cookies

Targeting cookies