How to upload a pdf file in streamlit

andfanilo · April 7, 2020, 11:28am

Hello @Gyanaranjan_pathi, welcome to the Streamlit forums

On the uploading part, you can use Streamlit’s file_uploader to display a file uploader on your app, as such :

import streamlit as st

uploaded_file = st.file_uploader('Choose your .pdf file', type="pdf")
if uploaded_file is not None:
    df = extract_data(uploaded_file)

Then your PDF upload will be available as a StringIO object in the uploaded_file variable, so now to extract data from the PDF, you will need a Python library that can read your pdf as StringIO or a filelike object.

I used pdfplumber to extract tables from PDFs in one of my Streamlit apps, pdfplumber.load accepts StringIO so you can do :

def extract_data(feed):
    data = []
    with pdfplumber.load(feed) as pdf:
        pages = pdf.pages
        for p in pages:
            data.append(p.extract_tables())
    return None # build more code to return a dataframe

but there are multiple other librairies like camelot, tabula-py or pdfminersix and I had to test multiple ones for my use case before going with pdfplumber so you may need to test multiple ones too depending on the info you need to extract !

Hope this helps

Topic		Replies	Views
Streamlit-Azure Integration: Uploading, Processing, and Displaying PDF Files in Web Applications Show the Community! cache , file-upload , streamlit-cloud	1	2758	April 18, 2024
How to enable raw string literal 'r' and binary format 'rb' during pdf upload/read? Using Streamlit	4	1896	April 1, 2023
How to run streamlit on google colab? Using Streamlit streamlit-cloud , debugging	2	911	October 13, 2024
Streamlit App - Converting an Uploaded PDF to Seperate Images for Downloading Using Streamlit file-download	3	2744	February 1, 2024
Unable to use uploaded pdf file for pdftotext parsing on streamlit Using Streamlit debugging	4	40	October 12, 2024

How to upload a pdf file in streamlit

Related topics

Hello there 👋🏻

Cookie settings

Strictly necessary cookies

Performance cookies

Functional cookies

Targeting cookies