Hello!
I deployed my Streamlit app and it didn’t show any error in the “manage app” terminal, but when I try to use my app it shows an error saying that Tesseract is not installed, or it’s not in my way. Could you help me solve this?
app: https://share.streamlit.io/guilhermedonizetti/ocr_python/main.py
git repo: GitHub - guilhermedonizetti/OCR_Python: Aplicação em Python para Optical Character Recognition (OCR), uma técnica para extrair textos em imagens. Adicionalmente, o programa tenta encontrar padrões (CPFs e datas) e analisar a quantidade de palavras boas e más.
Hi @guilhermedonizetti, welcome to the Streamlit community!!
The installation instructions for pytesseract
say that Google’s tesseract-ocr is a dependency. Meaning, you need to install tesseract-ocr
as an apt-get dependency (for Linux applications outside the Python environment).
The way to do that is to include the necessary dependencies in a packages.txt
file in your repo:
-
Create a new
packages.txt
file with the following lines:tesseract-ocr tesseract-ocr-por
-
The first line installs the base
tesseract-ocr
application and supports only English by default. To support Portuguese, you need to install a language specific tesseract data file, which we do in the second line. -
Reboot your app
Once you make the above changes, your app should successfully deploy!
Happy Streamlit’ing!
Snehan
Hi there!
I’ve run into the same issue and the solution above doesn’t work for me. All the code worked before I deployed the app (I could extract text from images). However, once I deployed the app, the part with tesseract stopped working (it displays the same issue as the person above had).
git repo: GitHub - AkmOleksandr/App
app: https://akmoleksandr-app-summarizerai-bge19e.streamlit.app/
I would really appreciate help,
Thank you in advance.