Hello!
I deployed my Streamlit app and it didn’t show any error in the “manage app” terminal, but when I try to use my app it shows an error saying that Tesseract is not installed, or it’s not in my way. Could you help me solve this?
app: https://share.streamlit.io/guilhermedonizetti/ocr_python/main.py
git repo: GitHub - guilhermedonizetti/OCR_Python: Aplicação em Python para Optical Character Recognition (OCR), uma técnica para extrair textos em imagens. Adicionalmente, o programa tenta encontrar padrões (CPFs e datas) e analisar a quantidade de palavras boas e más.
Hi @guilhermedonizetti, welcome to the Streamlit community!!
The installation instructions for pytesseract
say that Google’s tesseract-ocr is a dependency. Meaning, you need to install tesseract-ocr
as an apt-get dependency (for Linux applications outside the Python environment).
The way to do that is to include the necessary dependencies in a packages.txt
file in your repo:
-
Create a new
packages.txt
file with the following lines:tesseract-ocr tesseract-ocr-por
-
The first line installs the base
tesseract-ocr
application and supports only English by default. To support Portuguese, you need to install a language specific tesseract data file, which we do in the second line. -
Reboot your app
Once you make the above changes, your app should successfully deploy!
Happy Streamlit’ing!
Snehan
@snehankekre it worked correctly, perfect.
Thank you very much!!
Hi there!
I’ve run into the same issue and the solution above doesn’t work for me. All the code worked before I deployed the app (I could extract text from images). However, once I deployed the app, the part with tesseract stopped working (it displays the same issue as the person above had).
git repo: GitHub - AkmOleksandr/App
app: https://akmoleksandr-app-summarizerai-bge19e.streamlit.app/
I would really appreciate help,
Thank you in advance.