Hi @Charly_Wargnier! I think your app was affected by a known bug on our side, which sometimes causes the app screen to be stuck in the “oven” interstitial. Can you try accessing the app now?
Do let me know if you are still running into issues, thanks!
BTW from what I can see, the app is reporting a ModuleNotFoundError: No module named 'requests_html'. The logs in the terminal might provide a hint why this is happening.
ERROR: Could not find a version that satisfies the requirement pywin32==228 (from -r requirements.txt (line 77)) (from versions: none)
ERROR: No matching distribution found for pywin32==228 (from -r requirements.txt (line 77))
On my Mac:
$ pip install pywin32
ERROR: Could not find a version that satisfies the requirement pywin32 (from versions: none)
ERROR: No matching distribution found for pywin32
I managed to make it work, not via pip yet by adding the following lines in app.py:
import nltk
nltk.download('punkt')
The app seems to be running smoothly with various URLs as long as the scraped content is not too large. If too much content to analyze, the app is crashing and restarting.
It looks like the app is running out of memory on the platform. Do you have a heuristic on how frequently this would happen for the typical use case of the app?
Hi Charly! I increased the memory limits on your app to a higher value and was able to see the example you gave run successfully. LMK if you run into any other issues with this or any other app.
I’ve tried with the URL that failed before and I’ve still got a crash. Here’s the URL:
I’ve also pasted the latest log FYI.
I’ll try with smaller pieces of content later on tonight. I guess we could always add a caveat to the app - stating that it can analyze up to N characters.
Hi Charly, nice work on the app.
Your hack to getting the nltk work by downloading the dependencies in the main file might increase the latency.
The app will attempt to download this file for every instance, which is not effective.
I also have this issue in my app with 4 nltk dependencies. including this in my main file will definitely increase latency.
It will be great if streamlit team can provision for a special requirement file for nltk dependencies that cannot be downloaded using PIP
PIP use requirements.txt for dependencies installations
a special dependency file e.g nltk.txt if provided by streamlit team can used by nltk.download during deployment so that these files don’t get downloaded for every instance of the app