I googled around but could not find anything to fix my problem. They talk of internal canonical pages (many pages with same content in a given website).
The issue is that google thinks that my site is a copy of the awesome streamlit site.
It is a google algorithm that guards against copycat sites. Since google thinks my site is a copy, google does not add my ‘copycat’ site to the index. I do not think I can simply add a parameter somewhere saying “I am not a copycat“ because in that case all the copycat sites would do it and that would defy the aim of google.
The other website that had the same problem (link above) did not solve it. What they did was make a separate standard site that google crawls and put a link on that site to the streamlit app. Not nice.
I wonder if this is a common issue or it happens only if the streamlit app has some particular characteristics that make google think that it is equivalent to the awesome streamlit site.
My markdown content has nothing in common and the code has nothing in common.
So, by exclusion, is it that my script is relatively long (6000 lines)? So if you want to be indexed keep you code under lines?