Meet RAGxplorer - an interactive tool to visualize the retrieval techniques and diagnose which chunks are being retrieved. I hope this application is useful especially for those who are learning RAGs and exploring the embedding space of your documents.
Application
With RAGxplorer, you can:
Upload your own own PDF
Configure the chunk size and chunk overlap
Visualise where their query is in the embedding space, and the chunks that are a top-k match
[NEW!] Experiment with Query Expansion techniques (e.g. multi-questions, hypothetical answer/HyDE)
Once, again thank you to the streamlit team for hosting this competition.
Here’s the repo I hope this helps folks building RAG applications!
For now, the best experience is to clone the repo and run this locally!
The vector database is an in-memory one (Chroma), and I suspect many people using it, or quitting the app while the vector database is being built, causes some issues. Some tweaks to the control flow and deployment may resolve that.
Any feedback for improvement would be most appreciated. I will still continue to work on this.