Any LLM developers with recommendations for testing libraries?

shawngiese · April 16, 2024, 2:40pm

I am looking to put together some automated tests for my LLM RAG chatbot. I would be interested to hear from anyone already doing this.

tonykip · April 16, 2024, 7:45pm

Hi @shawngiese,

Thanks for sharing this question!

Are you looking for tests for the app side or the RAG logic?

shawngiese · April 16, 2024, 9:09pm

Eventually I will want tests for the app side, but for now I am mainly looking for ways to test the RAG logic so I can combine that with the human tests : )

tonykip · April 19, 2024, 2:03pm

That’s an interesting idea. I’d suggest looking into observability platforms such as LangSmith or Nomic, which help you evaluate your RAG pipelines.

shawngiese · April 19, 2024, 9:39pm

Thanks tonykip. I liked the look of the Giskard RAG Evaluation Toolkit, where I can set my own ground truths to match either the role of the LLM or the embeddings. I have not tried it yet, but its docs were speaking along the lines of what I was thinking.
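
For anyone reading along, here is a rough sketch of the ground-truth idea in plain Python. It does not use Giskard's API or any other library. The toy corpus, the `retrieve` and `answer` functions, and the test cases are all hypothetical placeholders. In a real chatbot you would swap in your own retriever and LLM call. It scores the retrieval step and the generated answer separately, so a failure can be traced to one or the other.

```python
import re
from dataclasses import dataclass


@dataclass
class GroundTruth:
    question: str
    expected_doc_id: str    # document the retriever should surface
    expected_phrase: str    # phrase the final answer should contain


# Hypothetical toy corpus standing in for a real vector store.
DOCS = {
    "install": "Install Streamlit with pip install streamlit",
    "run": "Start an app with streamlit run app.py",
    "cache": "Use st.cache_data to cache expensive computations",
}


def tokens(text: str) -> set:
    """Lowercase word tokens, punctuation stripped."""
    return set(re.findall(r"\w+", text.lower()))


def retrieve(question: str, k: int = 1) -> list:
    """Placeholder retriever: rank documents by word overlap with the question."""
    q = tokens(question)
    ranked = sorted(DOCS, key=lambda doc_id: len(q & tokens(DOCS[doc_id])), reverse=True)
    return ranked[:k]


def answer(question: str) -> str:
    """Placeholder generator: returns the top document instead of calling an LLM."""
    return DOCS[retrieve(question)[0]]


def evaluate(cases: list) -> dict:
    """Score retrieval and answer quality separately against the ground truths."""
    retrieval_hits = sum(c.expected_doc_id in retrieve(c.question) for c in cases)
    answer_hits = sum(c.expected_phrase.lower() in answer(c.question).lower() for c in cases)
    return {
        "retrieval_hit_rate": retrieval_hits / len(cases),
        "answer_phrase_rate": answer_hits / len(cases),
    }


CASES = [
    GroundTruth("How do I install Streamlit?", "install", "pip install streamlit"),
    GroundTruth("How do I run my app?", "run", "streamlit run"),
    GroundTruth("How can I cache expensive computations?", "cache", "st.cache_data"),
]

if __name__ == "__main__":
    print(evaluate(CASES))
```

Exact phrase matching is a deliberately crude check. With a real LLM the wording varies between runs, so you would likely replace it with an embedding-similarity or LLM-as-judge comparison, while keeping the same ground-truth table.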

tonykip · April 23, 2024, 1:43pm

Cool! I haven’t tried it before, so thanks for sharing!

system · October 20, 2024, 1:43pm

This topic was automatically closed 180 days after the last reply. New replies are no longer allowed.
