Show ragas Evaluating progress on streamlit ui

Bhaiya · January 24, 2025, 12:27pm

from ragas import evaluate

result = evaluate(eval_dataset, metrics=filtered_metrics, llm=evaluator_llm, embeddings=evaluator_embeddings)

This is the signature of the ragas evaluate function:

@track_was_completed
def evaluate(
    dataset: t.Union[Dataset, EvaluationDataset],
    metrics: t.Optional[t.Sequence[Metric]] = None,
    llm: t.Optional[BaseRagasLLM | LangchainLLM] = None,
    embeddings: t.Optional[BaseRagasEmbeddings | LangchainEmbeddings] = None,
    callbacks: Callbacks = None,
    in_ci: bool = False,
    run_config: RunConfig = RunConfig(),
    token_usage_parser: t.Optional[TokenUsageParser] = None,
    raise_exceptions: bool = False,
    column_map: t.Optional[t.Dict[str, str]] = None,
    show_progress: bool = True,
    batch_size: t.Optional[int] = None,
) -> EvaluationResult:

How can I show this progress in Streamlit as well, so that the user is aware of the progress? Any ideas? Can someone help?

I am seeing this progress bar in my terminal, and I want to show it in the UI as well:

Evaluating: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 50/50 [00:15<00:00, 3.30it/s]

blackary · January 24, 2025, 2:22pm

Hi @Bhaiya, welcome to the forum!

I discovered that you can pass a custom tqdm progress bar to evaluate, and that the tqdm bar can be subclassed so that every update also updates a Streamlit st.progress bar.

Here’s a complete example that worked for me:

import streamlit as st
from datasets import load_dataset
from langchain_openai import ChatOpenAI, OpenAIEmbeddings
from ragas import EvaluationDataset, SingleTurnSample, evaluate
from ragas.embeddings import LangchainEmbeddingsWrapper
from ragas.llms import LangchainLLMWrapper
from ragas.metrics import AspectCritic
from tqdm.auto import tqdm as std_tqdm

evaluator_llm = LangchainLLMWrapper(ChatOpenAI(model="gpt-4o"))
evaluator_embeddings = LangchainEmbeddingsWrapper(OpenAIEmbeddings())


test_data = {
    "user_input": "summarize given text\nThe company reported an 8% rise in Q3 2024, driven by strong performance in the Asian market. Sales in this region have significantly contributed to the overall growth. Analysts attribute this success to strategic marketing and product localization. The positive trend in the Asian market is expected to continue into the next quarter.",
    "response": "The company experienced an 8% increase in Q3 2024, largely due to effective marketing strategies and product adaptation, with expectations of continued growth in the coming quarter.",
}

metric = AspectCritic(
    name="summary_accuracy",
    llm=evaluator_llm,
    definition="Verify if the summary is accurate.",
)
test_data = SingleTurnSample(**test_data)
metric.single_turn_score(test_data)

eval_dataset_raw = load_dataset(
    "explodinggradients/earning_report_summary", split="train"
)
eval_dataset = EvaluationDataset.from_hf_dataset(eval_dataset_raw)

st.write("Features in dataset:", eval_dataset.features())
n_samples = len(eval_dataset)
st.write("Total samples in dataset:", n_samples)

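# Streamlit progress bar that the tqdm subclass below keeps in sync.
progress_bar = st.progress(0, text="Evaluation progress")


class TqdmExt(std_tqdm):
    """tqdm bar that mirrors its progress to the Streamlit progress bar."""

    def update(self, n=1):
        # tqdm's update() returns True only when the display was refreshed.
        displayed = super().update(n)
        if displayed:
            progress_bar.progress(
                self.n / n_samples, text=f"Evaluating sample {self.n} of {n_samples}"
            )
        return displayed


custom_tqdm = TqdmExt()

# _pbar is a private argument of evaluate(), so it may change between ragas versions.
results = evaluate(eval_dataset, metrics=[metric], _pbar=custom_tqdm)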
st.write(results.to_pandas())
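
One detail that may be worth guarding against: when st.progress is given a float, it only accepts values from 0.0 to 1.0. If the tqdm bar ever counts more steps than n_samples (for example if several metrics are scored per sample, which is an assumption about how ragas counts its jobs and not something I verified), the ratio would go above 1.0. Here is a minimal sketch of a clamping helper. The name progress_fraction is hypothetical and is not part of ragas or Streamlit:

```python
def progress_fraction(done: int, total: int) -> float:
    """Return done / total clamped to [0.0, 1.0], the float range st.progress accepts."""
    if total <= 0:
        # Unknown or empty total: report no progress instead of dividing by zero.
        return 0.0
    return max(0.0, min(done / total, 1.0))


print(progress_fraction(25, 50))
print(progress_fraction(60, 50))
```

Inside TqdmExt.update you could then call progress_bar.progress(progress_fraction(self.n, n_samples), text=...) instead of passing the raw ratio.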

system · January 29, 2025, 7:41pm

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.

blackary · February 21, 2025, 2:25pm

I see @Bhaiya is running into an issue:

Hi, thanks for the reply, but the code is not working for me. It gives the error below:

result = func(*args, **kwargs)
TypeError: evaluate() got an unexpected keyword argument '_pbar'

It sounds like you need to update to a newer version of ragas. You can see that argument in the current version of the code, in ragas/src/ragas/evaluation.py on the main branch of the explodinggradients/ragas repository on GitHub.
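
To confirm which ragas version is actually installed in the environment that runs the app, something like the sketch below could help. It only uses the standard library; installed_version is a hypothetical helper name, and I am not stating which ragas release first added _pbar, so compare the result against the current source yourself:

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string of a package, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints the installed ragas version, or None if ragas is missing from this environment.
print("ragas version:", installed_version("ragas"))
```

If the version turns out to be old, upgrading with pip install --upgrade ragas and restarting the Streamlit app should pick up the newer evaluate signature.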
