About 2,830 results
Open links in new tab
  1. Ragas

    Ragas is a library that helps you move from "vibe checks" to systematic evaluation loops for your AI applications. It provides tools to supercharge the evaluation of Large Language Model …

  2. Introduction | Ragas

    Ragas is a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines. RAG denotes a class of LLM applications that use external data to augment the …

  3. Metrics - Ragas

    Dec 4, 2025 · Ragas Office Hours - If you need help setting up Evals for your AI application, sign up for our Office Hours here.

  4. Get Started - Ragas

    For an in-depth explanation of the core concepts behind Ragas, check out the Core Concepts page. You can also explore the How-to Guides for specific applications of Ragas.

  5. Core Concepts - Ragas

    : Ragas Metrics Use our library of available metrics or create custom metrics tailored to your use case. Metrics for evaluating RAG, Agentic workflows and more... Test Data Generation …

  6. ️ How-to Guides - Ragas

    The how-to guides offer a more comprehensive overview of all the tools Ragas offers and how to use them. This will help you tackle messier real-world usecases when you’re using the …

  7. Evaluate a simple LLM application - Ragas

    The purpose of this guide is to illustrate a simple workflow for testing and evaluating an LLM application with ragas. It assumes minimum knowledge in AI application building and evaluation.

  8. Overview - Ragas

    In Ragas, we categorize metrics based on the type of output they produce. This classification helps clarify how each metric behaves and how its results can be interpreted or aggregated.

  9. ️ How-to Guides - Ragas

    Each guide in this section provides a focused solution to real-world problems that you, as an experienced user, may encounter while using Ragas. These guides are designed to be …

  10. Faithfulness - Ragas

    from ragas.dataset_schema import SingleTurnSample from ragas.metrics import Faithfulness sample = SingleTurnSample( user_input="When was the first super bowl?", response="The …