Other component-wise evaluation
Component-wise evaluation involves evaluating individual components of the pipeline, such as the retrieval and generation stages, to gain insights into their effectiveness and identify areas for improvement. We already shared two metrics for each of these stages, but here are a couple more that are available in the ragas platform:
- Context relevancy: This metric gauges the relevancy of the retrieved context, calculated based on both the question and contexts. The values fall within the range of
(0-1), with higher values indicating better relevancy. - Context entity recall: This metric gives the measure of recall of the retrieved context, based on the number of entities present in both
ground_truthdata andcontextsdata relative to the number of entities present in theground_truthdata alone. Simply put, it is a measure of what fraction of entities are recalled fromground_truthdata. This metric is particularly useful in fact-based use cases...