LLM-based Answer Correctness outputs a score between 0.0 - 1.0 assessing the overall quality of the answer, given the question and ground truth answer.
Scoring rubric in LLM Prompt:
0.0 means that the answer is completely irrelevant to the question.
0.25 means that the answer is relevant to the question but contains major errors.
0.5 means that the answer is relevant to the question and is partially correct.
0.75 means that the answer is relevant to the question and is correct.
1.0 means that the answer is relevant to the question and is correct and complete.
Example Usage
Required data items: question, answer, ground_truths