Large Language Models (LLMs) are increasingly used to generate fact-checking explanations, but evaluating how faithful these justifications are remains a major challenge. In this paper, we examine how well four popular automatic metrics (G-Eval, UniEval, FactCC, and QAGS) capture faithfulness compared with expert-written explanations. We analyze how these metrics agree with each other, how they correlate with explanation similarity, and how they respond to controlled errors. Our findings show that while some metrics, such as UniEval and FactCC, exhibit some sensitivity to noise and partial alignment with expert reasoning, none of them reliably catch hallucinations or consistently reflect true faithfulness. Manual analysis further reveals that metric behavior varies with the type and structure of the claim. Overall, current metrics are only moderately effective and often biased toward the style of LLM-generated text. This study highlights the need for more reliable, context-aware evaluation methods and offers practical insights for improving how faithfulness is measured in fact-checking tasks.