This study investigates how hermeneutical injustices can become encoded in the Reinforcement Learning from Human Feedback (RLHF) processes used to fine-tune large language models (LLMs). While current research on LLMs has focused on bias and fairness, there remains a significant gap concerning subtler harms such as hermeneutical injustice. Using adults diagnosed with ADHD as a case study, this research explores how their unique communication and cognitive patterns may be misrepresented or excluded from the RLHF pipeline. The research adopts a qualitative literature review methodology, focusing specifically on real-world RLHF implementations by AI companies. The RLHF pipeline was divided into the stages of human feedback collection, reward modeling, and policy optimization, and each stage was then analyzed through the lens of hermeneutical injustice using three interpretive desiderata: representation, flexibility, and authenticity. The findings highlight several conceptual risks. Limited annotator diversity and restrictive feedback formats may exclude neurodivergent voices; reward models can unintentionally suppress atypical expressions; and policy optimization strategies, especially those prone to mode collapse, can erase certain communication styles. Overall, the study shows that without deliberate attention to epistemic inclusion, RLHF processes may perpetuate hermeneutical injustices and undermine the epistemic fairness of LLMs.