J. Yang | TU Delft Repository

Iterative Prompt Refinement via Knowledge Alignment: A Case Study in Systematic Review Screening

Master thesis (2025) - A.S. Kuiper (author) , Jie Yang (mentor) , C. Lofi (graduation committee member) , Pradeep Murukannaiah (graduation committee member)

Applying Large Language Models (LLMs) to high-stakes classification tasks like systematic review screening is challenged by prompt sensitivity and a lack of transparency. We introduce IMAPR (Iterative Multi-signal Adaptive Prompt Refinement), a novel framework where a single LLM ...

Unheard and Misunderstood

Reinforcing Hermeneutical Justice in Annotation Design for ADHD Voices

Bachelor thesis (2025) - A. Yotkov (author) , J. Yang (mentor) , A. Arzberger (mentor) , M.L. Tielman (graduation committee member)

The main way large language models (LLMs) learn to represent and interpret various experiences is through the process of supervised fine-tuning (SFT). However, current practices are not designed to be inclusive for people with ADHD, which leads to generative hermeneutical ignoran ...

Incorporating User Feedback into Post-Training LLM Improvement to Promote Hermeneutical Justice

An interface to amplify marginalized voices

Bachelor thesis (2025) - A. Turgut (author) , A. Arzberger (mentor) , J. Yang (mentor) , M.L. Tielman (graduation committee member)

Generative AI can contribute to the misunderstanding or erasure of marginalized groups due to the insufficient nuanced data on their lived experiences. This limits the shared un- derstanding of their perspectives and contributes to a phenomenon called hermeneutical epistemic inju ...

Unheard and Misunderstood: Addressing Injustice in LLMs

How are hermeneutical injustices encoded in Reinforcement Learning from Human Feedback (RLHF) in the context of LLMs?

Bachelor thesis (2025) - I. Mockaitytė (author) , A. Arzberger (mentor) , J. Yang (mentor) , M.L. Tielman (graduation committee member)

This study investigates how hermeneutical injustices can become encoded in the Reinforcement Learning from Human Feedback processes used to fine-tune large language models (LLMs). While current research on fairness in LLMs has focused on bias and fairness, there remains a signifi ...

Unheard and Misunderstood

Tracing Hermeneutical Injustice in ADHD Narratives Generated by Large Language Models

Bachelor thesis (2025) - D. Zhang (author) , J. Yang (mentor) , A. Arzberger (mentor) , M.L. Tielman (graduation committee member)

This study investigates how large language models (LLMs) narrate ADHD-related experiences and whether their narrative forms give rise to hermeneutical injustice. Rather than comparing experience itself, this study analyzes how experiences are narrated. Using a hybrid coding strat ...

Prompt Engineering for Hermeneutical Justice in LLMs

An Empirical Study on ADHD-Related Causal Reasoning

Bachelor thesis (2025) - S. Sankara Subramanian Lakshmi (author) , J. Yang (mentor) , A. Arzberger (mentor) , M.L. Tielman (graduation committee member)

Large Language Models are increasingly integrated into everyday applications, but their responses often reflect dominant cultural narratives, which can lead to misrepresentation of marginalized communities. This paper addresses the underexplored issue of hermeneutical epistemic i ...

Assistance Required: A Qualitative Study of Researcher Needs for AI Research Assistants

Master thesis (2025) - M.J.C. Otten (author) , Jie Yang (mentor) , P.K. Murukannaiah (mentor) , Luciano C. Siebert (graduation committee member)

The use of research assistants has increased significantly, providing support and automation for researchers. However, there is limited research on researchers using research assistants and what assistance researchers require for each research stage.
We interview researchers ...

From recognition to understanding: enriching visual models through multi-modal semantic integration

Doctoral thesis (2025) - Shahin Sharifi Noorian (author) , G.J.P.M. Houben (promotor) , Alessandro Bozzon (promotor) , J. Yang (copromotor)

This thesis addresses the semantic gap in visual understanding, improving visual models with semantic reasoning capabilities so they can handle tasks like image captioning, question-answering, and scene understanding. The main focus is on integrating visual and textual data, leve ...

Contrastive Self-Explanation Method (CoSEM): Generating Large Language Model Contrastive Self-Explanations

Master thesis (2024) - R. Kargul (author) , Jie Yang (mentor) , S.E. Carter (mentor) , Stefan Buijsman (mentor) , Maria S. Pera (graduation committee member) , M.L. Tielman (graduation committee member)

Large language models (LLMs) are widely used tools that assist us by answering various questions. Humans implicitly use contrast as a natural way to think about and seek explanations (i.e., "Why A and not B?"). Explainability is a challenging aspect of LLMs, as we do not truly un ...

Understanding Users’ Contextual Factors and Personal Values for Watching YouTube Videos:

A Crowdsourcing Approach with Personal Reflection Integration

Master thesis (2024) - Y. Zhang (author) , U.K. Gadiraju (mentor) , Jie Yang (mentor) , Huijuan Wang (graduation committee member) , Di Yan (mentor)

User feedback plays a significant role in helping recommendation systems to make personalized and accurate predictions. Despite the fact that many methods of collecting user feedback have been proposed, little research exists that addresses both the breadth and depth of data coll ...

Enhancing Sentence Decomposition in Large Language Models Through Linguistic Features

Master thesis (2024) - X. XU (author) , Maria S. Pera (mentor) , Jie Yang (mentor) , S. Dumančić (graduation committee member) , G. He (mentor)

This thesis investigates the enhancement of sentence decomposition in Large Language Models (LLMs) through the integration of linguistic features, including constituency parsing, dependency parsing, and abstract meaning representation. Traditional decomposition methods, which of ...

Developing a user-centered explainability tool to support the NLP Data Scientist in creating LLM-based solutions

Master thesis (2024) - J.W. Nelen (author) , C. Lofi (mentor) , J Yang (mentor) , Jan Van Gemert (graduation committee member) , F. Hermsen (mentor)

With the advent of large language models (LLMs), developing solutions for Natural Language Processing (NLP) tasks has become more approachable. However, these models are opaque, which presents several challenges, such as prompt engineering, quality assessment, and error analysis. ...

With the advent of large language models (LLMs), developing solutions for Natural Language Processing (NLP) tasks has become more approachable. However, these models are opaque, which presents several challenges, such as prompt engineering, quality assessment, and error analysis. Explainability methods can have several potential benefits, such as improving accuracy, increasing trust, and assessing quality. However, limited research exists on how explainability techniques can be applied to LLMs in practice, particularly using human-centred methodologies. Therefore, this study takes a user-centered approach, investigating the needs and challenges of the NLP data scientist and developing an explainability tool to address these needs. This approach is done by conducting a formative study to deepen our understanding of the user, combined with relevant literature. The observations from the formative study were used to develop a tool tailored to the user’s specific needs. This development was done by creating requirements and a design based on the findings of the formative study, followed by a proof of concept implementation. User satisfaction was assessed through practical interviews with a fairness dataset, providing insights into the usefulness and usability of the explanation techniques and the tool. The tool implements three explanation techniques: uncertainty, token-level feature attribution, and contrastive explanations. These can be viewed using a web application separated from the Python development environment, making it easy to interact with. Other key features are that it can be easily integrated into the user’s existing workflow, is usable in practice and can be presented to different stakeholders within the project. The evaluation concluded that the tool fits the workflow and does indeed help the NLP data scientist to understand the model. However, the evaluation also showed that the explainability techniques did not provide the necessary insights to achieve the user’s goal, mainly to improve the model’s accuracy and make the error analysis actionable. More research should be done to see which other explainability techniques could provide insights that would lead to objectively better performance of these models. Finally, more explainability techniques should be developed that do not focus on debugging the model but rather on revealing its behaviour and thus providing a better understanding of how to improve it.

Text summarisation in healthcare to reduce workload

Summarising patient experiences for healthcare professionals

Master thesis (2024) - J.M. Dannenberg (author) , Jiwon Jung (graduation committee member) , Christoph Lofi (graduation committee member) , J Yang (mentor) , Neil Yorke-Smith (graduation committee member)

Summarising patient interactions creates a huge workload for the healthcare professionals. This research finds that patient interactions contain a lot of noise that is subjective of nature. To explore the problem area interviews with a summarisation prototype have been conducted ...

Split Inference on Networked Microcontrollers

Master thesis (2024) - J. Lu (author) , Q. Wang (mentor) , Jie Yang (coach)

With the rapid development of Artificial Intelligence (AI), the size and complexity of models are increasing rapidly. The limited memory and computing power of microcontroller units (MCUs) pose significant challenges for running AI applications on them. This thesis presents a met ...

A study on bias against women in recruitment algorithms

Surveying the fairness literature in the search for a solution

Bachelor thesis (2024) - J.H. van den Berg (author) , Sarah E. Carter (mentor) , Jie Yang (mentor) , Marcus M. Specht (graduation committee member) , S.N.R. Buijsman (graduation committee member)

Algorithms have a more prominent presence than ever in the domain of recruitment. Many different tasks ranging from finding candidates to scanning resumes are handled more and more by algorithms and less by humans. Automating these tasks has led to bias being exhibited towards di ...

Influence of Data Processing on the Algorithm Fairness vs. Accuracy Trade-off

Building Pareto Fronts for Equitable Algorithmic Decisions

Bachelor thesis (2024) - A.D. Salvi (author) , Sarah E. Carter (mentor) , Jie Yang (mentor) , Marcus M. Specht (graduation committee member) , S.N.R. Buijsman (graduation committee member)

Algorithmic bias due to training from biased data is a widespread issue. Bias mitigation techniques such as fairness-oriented data pre-, in-, and post-processing can help but usually come at the cost of model accuracy. For this contribution, we first conducted a literature review ...

From Data to Decision

Investigating Bias Amplification in Decision-Making Algorithms

Bachelor thesis (2024) - E. Mihalache (author) , Sarah E. Carter (mentor) , Jie Yang (mentor) , S.N.R. Buijsman (graduation committee member) , Marcus M. Specht (graduation committee member)

This research investigates how biases in datasets influence the outputs of decision-making algorithms, specifically whether these biases are merely reflected or further amplified by the algorithms. Using the Adult/Census Income dataset from the UCI Machine Learning Repository, th ...

Leveraging Database Honeypots to Gather Threat Intelligence

Master thesis (2024) - Y. Song (author) , Harm Griffioen (mentor) , G. Smaragdakis (graduation committee member) , Asterios Katsifodimos (coach) , Jie Yang (coach)

In the digital age, the proliferation of personal data within databases has made them prime targets for cyberattacks. As the volume of data increases, so does the frequency and sophistication of these attacks. This thesis investigates database security threats by deploying open s ...

Blind Spot Illumination in LLMs through Data Valuation and Synthetic Sample Generation

Master thesis (2024) - Chun-Chi Chen (author) , Philip Lippmann (mentor) , Jie Yang (mentor) , A Katsifodimos (graduation committee member) , Q. Wang (coach)

Large language models (LMs) are increasingly used in critical tasks, making it important that these models can be trusted. The confidence an LM assigns to its prediction is often used to indicate how much trust can be placed in that prediction. However, a high confidence can be i ...

How Differently Do People Hate? Understanding The Linguistic Difference Of Regional English Hate Speech

Master thesis (2024) - B. Zhang (author) , Avishek Anand (graduation committee member) , J. Yang (mentor) , Sarah E. Carter (mentor)