A. Anand | TU Delft Repository

Low-Rank Ternary Adapters for Fine-Tuning Transformers

Master thesis (2025) - A.D. Manolache (author) , Yunqiang Li (mentor) , Jan Van Gemert (mentor) , Avishek Anand (graduation committee member)

Parameter-Efficient Fine-Tuning (PEFT) methods for Transformers are designed for floating-point weights. When applied to extremely low-bit models (e.g., ternary {-1,0,1) they convert the base weights to floating point (dequantization) to add the update and then quantize again, wh ...

Augmenting Fast-Forward Indexes: Dynamic Hybrid Re-Ranking with Parameter-Efficient Adaptors

Master thesis (2025) - A. Segura Lorente (author) , Avishek Anand (mentor) , Avishek Anand (graduation committee member) , L.J.L. Leonhardt (mentor) , Julián Urbano (graduation committee member)

Transformer-based architectures have significantly advanced the field of Information Retrieval (IR) by enabling semantic understanding that surpasses traditional term-frequency models. Hybrid approaches, which combine efficient sparse retrievers like BM25 with effective dense ret ...

Transformer-based architectures have significantly advanced the field of Information Retrieval (IR) by enabling semantic understanding that surpasses traditional term-frequency models. Hybrid approaches, which combine efficient sparse retrievers like BM25 with effective dense retrieval methods are becoming more popular for balancing performance and efficiency. However, the application of advanced hybrid systems often requires dedicated infrastructure and technical expertise, as existing toolkits are primarily research-oriented and not designed for simple integration.
To bridge this gap, this work introduces an open-source Elasticsearch plugin that implements the interpolation-based re-ranking framework from Leonhardt et al. This plugin simplifies the adoption of hybrid re-ranking by making the technique easily accessible within existing pipelines.
Furthermore, this work explores learning-based optimization methods to optimize the interpolation constant α. The research investigates two main strategies: (1) direct, gradient-based optimization to determine an optimized global value for α , and (2) the development of lightweight adaptor models that dynamically predict context-aware α values for each query or query-document pair. These adaptors are based on Feedforward Neural Networks and Neural Tensor Network architectures.
Empirical results suggest that these learning-based methods, particularly the dynamic adaptors, can outperform traditional grid search methods for tuning α, due to their dynamic adjustment in a query/document basis. The benefits were especially strong in out-of-domain scenarios, where the adaptors showed improved performance for encoders that were not pre-trained on the target domain, without requiring fine-tuning of the full backbone architecture.
This work delivers a practical, easy-to-use tool for hybrid re-ranking in Elasticsearch, a novel methodology for optimizing its core re-ranking parameter, and proposes adaptor models that can deliver better performance than a fixed interpolation value in hybrid re-ranking.

Data Hound: Linking Educational Value to LLM Code Completion Performance During Inference

Bachelor thesis (2025) - B.R.M. Annink (author) , Arie van Deursen (mentor) , Maliheh Izadi (mentor) , Jonathan Katzy (mentor) , R. M. Popescu (mentor) , Avishek Anand (graduation committee member)

This paper investigates the relation between the educational value of input code and the subsequent inference performance of code large language models (LLMs) on completion tasks. Results were attained using The Heap dataset and using SmolLM2, StarCoder 2 and Mellum models. Perfo ...

Analyzing the Impact of Self-Admitted Technical Debt on the Code Completion Performance of Large Language Models

Bachelor thesis (2025) - L.C. Witte (author) , Arie van Deursen (mentor) , Maliheh Izadi (mentor) , Jonathan Katzy (mentor) , R. M. Popescu (mentor) , Avishek Anand (graduation committee member)

Large Language Models (LLMs) are increasingly integrated into development workflows for tasks such as code completion, bug fixing, and refactoring. While prior work has shown that removing low-quality data—including data smells like Self-Admitted Technical Debt (SATD)—from traini ...

Data hound: Analysing non-English data smells in large code datasets

Bachelor thesis (2025) - B.M. Buzatu (author) , Arie van Deursen (graduation committee member) , Maliheh Izadi (graduation committee member) , Jonathan Katzy (mentor) , R. M. Popescu (mentor) , Avishek Anand (graduation committee member)

Large Language Models (LLMs) are increasingly used for code-centric tasks. However, their training data often exhibits data smells that may hinder downstream quality. This research focuses on the “Uneven Natural Languages” smell and the presence of non-English text in source code ...

Practical Neuron-level Pruning Framework for Bayesian Neural Networks

Student report (2025) - V. Kuboň (author) , Luca Laurenti (graduation committee member) , Steven Adams (mentor) , Avishek Anand (graduation committee member)

Bayesian Neural Networks (BNNs) offer uncertainty quantification but are computationally expensive, limiting their practical deployment. This paper introduces a neuron-level pruning framework that reduces BNN complexity while preserving predictive performance. Unlike existing wei ...

Detecting Patient Information Conflicts through Conflict Reasoning in Knowledge Graphs

Enhancing Accuracy and Reliability in a Diabetes Support System

Bachelor thesis (2025) - J.M. van Paridon (author) , Catholijn M. Jonker (mentor) , J.D. Top (mentor) , Avishek Anand (graduation committee member)

Lifestyle management systems aim to provide personalized health guidance by interpreting patient's self-reported data. However, these systems often overlook the temporal consistency of behavioral patterns, risking inaccurate or misleading recommendations. To address this, we pres ...

To Deceive or Self-Deceive?

Framing Language to Discourage Deception in Diabetes Lifestyle Management Systems

Bachelor thesis (2025) - M. Mădăraș (author) , Catholijn M. Jonker (mentor) , J.D. Top (mentor) , Avishek Anand (graduation committee member)

Deceptive self-reporting in diabetes lifestyle management (DLM) systems limits their ability to offer meaningful and accurate support. Deception can function as a self-protective mechanism, driven by factors such as low self-esteem or the desire to protect self-image. This resear ...

Enhancing Diabetes Care through AI-Driven Lie Detection in a Diabetes Support System

Testing the validity of lie detection using an SVM model trained on linguistic cues

Bachelor thesis (2025) - R.L.T. van Westerlaak (author) , Catholijn M. Jonker (mentor) , Avishek Anand (graduation committee member) , J.D. Top (mentor)

This paper presents a deception-detection module for a diabetes support system, addressing the challenge of unreliable patient self-reporting and ultimately attempting to improve diabetes care. The research is for a system called CHIP developed by the Hybrid Intelligence project ...

Detecting Patient Deception and Adherence in Diabetes Support Using AI-Generated Conversation Summaries

Leveraging chat summaries to Enhance Doctor-Patient Communication

Bachelor thesis (2025) - H.M.G. Koot (author) , Catholijn M. Jonker (mentor) , J.D. Top (mentor) , Avishek Anand (graduation committee member)

Unreliable patient self-reporting complicates diabetes management. This study investigates how AI-generated summaries of patient-chatbot conversations can be structured to help healthcare professionals detect deception and non-adherence. To address this, we developed a novel pipe ...

Entropy-Based Modeling For Detecting Behavioral Anomalies in Users of a Diabetes Lifestyle Management Support System

Identifying non-adherence indicators in a chatbot-based diabetes support system

Bachelor thesis (2025) - Sorin - Andrei Ciuntu Ciuntu (author) , CM Jonker (mentor) , J.D. Top (mentor) , Avishek Anand (graduation committee member)

Individuals with diabetes face rigorous demands when it comes to managing their health, yet patients sometimes struggle to stay adherent to treatment. CHIP is an AI-based conversational platform that allows patients to report lifestyle factors and receive personalized suppor ...

Efficient Query Estimation by Vector Averaging in Dual-Encoder Re-Ranking

Estimating Query Embeddings as Weighted Average of Document Embeddings and Lightweight Query Encoding

Master thesis (2025) - B. van den Berg (author) , L.J.L. Leonhardt (mentor) , Avishek Anand (mentor) , Avishek Anand (graduation committee member) , Julián Urbano (graduation committee member)

A central problem in information retrieval (IR) is passage ranking, where the task is to retrieve passages from a corpus and order them in decreasing relevance to an arbitrary search query.
Traditional lexical retrieval methods are susceptible to the vocabulary mismatch probl ...

Exploring Neural IR Approaches in Europeana

Unlocking Multilingual Insights for Cultural Heritage Search

Master thesis (2025) - S. Basir (author) , Julián Urbano (mentor) , Avishek Anand (graduation committee member) , Monica Marrero (mentor)

Europeana is a digital library of Europe's cultural heritage, housing a large corpus of data representing artworks, literature, historical locations and many culturally significant items. Europeana currently relies of traditional text-matching retrieval, such as BM25, to facilita ...

Europeana is a digital library of Europe's cultural heritage, housing a large corpus of data representing artworks, literature, historical locations and many culturally significant items. Europeana currently relies of traditional text-matching retrieval, such as BM25, to facilitate their search and discovery across millions of multilingual metadata-based records. However, these models are not capable of semantic understanding and require additional treatments to facilitate multilingual retrieval which costs Europeana resources, these treatments entail translating queries and data from other language into English and enriching content by adding entities from linked open data. Europeana's current methodology is ultimately limited in its ability to provide semantically relevant multilingual search results.

This thesis investigates the application of Neural Information Retrieval (NIR) to enhance Europeana's search capabilities. This investigation aims to assess the impact of NIR on multilingual retrieval and retrieval performance while also determining the value of existing translation and enrichment processes. To support this investigation, we contribute by developing a structured and preprocessed dataset specifically for NIR, as no such dataset previously existed for NIR. We conduct an extensive evaluation of NIR models, analyzing the impact of fine-tuning, query treatments, and document treatments on retrieval quality. Additionally, we assess the computational requirements, scalability, and practicality of deploying NIR, identifying trade-offs in retrieval efficiency and resource consumption, to provide an idea of an infrastructure Europeana would need to implement NIR.

This research required meticulous planning across all stages—from data collection and formatting to model training and evaluation—since applying NIR at this scale for metadata search is new for Europeana. Therefore, research not only provides insights into the viability of NIR as a replacement or enhancement to Europeana's existing search system but also lays the foundation for future advancements in multilingual retrieval for Europeana.

Through this thesis, we found that NIR models can offer promising improvements in multilingual retrieval and semantic search, reducing reliance on exact term matching. Our analysis suggests that not all of Europeana’s current preprocessing treatments are necessary for NIR models, as they inherently capture cross-lingual relationships more effectively than BM25, though the benefits vary depending on the model and configuration used. Overall, we recommend that a hybrid retrieval system that leverages both lexical and neural approaches may be the most practical solution for Europeana and warrants further exploration.

The integration of NIR presents several challenges, particularly in terms of infrastructure and evaluation. NIR models are sensitive to changes in document structure and content, requiring careful consideration of indexing and fine-training. Furthermore, while these models improve semantic search, they may struggle with entity-based queries, where BM25’s exact matching approach remains valuable.

A major limitation of this study was the absence of explicit relevance judgements in our dataset, which constrained our ability to make definitive conclusions about retrieval effectiveness. Future work should prioritize the development of a comprehensive evaluation framework, incorporating expert and user-based relevance assessments, to enable a more robust analysis of NIR’s impact.

The Impact of the Retrieval Stage in Interpolation-based Re-Ranking

Bachelor thesis (2024) - D.C. Ciacu (author) , L.J.L. Leonhardt (mentor) , Avishek Anand (graduation committee member) , Alan Hanjalic (coach)

Efficient and effective information retrieval (IR) systems are needed to fetch a large number of relevant documents and present them based on their relevance to the input queries. Previous work reported the use of sparse and dense retrievers. Sparse retrievers offer low latency b ...

Finding the Needle in the Pre-Trained Model Zoo

The Use of Rich Metadata and Graph Learning to Estimate Task Transferability

Master thesis (2024) - Hilco Van Der Wilk (author) , R. Hai (mentor) , Z. Li (mentor) , Avishek Anand (graduation committee member) , Q. Song (graduation committee member)

The democratization of machine learning through public repositories, often known as model zoos, has significantly increased the availability of pre-trained models for practitioners. However, this abundance can make it difficult to choose the most suitable pre-trained model for fi ...

Ripple Watermarking for Latent Tabular Diffusion Models

Master thesis (2024) - J. Tang (author) , Y. Chen (mentor) , A. Anand (graduation committee member)

Synthetic tabular data generated by tabular generative models represent an effective means of augmenting and sharing data. It is of paramount importance to trace and audit such synthetic data, avoiding potential harms and risks associated with inappropriate usage. While watermark ...

Improving Adversarial Attacks on Decision Tree Ensembles

Exploring the impact of starting points on attack performance

Master thesis (2024) - M. Pigmans (author) , S.E. Verwer (mentor) , Avishek Anand (graduation committee member)

Most of the adversarial attacks suitable for attacking decision tree ensembles work by doing multiple local searches from randomly selected starting points, around the to be attacked victim. In this thesis we investigate the impact of these starting points on the performance of t ...

Most of the adversarial attacks suitable for attacking decision tree ensembles work by doing multiple local searches from randomly selected starting points, around the to be attacked victim. In this thesis we investigate the impact of these starting points on the performance of the attack, and find that the starting points significantly impact the performance: some do much better than others. However, we do find that this is not the case for all attacked points, as there are large differences between points in how difficult they are to attack and for all datasets some points are always optimally attacked.

We compare the baseline randomly selected points to three alternative strategies. First, we try alternate random distributions, playing with both the standard deviation, to create a more narrow cone around the victim point, and mean, creating bimodal distributions further away from the victim point. We find that for some datasets these can give up to $5$-$7\%$ improved performance on subsets of the dataset, but these improvements do not generalize to the remainder of the dataset. In general, as long as the distribution is wide enough to successfully find starting points we do not find a substantial performance change.

Secondly, we try to remove the randomness and attack from a fixed direction. For the simpler datasets we find it is possible for a starting direction to perform better than random starting points, but for larger datasets performance becomes much worse. We also try an attack from all main directions around the victim point, which we find performs much worse than $5$-$20$ times fewer random points.

Lastly, we create an attack strategy where we select the closest points that scored well on previously attacked victims. We find that on smaller test sets this gets outperformed by the baseline, but when we extend the attack and give more possible previously well performing starting points we match or outperform the baseline slightly.

Ultra Low Latency Object Tracking for Tactile Internet

Master thesis (2024) - B. Zeybekoglu (author) , Rangarao Venkatesha Prasad (mentor) , Avishek Anand (graduation committee member) , Herman Kroep (graduation committee member)

Bilateral teleoperation with force feedback aims to transmit human expertise over long distances by transferring the sensation of physical contact. One of the primary challenges in achieving this goal is the ultra low latency requirement. Tactile internet and ...

How Differently Do People Hate? Understanding The Linguistic Difference Of Regional English Hate Speech

Master thesis (2024) - B. Zhang (author) , Avishek Anand (graduation committee member) , J. Yang (mentor) , Sarah E. Carter (mentor)

Logs to the Rescue

Creating meaningful representations from log files for Anomaly Detection

Master thesis (2023) - G.H.R. Timmerman (author) , S.E. Verwer (mentor) , Avishek Anand (graduation committee member) , T. Mulder (graduation committee member)

This thesis offers a comprehensive exploration of log-based anomaly detection within the domain of cybersecurity incident response. The research describes a different approach and explores relevant log features for language model training, experimentation with different language ...