A. Anand | TU Delft Repository

Breaking the Lens of the Telescope

Online Relevance Estimation over Large Retrieval Sets

Conference paper (2025) - Mandeep Rathee (author) , V. Viswanathan (author) , Sean MacAvaney (author) , A. Anand (author)

Advanced relevance models, such as those that use large language models (LLMs), provide highly accurate relevance estimations. However, their computational costs make them infeasible for processing large document corpora. To address this, retrieval systems often employ a telescop ...

Quam

Adaptive Retrieval through Query Affinity Modelling

Conference paper (2025) - Mandeep Rathee (author) , Sean MacAvaney (author) , A. Anand (author)

A central task in information retrieval and the NLP communities is relevance modeling, which aims to rank documents based on their expressed information needs Many knowledge-intensive retrieval tasks are powered by a first-stage retrieval stage for context selection, followed by ...

Correctness is not Faithfulness in Retrieval Augmented Generation Attributions

Conference paper (2025) - Jonas Wallat (author) , Maria Heuss (author) , Maarten de Rijke (author) , A. Anand (author)

Large language models (LLMs) have transformed information retrieval through chat interfaces, but their hallucination tendencies pose significant risks. While Retrieval Augmented Generation (RAG) with citations has emerged as a solution by allowing users to verify responses throug ...

ir_explain

A Python Library of Explainable IR Methods

Conference paper (2025) - Sourav Saha (author) , Harsh Agarwal (author) , V. Viswanathan (author) , A. Anand (author) , Swastik Mohanty (author) , Debapriyo Majumdar (author) , Mandar Mitra (author)

While recent advancements in Neural Ranking Models have resulted in significant improvements over traditional statistical retrieval models, it is generally acknowledged that the use of large neural architectures and the application of complex language models in Information Retrie ...

Evaluating List Construction and Temporal Understanding capabilities of Large Language Models

Conference paper (2025) - Alexandru Dumitru (author) , V. Viswanathan (author) , Adam Jatowt (author) , A. Anand (author)

Large Language Models (LLMs) have demonstrated immense advances in a wide range of natural language tasks. However, these models are susceptible to hallucinations and errors on particularly temporal understanding tasks involving multiple entities in answers. In such tasks, they f ...

Causal Probing for Dual Encoders

Conference paper (2024) - Jonas Wallat (author) , Hauke Hinrichs (author) , A. Anand (author)

Dual encoders are highly effective and widely deployed in the retrieval phase for passage and document ranking, question answering, or retrieval-augmented generation (RAG) setups. Most dual-encoder models use transformer models like BERT to map input queries and output targets to ...

DISCO

DISCovering Overfittings as Causal Rules for Text Classification Models

Conference paper (2024) - Zijian Zhang (author) , Vinay Setty (author) , Yumeng Wang (author) , A. Anand (author)

With the rapid advancement of neural language models, the deployment of overparameterized models has surged, increasing the need for interpretable explanations comprehensible to human inspectors. Existing post-hoc interpretability methods, which often focus on unigram features of ...

Efficient Neural Ranking Using Forward Indexes and Lightweight Encoders

Journal article (2024) - L.J.L. Leonhardt (author) , Henrik Müller (author) , Koustav Rudra (author) , M. Khosla (author) , Abhijit Anand (author) , A. Anand (author)

Dual-encoder-based dense retrieval models have become the standard in IR. They employ large Transformer-based language models, which are notoriously inefficient in terms of resources and latency.We propose Fast-Forward indexes - vector forward indexes which exploit the semantic m ...

QuanTemp

A real-world open-domain benchmark for fact-checking numerical claims

Conference paper (2024) - V. Viswanathan (author) , Abhijit Anand (author) , A. Anand (author) , Vinay Setty (author)

With the growth of misinformation on the web, automated fact checking has garnered immense interest for detecting growing misinformation and disinformation. Current systems have made significant advancements in handling synthetic claims sourced from Wikipedia, and noteworthy prog ...

Data Augmentation for Sample Efficient and Robust Document Ranking

Journal article (2024) - Abhijit Anand (author) , L.J.L. Leonhardt (author) , Jaspreet Singh (author) , Koustav Rudra (author) , A. Anand (author)

Contextual ranking models have delivered impressive performance improvements over classical models in the document ranking task. However, these highly over-parameterized models tend to be data-hungry and require large amounts of data even for fine-tuning. In this article, we prop ...

The Surprising Effectiveness of Rankers Trained on Expanded Queries

Conference paper (2024) - Abhijit Anand (author) , V. Viswanathan (author) , Vinay Setty (author) , A. Anand (author)

An significant challenge in text-ranking systems is handling hard queries that form the tail end of the query distribution. Difficulty may arise due to the presence of uncommon, underspecified, or incomplete queries. In this work, we improve the ranking performance of hard or dif ...

Understanding the User

An Intent-Based Ranking Dataset

Conference paper (2024) - Abhijit Anand (author) , L.J.L. Leonhardt (author) , V. Viswanathan (author) , A. Anand (author)

As information retrieval systems continue to evolve, accurate evaluation and benchmarking of these systems become pivotal. Web search datasets, such as MS MARCO, primarily provide short keyword queries without accompanying intent or descriptions, posing a challenge in comprehendi ...

Temporal Blind Spots in Large Language Models

Conference paper (2024) - Jonas Wallat (author) , Adam Jatowt (author) , A. Anand (author)

Large language models (LLMs) have recently gained significant attention due to their unparalleled zero-shot performance on various natural language processing tasks. However, the pre-Training data utilized in LLMs is often confined to a specific corpus, resulting in inherent fres ...

In-Context Ability Transfer for Question Decomposition in Complex QA

Preprint (2023) - V. Viswanathan (author) , A. Anand (author) , Sourangshu Bhattacharya (author)

Answering complex questions is a challenging task that requires question decomposition and multistep reasoning for arriving at the solution. While existing supervised and unsupervised approaches are specialized to a certain task and involve training, recently proposed prompt-base ...

Explainable Information Retrieval

Conference paper (2023) - A. Anand (author) , Procheta Sen (author) , Sourav Saha (author) , Manisha Verma (author) , Mandar Mitra (author)

This tutorial presents explainable information retrieval (ExIR), an emerging area focused on fostering responsible and trustworthy deployment of machine learning systems in the context of information retrieval. As the field has rapidly evolved in the past 4-5 years, numerous appr ...

A Deep Reinforcement Learning Approach to Configuration Sampling Problem

Conference paper (2023) - Amir Abolfazli (author) , Jakob Spiegelberg (author) , A. Anand (author) , Gregory Palmer (author)

Configurable software systems have become increasingly popular as they enable customized software variants. The main challenge in dealing with configuration problems is that the number of possible configurations grows exponentially as the number of features increases. Therefore, ...

Probing BERT for Ranking Abilities

Conference paper (2023) - Jonas Wallat (author) , Fabian Beringer (author) , Abhijit Anand (author) , A. Anand (author)

Contextual models like BERT are highly effective in numerous text-ranking tasks. However, it is still unclear as to whether contextual models understand well-established notions of relevance that are central to IR. In this paper, we use probing, a recent approach used to analyze ...

Listwise Explanations for Ranking Models Using Multiple Explainers

Conference paper (2023) - L. Lyu (author) , A. Anand (author)

This paper proposes a novel approach towards better interpretability of a trained text-based ranking model in a post-hoc manner. A popular approach for post-hoc interpretability text ranking models are based on locally approximating the model behavior using a simple ranker. Since ...

Extractive Explanations for Interpretable Text Ranking

Journal article (2023) - L.J.L. Leonhardt (author) , Koustav Rudra (author) , A. Anand (author)

Neural document ranking models perform impressively well due to superior language understanding gained from pre-Training tasks. However, due to their complexity and large number of parameters these (typically transformer-based) models are often non-interpretable in that ranking d ...

An in-depth analysis of passage-level label transfer for contextual document ranking

Journal article (2023) - Koustav Rudra (author) , Zeon Trevor Fernando (author) , A. Anand (author)

Pre-trained contextual language models such as BERT, GPT, and XLnet work quite well for document retrieval tasks. Such models are fine-tuned based on the query-document/query-passage level relevance labels to capture the ranking signals. However, the documents are longer than the ...