Performance Comparison of Different Query Expansion and Pseudo-Relevance Feedback Methods

A comparison of Bo1, KL, RM3, and Axiomatic Query Expansion against BM25

Bachelor thesis (2024)

Authors

L.J.P. de Swart Electrical Engineering, Mathematics and Computer Science

Contributors

L.J.L. Leonhardt Web Information Systems - (mentor)

A. Anand Web Information Systems - (mentor)

A. Hanjalic Intelligent Systems (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science

Information Retrieval Axiomatic Query Expansion KL RM3 Bo1

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:e5d59495-35c2-48d3-9447-4a55c8da51bf

Published Date

28-06-2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

This paper is an analysis of the performance and logic behind different query expansion models. Query expansion and pseudo relevance feedback are techniques for adding more terms to a query based on the results of an initial query and the data in the body of documents. Four different query expansion models that are provided in the pyterrier python library and its extensions have been analysed, namely Bo1, KL, RM3, and Axiomatic query expansion. It was found that Axiomatic query expansion often does not perform any query expansion, and when it does, has no increase in performance. Bo1 and KL, although different in exact logic, have similar results most of the time. The most significant difference is the execution time, with Bo1 being faster with larger datasets and KL being faster with many documents on smaller datasets. Lastly, RM3 while not having a dominant performance has a lot of potential for good results with the right combination or parameters.

Files

Research_paper-3.pdf

(.pdf | 0.191 Mb)