Toward Rank Correlation as a Measure of Confidence in Information Retrieval Experiment Results

None, None

Toward Rank Correlation as a Measure of Confidence in Information Retrieval Experiment Results

Master Thesis (2018)

Author(s)

A.I. Bavdaz (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Julián Urbano – Mentor

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Information Retrieval Rank Correlation Reliability Confidence Intervals

To reference this document use:

https://resolver.tudelft.nl/uuid:0bc29b72-2e55-4f93-a1d0-fe6885d003dd

More Info

expand_more

Publication Year

2018

Language

English

Copyright

Graduation Date

02-08-2018

Awarding Institution

Delft University of Technology

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In the field of Information Retrieval (IR), test collections are an important part of IR system evaluation. When evaluating IR systems on a test collection, the results may not accurately represent the performance of the systems on topics not contained in that test collection. Therefore, we want to get a sense of the accuracy of results on a given test collection. In this thesis, we use an approach that estimates the accuracy of test collections by estimating rank correlation between the observed and true mean scores of systems. We further evaluate this approach on new data and develop interval estimators as well. This way we provide a better sense of confidence on the system evaluation results by accounting for the inherent variability in sampling topics.

Files

Thesis_Final.pdf

(pdf | 0.467 Mb)

License info not available