An Axiomatic Approach to Diagnosing Neural IR Models

Conference Paper (2019)

Author(s)

Daniël Rennings (Student TU Delft)

Felipe Moraes Gomes (TU Delft - Web Information Systems)

C Hauff (TU Delft - Web Information Systems)

Research Group

Web Information Systems

DOI related publication

https://doi.org/10.1007/978-3-030-15712-8_32

To reference this document use:

https://resolver.tudelft.nl/uuid:f0e04c03-d4be-498d-b1d6-4b7ee637f9e6

Publication Year

2019

Language

English

Pages (from-to)

489-503

ISBN (print)

978-3-030-15711-1

ISBN (electronic)

978-3-030-15712-8

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Traditional retrieval models such as BM25 or language models were engineered based on search heuristics that were later formalized into axioms. The axiomatic approach to information retrieval (IR) has shown that the effectiveness of a retrieval method is connected to its fulfillment of axioms. This approach enabled researchers to identify shortcomings in existing approaches and “fix” them. With the new wave of neural-network-based approaches to IR, a theoretical analysis of those retrieval models is no longer feasible, as they potentially contain millions of parameters. In this paper, we propose a pipeline to create diagnostic datasets for IR, each engineered to fulfill one axiom. We execute our pipeline on the recently released large-scale question answering dataset WikiPassageQA (which contains over 4,000 topics) and create diagnostic datasets for four axioms. We empirically validate to what extent well-known deep IR models are able to realize the axiomatic pattern underlying the datasets. Our evaluation shows that there is indeed a positive relation between the performance of neural approaches on diagnostic datasets and their retrieval effectiveness. Based on these findings, we argue that diagnostic datasets grounded in axioms are a good approach to diagnosing neural IR models.
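
To make the idea of a diagnostic dataset concrete, the following is a minimal sketch of how one might be assembled and used. It relies on TFC1, a term-frequency constraint from the axiomatic IR literature, purely as an illustration: for a single-term query and two documents of equal length, the document containing the query term more often should score higher. The abstract does not name the four axioms, so TFC1 is an assumption here rather than a statement about the paper. The function names (`tfc1_pairs`, `axiom_accuracy`, `tf_scorer`, `length_scorer`) and the toy documents are hypothetical and are not taken from the paper or from WikiPassageQA.

```python
from collections import Counter
from itertools import combinations
from typing import Callable, List, Tuple

Doc = List[str]
# A scorer maps (query term, tokenized document) to a relevance score.
Scorer = Callable[[str, Doc], float]


def tfc1_pairs(term: str, docs: List[Doc]) -> List[Tuple[Doc, Doc]]:
    """Collect (d_hi, d_lo) pairs of equal length where d_hi contains
    the query term strictly more often than d_lo (the TFC1 precondition)."""
    pairs = []
    for a, b in combinations(docs, 2):
        if len(a) != len(b):
            continue  # TFC1 only compares documents of equal length
        tf_a, tf_b = a.count(term), b.count(term)
        if tf_a > tf_b:
            pairs.append((a, b))
        elif tf_b > tf_a:
            pairs.append((b, a))
    return pairs


def axiom_accuracy(scorer: Scorer, term: str,
                   pairs: List[Tuple[Doc, Doc]]) -> float:
    """Fraction of pairs for which the scorer prefers d_hi over d_lo."""
    if not pairs:
        return 0.0
    hits = sum(scorer(term, hi) > scorer(term, lo) for hi, lo in pairs)
    return hits / len(pairs)


def tf_scorer(term: str, doc: Doc) -> float:
    """Hypothetical scorer: raw term frequency of the query term."""
    return float(Counter(doc)[term])


def length_scorer(term: str, doc: Doc) -> float:
    """Hypothetical scorer that ignores the query entirely."""
    return float(len(doc))


# Toy documents, invented for this sketch.
docs = [
    "neural ranking models rank passages".split(),
    "neural models use neural layers".split(),
    "axioms describe good ranking".split(),
    "neural neural neural neural".split(),
]
pairs = tfc1_pairs("neural", docs)
tf_acc = axiom_accuracy(tf_scorer, "neural", pairs)
len_acc = axiom_accuracy(length_scorer, "neural", pairs)
```

In this sketch a scorer that counts term occurrences agrees with every extracted pair, while a scorer that only looks at document length agrees with none, because the paired documents have equal length by construction. A neural ranker could be plugged in through the same `Scorer` interface, and its agreement rate read as a rough indicator of how well it realizes the axiom.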

Files

ECIR2019_rennings_3_.pdf

(pdf | 0.373 Mb)

License info not available