Supervised Contrastive Learning Approach for Contextual Ranking

None, None; None, None; None, None; None, None

Supervised Contrastive Learning Approach for Contextual Ranking

Conference Paper (2022)

Author(s)

Abhijit Anand (L3S)

Jurek Leonhardt (L3S)

Koustav Rudra (Indian Institute of Technology (IIT))

Avishek Anand (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Web Information Systems

Interpolation Data augmentation Document ranking Ranking performance Supervised contrastive loss

DOI related publication

https://doi.org/10.1145/3539813.3545139 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:a63f859b-6be5-4314-a94f-951148392671

More Info

expand_more

Publication Year

2022

Language

English

Research Group

Web Information Systems

Pages (from-to)

61-71

ISBN (electronic)

978-1-4503-9412-3

Event

8th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2022 (2022-07-11 - 2022-07-12), Virtual, Online, Spain

Downloads counter

307

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Contextual ranking models have delivered impressive performance improvements over classical models in the document ranking task. However, these highly over-parameterized models tend to be data-hungry and require large amounts of data even for fine tuning. This paper proposes a simple yet effective method to improve ranking performance on smaller datasets using supervised contrastive learning for the document ranking problem. We perform data augmentation by creating training data using parts of the relevant documents in the query-document pairs. We then use a supervised contrastive learning objective to learn an effective ranking model from the augmented dataset. Our experiments on subsets of the TREC-DL dataset show that, although data augmentation leads to an increasing the training data sizes, it does not necessarily improve the performance using existing pointwise or pairwise training objectives. However, our proposed supervised contrastive loss objective leads to performance improvements over the standard non-augmented setting showcasing the utility of data augmentation using contrastive losses. Finally, we show the real benefit of using supervised contrastive learning objectives by showing marked improvements in smaller ranking datasets relating to news (Robust04), finance (FiQA), and scientific fact checking (SciFact).

Files

3539813.3545139.pdf

(pdf | 3.72 Mb)

- Embargo expired in 01-07-2023

License info not available