- document
-
Rudra, Koustav (author), Fernando, Zeon Trevor (author), Anand, A. (author)Pre-trained contextual language models such as BERT, GPT, and XLnet work quite well for document retrieval tasks. Such models are fine-tuned based on the query-document/query-passage level relevance labels to capture the ranking signals. However, the documents are longer than the passages and such document ranking models suffer from the token...journal article 2023
- document
-
Smirnova, Alisa (author), Yang, J. (author), Yang, Dingqi (author), Cudre-Mauroux, Philippe (author)Noisy labels represent one of the key issues in supervised machine learning. Existing work for label noise reduction mainly takes a probabilistic approach that infers true labels from data distributions in low-level feature spaces. Such an approach is not only limited by its capability to learn high-quality data representations, but also by...journal article 2022