Learning to Rank from Relevance Judgments Distributions

Alberto Purpura; Gianmaria Silvello; Gian Antonio Susto

IIR 2022

Conference paper

28 Jun 2022

Learning to Rank from Relevance Judgments Distributions

Abstract

LEarning TO Rank (LETOR) algorithms are usually trained on annotated corpora where a single relevance label is assigned to each available document-topic pair. Within the Cranfield framework, relevance labels result from merging either multiple expertly curated or crowdsourced human assessments. In this paper, we explore how to train LETOR models with relevance judgments distributions (either real or synthetically generated) assigned to document-topic pairs instead of single-valued relevance labels. We propose five new probabilistic loss functions to deal with the higher expressive power provided by relevance judgments distributions and show how they can be applied both to neural and gradient boosting machine (GBM) architectures. Overall, we observe that relying on relevance judgments distributions to train different LETOR models can boost their performance and even outperform strong baselines such as LambdaMART on several test collections.

Paper

Identifying single-item faked responses in personality tests: A new TF-IDF-based method

Alberto Purpura, Dora Giorgianni, et al.

PLoS ONE

View all publications

Abstract

Related

Identifying single-item faked responses in personality tests: A new TF-IDF-based method