Applied Sciences (Sep 2023)

Towards Robust Neural Rankers with Large Language Model: A Contrastive Training Approach

  • Ziyang Pan,
  • Kangjia Fan,
  • Rongyu Liu,
  • Daifeng Li

DOI
https://doi.org/10.3390/app131810148
Journal volume & issue
Vol. 13, no. 18
p. 10148

Abstract

Pre-trained language model-based neural rankers have been widely applied in information retrieval (IR). However, the robustness of current IR models has not received sufficient attention, even though it can significantly impact the user experience in practical applications. In this study, we focus on the ability of IR models to defend against query attacks while preserving their retrieval performance. We find that improving the robustness of IR models depends not only on model architecture and training methods but also on the quality of the training data. Unlike previous research, we use large language models (LLMs) to generate query variations that preserve the original query intent while exhibiting richer and more realistic expressions. Based on these LLM-generated query variations, we propose a novel contrastive training framework that substantially enhances the robustness of IR models to query perturbations. Specifically, we combine a contrastive loss in the representation space of query variations with the ranking loss during retrieval training to improve the model’s ability to capture the underlying semantics of queries. Experimental results on two public datasets, WikiQA and ANTIQUE, demonstrate that the proposed contrastive training approach effectively improves the robustness of models under query attacks while outperforming baselines in retrieval performance. Compared with the best baseline approach, the average robustness of Reranker IR models improves by 24.9%, 26.5%, 27.0%, and 75.0% on WikiQA and by 8.7%, 1.9%, 6.3%, and 13.6% on ANTIQUE, in terms of MAP (Mean Average Precision), MRR (Mean Reciprocal Rank), nDCG@10 (Normalized Discounted Cumulative Gain), and P@10 (Precision), respectively.
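The abstract states only that a contrastive loss over query-variation representations is combined with the ranking loss during retrieval training. The following is a minimal PyTorch-style sketch of such a joint objective, not the authors' released code: the InfoNCE formulation, the temperature tau, the pairwise ranking loss, and the mixing weight lam are illustrative assumptions rather than details taken from the paper.

# Sketch of a joint objective: ranking loss on (query, passage) scores plus a
# contrastive loss that pulls embeddings of an original query and its
# LLM-generated variation together. All hyperparameters below are assumptions.
import torch
import torch.nn.functional as F

def contrastive_query_loss(q_emb, var_emb, tau=0.05):
    # InfoNCE over query variations: each query should be most similar to its
    # own variation among all variations in the batch.
    q = F.normalize(q_emb, dim=-1)      # (B, d)
    v = F.normalize(var_emb, dim=-1)    # (B, d)
    logits = q @ v.t() / tau            # (B, B) similarity matrix
    targets = torch.arange(q.size(0), device=q.device)
    return F.cross_entropy(logits, targets)

def ranking_loss(pos_scores, neg_scores):
    # Pairwise softmax ranking loss: the relevant passage should outscore the negative.
    logits = torch.stack([pos_scores, neg_scores], dim=1)  # (B, 2)
    targets = torch.zeros(pos_scores.size(0), dtype=torch.long, device=pos_scores.device)
    return F.cross_entropy(logits, targets)

def combined_loss(pos_scores, neg_scores, q_emb, var_emb, lam=0.1):
    # Joint objective: retrieval quality plus robustness to query perturbations.
    return ranking_loss(pos_scores, neg_scores) + lam * contrastive_query_loss(q_emb, var_emb)

In this sketch, lam trades off retrieval accuracy against robustness; the paper's actual weighting scheme and loss forms may differ.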

Keywords