Distributionally robust learning-to-rank under the Wasserstein metric

被引:0
|
作者
Sotudian, Shahabeddin [1 ]
Chen, Ruidi [1 ]
Paschalidis, Ioannis Ch. [1 ,2 ,3 ]
机构
[1] Boston Univ, Dept Elect & Comp Engn, Div Syst Engn, Boston, MA 02215 USA
[2] Boston Univ, Dept Biomed Engn, Boston, MA 02215 USA
[3] Boston Univ, Fac Comp & Data Sci, Boston, MA 02215 USA
来源
PLOS ONE | 2023年 / 18卷 / 03期
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
ALGORITHM;
D O I
10.1371/journal.pone.0283574
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite their satisfactory performance, most existing listwise Learning-To-Rank (LTR) models do not consider the crucial issue of robustness. A data set can be contaminated in various ways, including human error in labeling or annotation, distributional data shift, and malicious adversaries who wish to degrade the algorithm's performance. It has been shown that Distributionally Robust Optimization (DRO) is resilient against various types of noise and perturbations. To fill this gap, we introduce a new listwise LTR model called Distributionally Robust Multi-output Regression Ranking (DRMRR). Different from existing methods, the scoring function of DRMRR was designed as a multivariate mapping from a feature vector to a vector of deviation scores, which captures local context information and cross-document interactions. In this way, we are able to incorporate the LTR metrics into our model. DRMRR uses a Wasserstein DRO framework to minimize a multi-output loss function under the most adverse distributions in the neighborhood of the empirical data distribution defined by a Wasserstein ball. We present a compact and computationally solvable reformulation of the min-max formulation of DRMRR. Our experiments were conducted on two real-world applications: medical document retrieval and drug response prediction, showing that DRMRR notably outperforms state-of-the-art LTR models. We also conducted an extensive analysis to examine the resilience of DRMRR against various types of noise: Gaussian noise, adversarial perturbations, and label poisoning. Accordingly, DRMRR is not only able to achieve significantly better performance than other baselines, but it can maintain a relatively stable performance as more noise is added to the data.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Learning-to-Rank with Nested Feedback
    Sagtani, Hitesh
    Jeunen, Olivier
    Ustimenko, Aleksei
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 306 - 315
  • [42] Data-driven affinely adjustable distributionally robust framework for unit commitment based on Wasserstein metric
    Hou, Wenting
    Zhu, Rujie
    Wei, Hua
    Hiep TranHoang
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2019, 13 (06) : 890 - 895
  • [43] Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations
    Peyman Mohajerin Esfahani
    Daniel Kuhn
    Mathematical Programming, 2018, 171 : 115 - 166
  • [44] Adaptive dynamic programming and distributionally robust optimal control of linear stochastic system using the Wasserstein metric
    Liang, Qingpeng
    Hu, Jiangping
    Shi, Kaibo
    Wu, Yanzhi
    Xiang, Linying
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (08) : 2810 - 2832
  • [45] Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations
    Esfahani, Peyman Mohajerin
    Kuhn, Daniel
    MATHEMATICAL PROGRAMMING, 2018, 171 (1-2) : 115 - 166
  • [46] Distributionally robust optimal dispatching method of integrated electricity and heating system based on improved Wasserstein metric
    Li, Hongwei
    Liu, Hongpeng
    Ma, Jianwei
    Li, Danyang
    Zhang, Wei
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 151
  • [47] Distributionally Robust Optimization Model for a Minimum Cost Consensus with Asymmetric Adjustment Costs Based on the Wasserstein Metric
    Wu, Ziqi
    Zhu, Kai
    Qu, Shaojian
    MATHEMATICS, 2022, 10 (22)
  • [48] On distributionally robust chance constrained programs with Wasserstein distance
    Weijun Xie
    Mathematical Programming, 2021, 186 : 115 - 155
  • [49] On Affine Policies for Wasserstein Distributionally Robust Unit Commitment
    Cho, Youngchae
    Yang, Insoon
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 7167 - 7173
  • [50] Wasserstein distributionally robust learning for predicting the cycle time of printed circuit board production
    Liu, Feng
    Lu, Yingjie
    Li, Debiao
    Chiong, Raymond
    COMPUTERS IN INDUSTRY, 2025, 164