Learning Rates for Nonconvex Pairwise Learning

被引:2
|
作者
Li, Shaojie [1 ]
Liu, Yong [1 ]
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing 100872, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Convergence; Stability analysis; Measurement; Training; Statistics; Sociology; Optimization; Generalization performance; learning rates; nonconvex optimization; pairwise learning; EMPIRICAL RISK; ALGORITHMS; STABILITY; RANKING; MINIMIZATION;
D O I
10.1109/TPAMI.2023.3259324
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pairwise learning is receiving increasing attention since it covers many important machine learning tasks, e.g., metric learning, AUC maximization, and ranking. Investigating the generalization behavior of pairwise learning is thus of great significance. However, existing generalization analysis mainly focuses on the convex objective functions, leaving the nonconvex pairwise learning far less explored. Moreover, the current learning rates of pairwise learning are mostly of slower order. Motivated by these problems, we study the generalization performance of nonconvex pairwise learning and provide improved learning rates. Specifically, we develop different uniform convergence of gradients for pairwise learning under different assumptions, based on which we characterize empirical risk minimizer, gradient descent, and stochastic gradient descent. We first establish learning rates for these algorithms in a general nonconvex setting, where the analysis sheds insights on the trade-off between optimization and generalization and the role of early-stopping. We then derive faster learning rates of order O(1/n) for nonconvex pairwise learning with a gradient dominance curvature condition, where n is the sample size. Provided that the optimal population risk is small, we further improve the learning rates to O(1/n(2)), which, to the best of our knowledge, are the first O(1/n(2)) rates for pairwise learning.
引用
收藏
页码:9996 / 10011
页数:16
相关论文
共 50 条
  • [41] Label ranking by learning pairwise preferences
    Huellermeier, Eyke
    Fuernkranz, Johannes
    Cheng, Weiwei
    Brinker, Klaus
    ARTIFICIAL INTELLIGENCE, 2008, 172 (16-17) : 1897 - 1916
  • [42] Pairwise Learning for Imbalanced Data Classification
    Liu, Shu
    Wu, Qiang
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 186 - 189
  • [43] LLR: Learning learning rates by LSTM for training neural networks
    Yu, Changyong
    Qi, Xin
    Ma, Haitao
    He, Xin
    Wang, Cuirong
    Zhao, Yuhai
    NEUROCOMPUTING, 2020, 394 (394) : 41 - 50
  • [44] Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning
    Yang, Zhenhuan
    Lei, Yunwen
    Wang, Puyu
    Yang, Tianbao
    Ying, Yiming
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] Graphical Convergence of Subgradients in Nonconvex Optimization and Learning
    Davis, Damek
    Drusvyatskiy, Dmitriy
    MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (01) : 209 - 231
  • [46] A Semismooth Newton Algorithm for High-Dimensional Nonconvex Sparse Learning
    Shi, Yueyong
    Huang, Jian
    Jiao, Yuling
    Yang, Qinglong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 2993 - 3006
  • [47] Learning Proximal Operator Methods for Nonconvex Sparse Recovery with Theoretical Guarantee
    Yang, Chengzhu
    Gu, Yuantao
    Chen, Badong
    Ma, Hongbing
    So, Hing Cheung
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 5244 - 5259
  • [48] Enhanced Acceleration for Generalized Nonconvex Low-Rank Matrix Learning
    Zhang, Hengmin
    Yang, Jian
    Du, Wenli
    Zhang, Bob
    Zha, Zhiyuan
    Wen, Bihan
    CHINESE JOURNAL OF ELECTRONICS, 2025, 34 (01) : 98 - 113
  • [49] Low-Rank Structure Learning via Nonconvex Heuristic Recovery
    Deng, Yue
    Dai, Qionghai
    Liu, Risheng
    Zhang, Zengke
    Hu, Sanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (03) : 383 - 396
  • [50] Learning Collaborative Sparsity Structure via Nonconvex Optimization for Feature Recognition
    Du, Zhaohui
    Chen, Xuefeng
    Zhang, Han
    Yan, Ruqiang
    Yin, Wotao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (10) : 4417 - 4430