Coordinate Descent with Bandit Sampling

被引:0
作者
Salehi, Farnood [1 ]
Thiran, Patrick [1 ]
Celis, L. Elisa [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, Lausanne, Switzerland
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coordinate descent methods usually minimize a cost function by updating a random decision variable (corresponding to one coordinate) at a time. Ideally, we would update the decision variable that yields the largest decrease in the cost function. However, finding this coordinate would require checking all of them, which would effectively negate the improvement in computational tractability that coordinate descent is intended to afford. To address this, we propose a new adaptive method for selecting a coordinate. First, we find a lower bound on the amount the cost function decreases when a coordinate is updated. We then use a multi-armed bandit algorithm to learn which coordinates result in the largest lower bound by interleaving this learning with conventional coordinate descent updates except that the coordinate is selected proportionately to the expected decrease. We show that our approach improves the convergence of coordinate descent methods both theoretically and experimentally.
引用
收藏
页数:11
相关论文
共 50 条
[21]   Discrete Coordinate Descent (DCD) [J].
Farsa, Davood Zaman ;
Rahnamayan, Shahryar .
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, :184-190
[22]   On the complexity of parallel coordinate descent [J].
Tappenden, Rachael ;
Takac, Martin ;
Richtarik, Peter .
OPTIMIZATION METHODS & SOFTWARE, 2018, 33 (02) :372-395
[23]   On Matching Pursuit and Coordinate Descent [J].
Locatello, Francesco ;
Raj, Anant ;
Karimireddy, Sai Praneeth ;
Raetsch, Gunnar ;
Schoelkopf, Bernhard ;
Stich, Sebastian U. ;
Jaggi, Martin .
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[24]   A flexible coordinate descent method [J].
Fountoulakis, Kimon ;
Tappenden, Rachael .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2018, 70 (02) :351-394
[25]   Efficient Sampling of Protein Loop Regions Using Conformational Hashing Complemented with Random Coordinate Descent [J].
del Alamo, Diego ;
Fischer, Axel W. ;
Moretti, Rocco ;
Alexander, Nathan S. ;
Mendenhall, Jeffrey ;
Hyman, Nicholas J. ;
Meiler, Jens .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2021, 17 (01) :560-570
[26]   Anderson acceleration of coordinate descent [J].
Bertrand, Quentin ;
Massias, Mathurin .
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[27]   Approximate Steepest Coordinate Descent [J].
Stich, Sebastian U. ;
Raj, Anant ;
Jaggi, Martin .
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[28]   A flexible coordinate descent method [J].
Kimon Fountoulakis ;
Rachael Tappenden .
Computational Optimization and Applications, 2018, 70 :351-394
[29]   Bandit-NAS: Bandit Sampling Method for Neural Architecture Search [J].
Lin, Yiqi ;
Wang, Ru .
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[30]   Thompson Sampling for the Multinomial Logit Bandit [J].
Agrawal, Shipra ;
Avadhanula, Vashist ;
Goyal, Vineet ;
Zeevi, Assaf .
MATHEMATICS OF OPERATIONS RESEARCH, 2025,