LARGE-SCALE NONCONVEX STOCHASTIC OPTIMIZATION BY DOUBLY STOCHASTIC SUCCESSIVE CONVEX APPROXIMATION

被引:0
|
作者
Mokhtari, Aryan [1 ]
Koppel, Alec [1 ]
Scutari, Gesualdo [2 ]
Ribeiro, Alejandro [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
[2] Purdue Univ, Sch Ind Engn, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
Non-convex optimization; stochastic methods; large-scale optimization; parallel optimization; lasso; COORDINATE DESCENT METHOD; CONVERGENCE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider supervised learning problems over training sets in which both the number of training examples and the dimension of the feature vectors are large. We focus on the case where the loss function defining the quality of the parameter we wish to estimate may be non-convex, but also has a convex regularization. We propose a Doubly Stochastic Successive Convex approximation scheme (DSSC) able to handle non-convex regularized expected risk minimization. The method operates by decomposing the decision variable into blocks and operating on random subsets of blocks at each step. The algorithm belongs to the family of successive convex approximation methods since we replace the original non-convex stochastic objective by a strongly convex sample surrogate function, and solve the resulting convex program, for each randomly selected block in parallel. The method operates on subsets of features (block coordinate methods) and training examples (stochastic approximation) at each step. In contrast to many stochastic convex methods whose almost sure behavior is not guaranteed in non-convex settings, DSSC attains almost sure convergence to a stationary solution of the problem. Numerical experiments on a non-convex variant of a lasso regression problem show that DSSC performs favorably in this setting.
引用
收藏
页码:4701 / 4705
页数:5
相关论文
共 50 条
  • [1] High-Dimensional Nonconvex Stochastic Optimization by Doubly Stochastic Successive Convex Approximation
    Mokhtari, Aryan
    Koppel, Alec
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 6287 - 6302
  • [2] Distributed Stochastic Nonconvex Optimization and Learning based on Successive Convex Approximation
    Di Lorenzo, Paolo
    Scardapane, Simone
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 2224 - 2228
  • [3] PARALLEL STOCHASTIC SUCCESSIVE CONVEX APPROXIMATION METHOD FOR LARGE-SCALE DICTIONARY LEARNING
    Koppel, Alec
    Mokhtari, Aryan
    Ribeiro, Alejandro
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2771 - 2775
  • [4] Online Successive Convex Approximation for Two-Stage Stochastic Nonconvex Optimization
    Liu, An
    Lau, Vincent K. N.
    Zhao, Min-Jian
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (22) : 5941 - 5955
  • [5] Doubly Stochastic Algorithms for Large-Scale Optimization
    Koppel, Alec
    Mokhtari, Aryan
    Ribeiro, Alejandro
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 1705 - 1709
  • [6] Stochastic Successive Convex Approximation for General Stochastic Optimization Problems
    Ye, Chencheng
    Cui, Ying
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (06) : 755 - 759
  • [7] Stochastic Successive Convex Approximation for Non-Convex Constrained Stochastic Optimization
    Liu, An
    Lau, Vincent K. N.
    Kananian, Borna
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (16) : 4189 - 4203
  • [8] Distributed Computational Framework for Large-Scale Stochastic Convex Optimization
    Rostampour, Vahab
    Keviczky, Tamas
    ENERGIES, 2021, 14 (01)
  • [9] A Stochastic Quasi-Newton Method for Large-Scale Nonconvex Optimization With Applications
    Chen, Huiming
    Wu, Ho-Chun
    Chan, Shing-Chow
    Lam, Wong-Hing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4776 - 4790
  • [10] Parallel Successive Convex Approximation for Nonsmooth Nonconvex Optimization
    Razaviyayn, Meisam
    Hong, Mingyi
    Luo, Zhi-Quan
    Pang, Jong-Shi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27