Distributed Stochastic Optimization of Regularized Risk via Saddle-Point Problem

Cited by: 2

Authors:

Matsushima, Shin [1]

Yun, Hyokun [2]

Zhang, Xinhua [3]

Vishwanathan, S. V. N. [2, 4]

Affiliations:

[1] Univ Tokyo, Tokyo, Japan

[2] Amazon Com, Seattle, WA 98170 USA

[3] Univ Illinois, Chicago, IL 60607 USA

[4] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I | 2017 / Vol. 10534

Keywords:

SUBGRADIENT METHODS;

DOI:

10.1007/978-3-319-71249-9_28

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Many machine learning algorithms minimize a regularized risk, and stochastic optimization is widely used for this task. When working with massive data, it is desirable to perform stochastic optimization in parallel. Unfortunately, many existing stochastic optimization algorithms cannot be parallelized efficiently. In this paper we show that one can rewrite the regularized risk minimization problem as an equivalent saddle-point problem, and propose an efficient distributed stochastic optimization (DSO) algorithm. We prove the algorithm's rate of convergence; remarkably, our analysis shows that the algorithm scales almost linearly with the number of processors. We also verify with empirical evaluations that the proposed algorithm is competitive with other parallel, general purpose stochastic and batch optimization algorithms for regularized risk minimization.
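
One standard way to obtain such a saddle-point form, sketched here under assumptions not stated in the abstract, is to replace each loss with its convex conjugate. The symbols below (weights w, examples x_i, closed convex losses \ell_i, a separable regularizer \phi_j, regularization constant \lambda, dual variables \alpha_i) are illustrative, and the paper's own notation and sign conventions may differ.

```latex
% Illustrative sketch only; notation is assumed, not taken from the paper.
\begin{align*}
% Regularized risk over m examples and d parameters
\min_{w}\; P(w) &= \lambda \sum_{j=1}^{d} \phi_j(w_j)
   + \frac{1}{m} \sum_{i=1}^{m} \ell_i\bigl(\langle w, x_i \rangle\bigr) \\
% Each closed convex loss equals the conjugate of its conjugate
\ell_i(u) &= \sup_{\alpha_i} \bigl\{ \alpha_i u - \ell_i^{*}(\alpha_i) \bigr\} \\
% Substituting gives an equivalent saddle-point problem
\min_{w} \max_{\alpha}\; f(w, \alpha) &= \lambda \sum_{j=1}^{d} \phi_j(w_j)
   + \frac{1}{m} \sum_{i=1}^{m} \alpha_i \langle w, x_i \rangle
   - \frac{1}{m} \sum_{i=1}^{m} \ell_i^{*}(\alpha_i)
\end{align*}
```

Since \langle w, x_i \rangle = \sum_j w_j x_{ij}, the only term coupling w and \alpha in this sketch is a sum over pairs (i, j) with x_{ij} \neq 0, so a single stochastic update can involve just one w_j and one \alpha_i. Structure of this kind is what makes it plausible to split updates across processors.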

Citation

Pages: 460-476

Page count: 17
