Polynomial-time algorithms for multimarginal optimal transport problems with structure

被引:11
作者
Altschuler, Jason M. [1 ]
Boix-Adsera, Enric [1 ]
机构
[1] MIT, Lab Informat & Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
Multimarginal optimal transport; Polynomial-time algorithms; Implicit linear programming; Structured linear programs; MARGINAL OPTIMAL TRANSPORT; GENERALIZED SOLUTIONS; RANDOM-VARIABLES; DISTRIBUTIONS; COMPLEXITY; APPROXIMATION; BARYCENTERS; FLOWS; PERT;
D O I
10.1007/s10107-022-01868-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multimarginal Optimal Transport (MOT) has attracted significant interest due to applications in machine learning, statistics, and the sciences. However, in most applications, the success of MOT is severely limited by a lack of efficient algorithms. Indeed, MOT in general requires exponential time in the number of marginals k and their support sizes n. This paper develops a general theory about what "structure" makes MOT solvable in poly(n, k) time. We develop a unified algorithmic framework for solving MOT in poly(n, k) time by characterizing the structure that different algorithms require in terms of simple variants of the dual feasibility oracle. This framework has several benefits. First, it enables us to show that the Sinkhorn algorithm, which is currently the most popular MOT algorithm, requires strictly more structure than other algorithms do to solve MOT in poly(n, k) time. Second, our framework makes it much simpler to develop poly(n, k) time algorithms for a given MOT problem. In particular, it is necessary and sufficient to (approximately) solve the dual feasibility oracle-which is much more amenable to standard algorithmic techniques. We illustrate this ease-ofuse by developing poly(n, k)-time algorithms for three general classes of MOT cost structures: (1) graphical structure; (2) set-optimization structure; and (3) low-rank plus sparse structure. For structure (1), we recover the known result that Sinkhorn has poly(n, k) runtime; moreover, we provide the first poly(n, k) time algorithms for computing solutions that are exact and sparse. For structures (2)-(3), we give the first poly(n, k) time algorithms, even for approximate computation. Together, these three structures encompass many-if not most-current applications of MOT.
引用
收藏
页码:1107 / 1178
页数:72
相关论文
共 94 条
  • [1] Price of Correlations in Stochastic Optimization
    Agrawal, Shipra
    Ding, Yichuan
    Saberi, Amin
    Ye, Yinyu
    [J]. OPERATIONS RESEARCH, 2012, 60 (01) : 150 - 162
  • [2] BARYCENTERS IN THE WASSERSTEIN SPACE
    Agueh, Martial
    Carlier, Guillaume
    [J]. SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 2011, 43 (02) : 904 - 924
  • [3] Akagi Y., 2020, PREPRINT
  • [4] Altschuler J.M., 2022, SIAM J MATH DATA SCI
  • [5] Altschuler J, 2019, ADV NEUR IN, V32
  • [6] Altschuler J, 2017, ADV NEUR IN, V30
  • [7] Near-linear convergence of the Random Osborne algorithm for Matrix Balancing
    Altschuler, Jason M.
    Parrilo, Pablo A.
    [J]. MATHEMATICAL PROGRAMMING, 2023, 198 (01) : 363 - 397
  • [8] Hardness results for Multimarginal Optimal Transport problems
    Altschuler, Jason M.
    Boix-Adsera, Enric
    [J]. DISCRETE OPTIMIZATION, 2021, 42
  • [9] ALTSCHULER JM, 2021, J MACH LEARN RES, V22
  • [10] Discrete Wasserstein barycenters: optimal transport for discrete data
    Anderes, Ethan
    Borgwardt, Steffen
    Miller, Jacob
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2016, 84 (02) : 389 - 409