PLUGIN ESTIMATION OF SMOOTH OPTIMAL TRANSPORT MAPS

被引:11
作者
Manole, Tudor [1 ]
Balakrishnan, Sivaraman [1 ]
Niles-Weed, Jonathan [2 ,3 ]
Wasserman, Larry [1 ]
机构
[1] Carnegie Mellon Univ, Dept Stat & Data Sci, Pittsburgh, PA 15213 USA
[2] NYU, Courant Inst Math Sci, New York, NY USA
[3] NYU, Ctr Data Sci, New York, NY USA
基金
美国国家科学基金会;
关键词
Optimal transport map; Wasserstein distance; Brenier potential; minimax estimation; density estimation; central limit theorem; semiparametric efficiency; CENTRAL LIMIT-THEOREMS; MONGE-AMPERE EQUATION; WASSERSTEIN DISTANCE; PROBABILITY DENSITY; BOUNDARY-REGULARITY; EMPIRICAL MEASURES; MINIMAX ESTIMATION; CONVERGENCE; RATES; TIME;
D O I
10.1214/24-AOS2379
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We analyze a number of natural estimators for the optimal transport map between two distributions and show that they are minimax optimal. We adopt the plugin approach: our estimators are simply optimal couplings between measures derived from our observations, appropriately extended so that they define functions on Rd. d . When the underlying map is assumed to be Lipschitz, we show that computing the optimal coupling between the empirical measures, and extending it using linear smoothers, already gives a minimax optimal estimator. When the underlying map enjoys higher regularity, we show that the optimal coupling between appropriate nonparametric density estimates yields faster rates. Our work also provides new bounds on the risk of corresponding plugin estimators for the quadratic Wasserstein distance, and we show how this problem relates to that of estimating optimal transport maps using stability arguments for smooth and strongly convex Brenier potentials. As an application of our results, we derive central limit theorems for plugin estimators of the squared Wasserstein distance, which are centered at their population counterpart when the underlying distributions have sufficiently smooth densities. In contrast to known central limit theorems for empirical estimators, this result easily lends itself to statistical inference for the quadratic Wasserstein distance.
引用
收藏
页码:966 / 998
页数:33
相关论文
共 121 条
[1]   ON OPTIMAL MATCHINGS [J].
AJTAI, M ;
KOMLOS, J ;
TUSNADY, G .
COMBINATORICA, 1984, 4 (04) :259-264
[2]   Existence of Eulerian Solutions to the Semigeostrophic Equations in Physical Space: The 2-Dimensional Periodic Case [J].
Ambrosio, Luigi ;
Colombo, Maria ;
De Philippis, Guido ;
Figalli, Alessio .
COMMUNICATIONS IN PARTIAL DIFFERENTIAL EQUATIONS, 2012, 37 (12) :2209-2227
[3]  
[Anonymous], 1999, Non-Parametric Curve Estimation: Methods, Theory and Applications
[4]  
[Anonymous], 1998, PROB APPL S
[5]  
Benamou JD, 2000, NUMER MATH, V84, P375, DOI 10.1007/s002119900117
[6]   FlipTest: Fairness Testing via Optimal Transport [J].
Black, Emily ;
Yeom, Samuel ;
Fredrikson, Matt .
FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, :111-121
[7]   One-Dimensional Empirical Measures, Order Statistics, and Kantorovich Transport Distances [J].
Bobkov, Sergey ;
Ledoux, Michel .
MEMOIRS OF THE AMERICAN MATHEMATICAL SOCIETY, 2019, 261 (1259) :1-+
[8]   On the mean speed of convergence of empirical and occupation measures in Wasserstein distance [J].
Boissard, Emmanuel ;
Le Gouic, Thibaut .
ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2014, 50 (02) :539-563
[9]   FROM KNOTHE'S REARRANGEMENT TO BRENIER'S OPTIMAL TRANSPORT MAP [J].
Bonnotte, Nicolas .
SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 2013, 45 (01) :64-87