Central limit theorems for entropy-regularized optimal transport on finite spaces and statistical applications

被引:24
作者
Bigot, Jeremie [1 ]
Cazelles, Elsa [1 ]
Papadakis, Nicolas [2 ]
机构
[1] Univ Bordeaux, Inst Math Bordeaux, Talence, France
[2] CNRS, UMR 5251, Inst Math Bordeaux, Talence, France
关键词
Optimal transport; Sinkhorn divergence; central limit theorem; bootstrap; hypothesis testing; WASSERSTEIN; MODELS; TESTS;
D O I
10.1214/19-EJS1637
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The notion of entropy-regularized optimal transport, also known as Sinkhorn divergence, has recently gained popularity in machine learning and statistics, as it makes feasible the use of smoothed optimal transportation distances for data analysis. The Sinkhorn divergence allows the fast computation of an entropically regularized Wasserstein distance between two probability distributions supported on a finite metric space of (possibly) high-dimension. For data sampled from one or two unknown probability distributions, we derive the distributional limits of the empirical Sinkhorn divergence and its centered version (Sinkhorn loss). We also propose a bootstrap procedure which allows to obtain new test statistics for measuring the discrepancies between multivariate probability distributions. Our work is inspired by the results of Sommerfeld and Munk in [33] on the asymptotic distribution of empirical Wasserstein distance on finite space using unregularized transportation costs. Incidentally we also analyze the asymptotic distribution of entropy-regularized Wasserstein distances when the regularization parameter tends to zero. Simulated and real datasets are used to illustrate our approach.
引用
收藏
页码:5120 / 5150
页数:31
相关论文
共 41 条
[1]  
[Anonymous], 180408962 ARXIV
[2]  
[Anonymous], ARXIV170501299V1
[3]  
[Anonymous], ALL STAT CONCISE COU
[4]  
[Anonymous], 170600292 ARXIV
[5]  
[Anonymous], 2017, NIPS 17 WORKSH OPT T
[6]  
[Anonymous], ACM T GRAPHICS SIGGR
[7]  
[Anonymous], J ROYAL STAT SOC B
[8]  
[Anonymous], P INT C ART INT STAT
[9]  
[Anonymous], 2018, ARXIV181009880
[10]  
[Anonymous], SSRN