Representing and Learning High Dimensional Data with the Optimal Transport Map from a Probabilistic Viewpoint

被引:9
|
作者
Park, Serim [1 ]
Thorpe, Matthew [2 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
[2] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge CB3 0WA, England
关键词
DIFFEOMORPHISM; DISTRIBUTIONS;
D O I
10.1109/CVPR.2018.00820
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a generative model in the space of diffeomorphic deformation maps. More precisely, we utilize the Kantarovich-Wasserstein metric and accompanying geometry to represent an image as a deformation from templates. Moreover, we incorporate a probabilistic viewpoint by assuming that each image is locally generated from a reference image. We capture the local structure by modelling the tangent planes at reference images. Once basis vectors for each tangent plane are learned via probabilistic PCA, we can sample a local coordinate, that can be inverted back to image space exactly. With experiments using 4 different datasets, we show that the generative tangent plane model in the optimal transport (OT) manifold can be learned with small numbers of images and can be used to create infinitely many 'unseen' images. In addition, the Bayesian classification accompanied with the probabilist modeling of the tangent planes shows improved accuracy over that done in the image space. Combining the results of our experiments supports our claim that certain datasets can be better represented with the KantarovichWasserstein metric. We envision that the proposed method could be a practical solution to learning and representing data that is generated with templates in situatons where only limited numbers of data points are available.
引用
收藏
页码:7864 / 7872
页数:9
相关论文
共 50 条
  • [1] Uniform confidence band for optimal transport map on one-dimensional data
    Ponnoprat, Donlapark
    Okano, Ryo
    Imaizumi, Masaaki
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (01): : 515 - 552
  • [2] A Unified Probabilistic Model for Learning Latent Factors and Their Connectivities from High-Dimensional Data
    Monti, Ricardo Pio
    Hyvarinen, Aapo
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 300 - 309
  • [3] Feature-Robust Optimal Transport for High-Dimensional Data
    Petrovich, Mathis
    Liang, Chao
    Sato, Ryoma
    Liu, Yanbin
    Tsai, Yao-Hung Hubert
    Zhu, Linchao
    Yang, Yi
    Salakhutdinov, Ruslan
    Yamada, Makoto
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT V, 2023, 13717 : 291 - 307
  • [4] Probabilistic classifiers with high-dimensional data
    Kim, Kyung In
    Simon, Richard
    BIOSTATISTICS, 2011, 12 (03) : 399 - 412
  • [5] A SCALABLE DEEP LEARNING APPROACH FOR SOLVING HIGH-DIMENSIONAL DYNAMIC OPTIMAL TRANSPORT
    Wan, Wei
    Zhang, Yuejin
    Bao, Chenglong
    Dong, Bin
    Shi, Zuoqiang
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2023, 45 (04): : B544 - B563
  • [6] Optimal projections of high dimensional data
    Corchado, E
    Fyfe, C
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 589 - 596
  • [7] Inference and Verification of Probabilistic Graphical Models from High-Dimensional Data
    Ma, Yinjiao
    Damazyn, Kevin
    Klinger, Jakob
    Gong, Haijun
    DATA INTEGRATION IN THE LIFE SCIENCES, DILS 2015, 2015, 9162 : 223 - 239
  • [8] Inference With Aggregate Data in Probabilistic Graphical Models: An Optimal Transport Approach
    Singh, Rahul
    Haasler, Isabel
    Zhang, Qinsheng
    Karlsson, Johan
    Chen, Yongxin
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (09) : 4483 - 4497
  • [9] Probabilistic visualisation of high-dimensional binary data
    Tipping, ME
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 592 - 598
  • [10] Algorithms for similarity relation learning from high dimensional data
    Janusz, Andrzej
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8375 : 174 - 292