Linear and Deep Order-Preserving Wasserstein Discriminant Analysis

被引:3
作者
Su, Bing [1 ]
Zhou, Jiahuan [2 ]
Wen, Ji-Rong [1 ]
Wu, Ying [2 ]
机构
[1] Renmin Univ China, Beijing Key Lab Big Data Management & Anal Method, Gaoling Sch Artificial Intelligence, Beijing 100872, Peoples R China
[2] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Hidden Markov models; Feature extraction; Dimensionality reduction; Three-dimensional displays; Joints; Training; Distortion measurement; Optimal transport; order-preserving Wasserstein distance; barycenter; dimensionality reduction; sequence classification; ACTION RECOGNITION; LDA;
D O I
10.1109/TPAMI.2021.3050750
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised dimensionality reduction for sequence data learns a transformation that maps the observations in sequences onto a low-dimensional subspace by maximizing the separability of sequences in different classes. It is typically more challenging than conventional dimensionality reduction for static data, because measuring the separability of sequences involves non-linear procedures to manipulate the temporal structures. In this paper, we propose a linear method, called order-preserving Wasserstein discriminant analysis (OWDA), and its deep extension, namely DeepOWDA, to learn linear and non-linear discriminative subspace for sequence data, respectively. We construct novel separability measures between sequence classes based on the order-preserving Wasserstein (OPW) distance to capture the essential differences among their temporal structures. Specifically, for each class, we extract the OPW barycenter and construct the intra-class scatter as the dispersion of the training sequences around the barycenter. The inter-class distance is measured as the OPW distance between the corresponding barycenters. We learn the linear and non-linear transformations by maximizing the inter-class distance and minimizing the intra-class scatter. In this way, the proposed OWDA and DeepOWDA are able to concentrate on the distinctive differences among classes by lifting the geometric relations with temporal constraints. Experiments on four 3D action recognition datasets show the effectiveness of OWDA and DeepOWDA.
引用
收藏
页码:3123 / 3138
页数:16
相关论文
共 86 条
  • [11] Generalized Rank Pooling for Activity Recognition
    Cherian, Anoop
    Fernando, Basura
    Harandi, Mehrtash
    Gould, Stephen
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1581 - 1590
  • [12] Cho S, 2020, IEEE WINT CONF APPL, P624, DOI [10.1109/WACV45572.2020.9093639, 10.1109/wacv45572.2020.9093639]
  • [13] Cuturi M, 2017, PR MACH LEARN RES, V70
  • [14] Cuturi M, 2014, PR MACH LEARN RES, V32, P685
  • [15] Deng YX, 2017, IEEE DEVICE RES CONF
  • [16] Dorfer M, 2015, PROC INT C LEARN REP, P1, DOI [10.48550/arXiv.1511.04707, DOI 10.48550/ARXIV.1511.04707]
  • [17] Escalera S, 2013, ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, P365
  • [18] Multi-modal Gesture Recognition Challenge 2013: Dataset and Results
    Escalera, Sergio
    Gonzalez, Jordi
    Baro, Xavier
    Reyes, Miguel
    Lopes, Oscar
    Guyon, Isabelle
    Athitsos, Vassilis
    Escalante, Hugo J.
    [J]. ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, : 445 - 452
  • [19] Rank Pooling for Action Recognition
    Fernando, Basura
    Gavves, Efstratios
    Oramas, Jose M.
    Ghodrati, Amir
    Tuytelaars, Tinne
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 773 - 787
  • [20] Fernando B, 2015, PROC CVPR IEEE, P5378, DOI 10.1109/CVPR.2015.7299176