Linear and Deep Order-Preserving Wasserstein Discriminant Analysis

被引：3

作者：

Su, Bing ^{[1
]}

Zhou, Jiahuan ^{[2
]}

Wen, Ji-Rong ^{[1
]}

Wu, Ying ^{[2
]}

机构：

[1] Renmin Univ China, Beijing Key Lab Big Data Management & Anal Method, Gaoling Sch Artificial Intelligence, Beijing 100872, Peoples R China

[2] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022年 / 44卷 / 06期

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

Hidden Markov models; Feature extraction; Dimensionality reduction; Three-dimensional displays; Joints; Training; Distortion measurement; Optimal transport; order-preserving Wasserstein distance; barycenter; dimensionality reduction; sequence classification; ACTION RECOGNITION; LDA;

D O I：

10.1109/TPAMI.2021.3050750

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Supervised dimensionality reduction for sequence data learns a transformation that maps the observations in sequences onto a low-dimensional subspace by maximizing the separability of sequences in different classes. It is typically more challenging than conventional dimensionality reduction for static data, because measuring the separability of sequences involves non-linear procedures to manipulate the temporal structures. In this paper, we propose a linear method, called order-preserving Wasserstein discriminant analysis (OWDA), and its deep extension, namely DeepOWDA, to learn linear and non-linear discriminative subspace for sequence data, respectively. We construct novel separability measures between sequence classes based on the order-preserving Wasserstein (OPW) distance to capture the essential differences among their temporal structures. Specifically, for each class, we extract the OPW barycenter and construct the intra-class scatter as the dispersion of the training sequences around the barycenter. The inter-class distance is measured as the OPW distance between the corresponding barycenters. We learn the linear and non-linear transformations by maximizing the inter-class distance and minimizing the intra-class scatter. In this way, the proposed OWDA and DeepOWDA are able to concentrate on the distinctive differences among classes by lifting the geometric relations with temporal constraints. Experiments on four 3D action recognition datasets show the effectiveness of OWDA and DeepOWDA.

引用

页码：3123 / 3138

页数：16

共 86 条

[11] Generalized Rank Pooling for Activity Recognition
Cherian, Anoop
Fernando, Basura
Harandi, Mehrtash
Gould, Stephen
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1581 - 1590
[12] Cho S, 2020, IEEE WINT CONF APPL, P624, DOI [10.1109/WACV45572.2020.9093639, 10.1109/wacv45572.2020.9093639]
[13] Cuturi M, 2017, PR MACH LEARN RES, V70
[14] Cuturi M, 2014, PR MACH LEARN RES, V32, P685
[15] Deng YX, 2017, IEEE DEVICE RES CONF
[16] Dorfer M, 2015, PROC INT C LEARN REP, P1, DOI [10.48550/arXiv.1511.04707, DOI 10.48550/ARXIV.1511.04707]
[17] Escalera S, 2013, ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, P365
[18] Multi-modal Gesture Recognition Challenge 2013: Dataset and Results
Escalera, Sergio
Gonzalez, Jordi
Baro, Xavier
Reyes, Miguel
Lopes, Oscar
Guyon, Isabelle
Athitsos, Vassilis
Escalante, Hugo J.
[J]. ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, : 445 - 452
[19] Rank Pooling for Action Recognition
Fernando, Basura
Gavves, Efstratios
Oramas, Jose M.
Ghodrati, Amir
Tuytelaars, Tinne
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 773 - 787
[20] Fernando B, 2015, PROC CVPR IEEE, P5378, DOI 10.1109/CVPR.2015.7299176

← 1 2 3 4 5 6 7 8 9 →