Learning Low-Dimensional Temporal Representations with Latent Alignments

被引：4

作者：

Su, Bing ^{[1
]}

Wu, Ying ^{[2
]}

机构：

[1] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100190, Peoples R China

[2] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2020年 / 42卷 / 11期

基金：

美国国家科学基金会; 中国国家自然科学基金;

关键词：

Feature extraction; Hidden Markov models; Training; Motion segmentation; Dimensionality reduction; Three-dimensional displays; Data models; latent alignment; temporal sequences; discriminant analysis; ACTION RECOGNITION; DISCRIMINANT-ANALYSIS; REDUCTION; MODELS; SEGMENTATION; POSE;

D O I：

10.1109/TPAMI.2019.2919303

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Low-dimensional discriminative representations enhance machine learning methods in both performance and complexity. This has motivated supervised dimensionality reduction (DR), which transforms high-dimensional data into a discriminative subspace. Most DR methods require data to be i.i.d. However, in some domains, data naturally appear in sequences, where the observations are temporally correlated. We propose a DR method, namely, latent temporal linear discriminant analysis (LT-LDA), to learn low-dimensional temporal representations. We construct the separability among sequence classes by lifting the holistic temporal structures, which are established based on temporal alignments and may change in different subspaces. We jointly learn the subspace and the associated latent alignments by optimizing an objective that favors easily separable temporal structures. We show that this objective is connected to the inference of alignments and thus allows for an iterative solution. We provide both theoretical insight and empirical evaluations on several real-world sequence datasets to show the applicability of our method.

引用

页码：2842 / 2857

页数：16

共 84 条

[1] [Anonymous], 2007, P 24 INT C MACH LEAR
[2] [Anonymous], 2006, P 23 INT C MACH LEAR, DOI DOI 10.1145/1143844.1143875
[3] [Anonymous], 2007, Advances in neural information processing systems
[4] Baradel F., 2017, Pose-conditioned Spatio-Temporal Attention for Human Action Recognition
[5] Barbic J, 2004, PROC GRAPH INTERF, P185
[6] Coding Kendall's Shape Trajectories for 3D Action Recognition
Ben Tanfous, Amor
Drira, Hassen
Ben Amor, Boulbaba
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2840 - 2849
[7] Max-Min Distance Analysis by Using Sequential SDP Relaxation for Dimension Reduction
Bian, Wei
Tao, Dacheng
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 1037 - 1050
[8] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Carreira, Joao
Zisserman, Andrew
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
[9] Cavazza J, 2016, INT C PATT RECOG, P408, DOI 10.1109/ICPR.2016.7899668
[10] Generalized Rank Pooling for Activity Recognition
Cherian, Anoop
Fernando, Basura
Harandi, Mehrtash
Gould, Stephen
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1581 - 1590

← 1 2 3 4 5 6 7 8 9 →