Learning Low-Dimensional Temporal Representations with Latent Alignments

Cited by: 4

Authors:

Su, Bing [1]

Wu, Ying [2]

Affiliations:

[1] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100190, Peoples R China

[2] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2020 / Vol. 42 / No. 11

Funding:

National Natural Science Foundation of China; National Science Foundation (USA)

Keywords:

Feature extraction; Hidden Markov models; Training; Motion segmentation; Dimensionality reduction; Three-dimensional displays; Data models; latent alignment; temporal sequences; discriminant analysis; ACTION RECOGNITION; DISCRIMINANT-ANALYSIS; REDUCTION; MODELS; SEGMENTATION; POSE;

DOI:

10.1109/TPAMI.2019.2919303

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Low-dimensional discriminative representations enhance machine learning methods in both performance and complexity. This has motivated supervised dimensionality reduction (DR), which transforms high-dimensional data into a discriminative subspace. Most DR methods require data to be i.i.d. However, in some domains, data naturally appear in sequences, where the observations are temporally correlated. We propose a DR method, namely, latent temporal linear discriminant analysis (LT-LDA), to learn low-dimensional temporal representations. We construct the separability among sequence classes by lifting the holistic temporal structures, which are established based on temporal alignments and may change in different subspaces. We jointly learn the subspace and the associated latent alignments by optimizing an objective that favors easily separable temporal structures. We show that this objective is connected to the inference of alignments and thus allows for an iterative solution. We provide both theoretical insight and empirical evaluations on several real-world sequence datasets to show the applicability of our method.


Pages: 2842-2857

Page count: 16

Number of references: 84

