Learning Low-Dimensional Temporal Representations with Latent Alignments

被引:4
作者
Su, Bing [1 ]
Wu, Ying [2 ]
机构
[1] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100190, Peoples R China
[2] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Feature extraction; Hidden Markov models; Training; Motion segmentation; Dimensionality reduction; Three-dimensional displays; Data models; latent alignment; temporal sequences; discriminant analysis; ACTION RECOGNITION; DISCRIMINANT-ANALYSIS; REDUCTION; MODELS; SEGMENTATION; POSE;
D O I
10.1109/TPAMI.2019.2919303
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low-dimensional discriminative representations enhance machine learning methods in both performance and complexity. This has motivated supervised dimensionality reduction (DR), which transforms high-dimensional data into a discriminative subspace. Most DR methods require data to be i.i.d. However, in some domains, data naturally appear in sequences, where the observations are temporally correlated. We propose a DR method, namely, latent temporal linear discriminant analysis (LT-LDA), to learn low-dimensional temporal representations. We construct the separability among sequence classes by lifting the holistic temporal structures, which are established based on temporal alignments and may change in different subspaces. We jointly learn the subspace and the associated latent alignments by optimizing an objective that favors easily separable temporal structures. We show that this objective is connected to the inference of alignments and thus allows for an iterative solution. We provide both theoretical insight and empirical evaluations on several real-world sequence datasets to show the applicability of our method.
引用
收藏
页码:2842 / 2857
页数:16
相关论文
共 84 条
  • [1] [Anonymous], 2007, P 24 INT C MACH LEAR
  • [2] [Anonymous], 2006, P 23 INT C MACH LEAR, DOI DOI 10.1145/1143844.1143875
  • [3] [Anonymous], 2007, Advances in neural information processing systems
  • [4] Baradel F., 2017, Pose-conditioned Spatio-Temporal Attention for Human Action Recognition
  • [5] Barbic J, 2004, PROC GRAPH INTERF, P185
  • [6] Coding Kendall's Shape Trajectories for 3D Action Recognition
    Ben Tanfous, Amor
    Drira, Hassen
    Ben Amor, Boulbaba
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2840 - 2849
  • [7] Max-Min Distance Analysis by Using Sequential SDP Relaxation for Dimension Reduction
    Bian, Wei
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 1037 - 1050
  • [8] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [9] Cavazza J, 2016, INT C PATT RECOG, P408, DOI 10.1109/ICPR.2016.7899668
  • [10] Generalized Rank Pooling for Activity Recognition
    Cherian, Anoop
    Fernando, Basura
    Harandi, Mehrtash
    Gould, Stephen
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1581 - 1590