Learning Low-Dimensional Temporal Representations with Latent Alignments

Cited by: 4
Authors
Su, Bing [1]
Wu, Ying [2]
Affiliations
[1] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100190, Peoples R China
[2] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
Funding
National Natural Science Foundation of China; U.S. National Science Foundation
Keywords
Feature extraction; Hidden Markov models; Training; Motion segmentation; Dimensionality reduction; Three-dimensional displays; Data models; latent alignment; temporal sequences; discriminant analysis; ACTION RECOGNITION; DISCRIMINANT-ANALYSIS; REDUCTION; MODELS; SEGMENTATION; POSE;
DOI
10.1109/TPAMI.2019.2919303
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Low-dimensional discriminative representations enhance machine learning methods in both performance and complexity. This has motivated supervised dimensionality reduction (DR), which transforms high-dimensional data into a discriminative subspace. Most DR methods require data to be i.i.d. However, in some domains, data naturally appear in sequences, where the observations are temporally correlated. We propose a DR method, namely, latent temporal linear discriminant analysis (LT-LDA), to learn low-dimensional temporal representations. We construct the separability among sequence classes by lifting the holistic temporal structures, which are established based on temporal alignments and may change in different subspaces. We jointly learn the subspace and the associated latent alignments by optimizing an objective that favors easily separable temporal structures. We show that this objective is connected to the inference of alignments and thus allows for an iterative solution. We provide both theoretical insight and empirical evaluations on several real-world sequence datasets to show the applicability of our method.
Pages: 2842-2857
Page count: 16