Human action recognition with salient trajectories and multiple kernel learning

被引:0
|
作者
Yang Yi
Pan Hu
Xiaokang Deng
机构
[1] Sun Yat-sen University,School of Data & Computer Science
[2] Xinhua College of Sun Yat-sen University,undefined
[3] Guangdong Province Key Laboratory of Big Data Analysis and Processing,undefined
来源
Multimedia Tools and Applications | 2018年 / 77卷
关键词
Action recognition; Bag-of-visual-words; Trajectories filter; Clustering; Multiple kernel learning;
D O I
暂无
中图分类号
学科分类号
摘要
Action recognition in videos plays an important role in the field of computer vision and multimedia, and there exist lots of challenges due to the complexity of spatial and temporal information. Trajectory-based approach has shown to be efficient recently, and a new framework and algorithm of trajectory space information based multiple kernel learning (TSI-MKL) is exploited in this paper. First, dense trajectories are extracted as raw features, and three saliency maps are computed corresponding to color, space, and optical flow on frames at the same time. Secondly, a new method combining above saliency maps is proposed to filter the achieved trajectories, by which a set of salient trajectories only containing foreground motion regions is obtained. Afterwards, a novel two-layer clustering is developed to cluster the obtained trajectories into several semantic groups and the ultimate video representation is generated by encoding each group. Finally, representations of different semantic groups are fed into the proposed kernel function of a multiple kernel classifier. Experiments are conducted on three popular video action datasets and the results demonstrate that our presented approach performs competitively compared with the state-of-the-art.
引用
收藏
页码:17709 / 17730
页数:21
相关论文
共 50 条
  • [1] Human action recognition with salient trajectories and multiple kernel learning
    Yi, Yang
    Hu, Pan
    Deng, Xiaokang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 17709 - 17730
  • [2] Localized Multiple Kernel Learning for Realistic Human Action Recognition in Videos
    Song, Yan
    Zheng, Yan-Tao
    Tang, Sheng
    Zhou, Xiangdong
    Zhang, Yongdong
    Lin, Shouxun
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (09) : 1193 - 1202
  • [3] Multilayer deep features with multiple kernel learning for action recognition
    Sheng, Biyun
    Li, Jun
    Xiao, Fu
    Yang, Wankou
    NEUROCOMPUTING, 2020, 399 : 65 - 74
  • [4] Realistic action recognition with salient foreground trajectories
    Yi, Yang
    Zheng, Zhenxian
    Lin, Maoqing
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 75 : 44 - 55
  • [5] Human action recognition using modified slow feature analysis and multiple kernel learning
    Yongliang Xiao
    Limin Xia
    Multimedia Tools and Applications, 2016, 75 : 13041 - 13056
  • [6] Human action recognition using modified slow feature analysis and multiple kernel learning
    Xiao, Yongliang
    Xia, Limin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (21) : 13041 - 13056
  • [7] Hierarchical Bayesian Multiple Kernel Learning Based Feature Fusion for Action Recognition
    Sun, Wen
    Yuan, Chunfeng
    Wang, Pei
    Yang, Shuang
    Hu, Weiming
    Cai, Zhaoquan
    MULTIMODAL PATTERN RECOGNITION OF SOCIAL SIGNALS IN HUMAN-COMPUTER-INTERACTION, MPRSS 2016, 2017, 10183 : 85 - 97
  • [8] Multiple Kernel Learning and Optical Flow for Action Recognition in RGB-D Video
    Viet, Vo Hoai
    Ngoc, Ly Quoc
    Son, Tran Thai
    Hoang, Pham Minh
    2015 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2015, : 222 - 227
  • [9] Aggregating the temporal coherent descriptors in videos using multiple learning kernel for action recognition
    Saleh, Adel
    Abdel-Nasser, Mohamed
    Angel Garcia, Miguel
    Puig, Domenec
    PATTERN RECOGNITION LETTERS, 2018, 105 : 4 - 12
  • [10] Tracking Salient Keypoints for Human Action Recognition
    Wang, Hanli
    Yi, Yun
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 3048 - 3053