Soft video parsing by label distribution learning

被引:0
|
作者
Miaogen Ling
Xin Geng
机构
[1] Southeast University,Department of Computer Science and Engineering
来源
关键词
video parsing; label distribution learning; subactions; graduality;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we tackle the problem of segmenting out a sequence of actions from videos. The videos contain background and actions which are usually composed of ordered sub-actions. We refer the sub-actions and the background as semantic units. Considering the possible overlap between two adjacent semantic units, we propose a bidirectional sliding window method to generate the label distributions for various segments in the video. The label distribution covers a certain number of semantic unit labels, representing the degree to which each label describes the video segment. The mapping from a video segment to its label distribution is then learned by a Label Distribution Learning (LDL) algorithm. Based on the LDL model, a soft video parsing method with segmental regular grammars is proposed to construct a tree structure for the video. Each leaf of the tree stands for a video clip of background or sub-action. The proposed method shows promising results on the THUMOS’14, MSR-II and UCF101 datasets and its computational complexity is much less than the compared state-of-the-art video parsing method.
引用
收藏
页码:302 / 317
页数:15
相关论文
共 50 条
  • [1] Soft video parsing by label distribution learning
    Ling, Miaogen
    Geng, Xin
    FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (02) : 302 - 317
  • [2] Soft Video Parsing by Label Distribution Learning
    Geng, Xin
    Ling, Miaogen
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1331 - 1337
  • [3] Label Distribution Learning for Video Summarization
    Liu Y.
    Tang S.
    Gao Y.
    Li Z.
    Li H.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (01): : 104 - 110
  • [4] Spatiotemporal Modeling and Label Distribution Learning for Video Summarization
    Chu, Wei-Ta
    Liu, Yu-Hsin
    2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,
  • [5] Soft Facial Landmark Detection by Label Distribution Learning
    Su, Kai
    Geng, Xin
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5008 - 5015
  • [6] Crowd counting in public video surveillance by label distribution learning
    Zhang, Zhaoxiang
    Wang, Mo
    Geng, Xin
    NEUROCOMPUTING, 2015, 166 : 151 - 163
  • [7] Video Scene Parsing with Predictive Feature Learning
    Jin, Xiaojie
    Li, Xin
    Xiao, Huaxin
    Shen, Xiaohui
    Lin, Zhe
    Yang, Jimei
    Chen, Yunpeng
    Dong, Jian
    Liu, Luoqi
    Jie, Zequn
    Feng, Jiashi
    Yan, Shuicheng
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5581 - 5589
  • [8] Learning and parsing video events with goal and intent prediction
    Pei, Mingtao
    Si, Zhangzhang
    Yao, Benjamin Z.
    Zhu, Song-Chun
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (10) : 1369 - 1383
  • [9] Label Distribution Learning by Exploiting Label Distribution Manifold
    Wang, Jing
    Geng, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 839 - 852
  • [10] Label Enhancement for Label Distribution Learning
    Xu, Ning
    Liu, Yun-Peng
    Geng, Xin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1632 - 1643