Attentive Temporal Pyramid Network for Dynamic Scene Classification

被引:0
|
作者
Huang, Yuanjun [1 ,2 ,3 ,4 ]
Cao, Xianbin [1 ,3 ,4 ]
Zhen, Xiantong [1 ,3 ,4 ]
Han, Jungong [2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Univ Lancaster, Lancaster LA1 4YW, England
[3] Beihang Univ, Minist Ind & Informat Technol China, Key Lab Adv technol Near Space Informat Syst, Beijing, Peoples R China
[4] Beijing Adv Innovat Ctr Big Data Based Precis Med, Beijing, Peoples R China
来源
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic scene classification is an important yet challenging problem especially with the presence of defected or irrelevant frames due to unconstrained imaging conditions such as illumination, camera motion and irrelevant background. In this paper, we propose the attentive temporal pyramid network (ATP-Net) to establish effective representations of dynamic scenes by extracting and aggregating the most informative and discriminative features. The proposed ATP-Net detects informative features of frames that contain the most relevant information to scenes by a temporal pyramid structure with the incorporated attention mechanism. These frame features are effectively fused by a newly designed kernel aggregation layer based on kernel approximation into a discriminative holistic representations of dynamic scenes. The proposed ATP-Net leverages the strength of attention mechanism to select the most relevant frame features and the ability of kernels to achieve optimal feature fusion for discriminative representations of dynamic scenes. Extensive experiments and comparisons are conducted on three benchmark datasets and the results show our superiority over the state-of-the-art methods on all these three benchmark datasets.
引用
收藏
页码:8497 / 8504
页数:8
相关论文
共 50 条
  • [21] PSPNet-SLAM: A Semantic SLAM Detect Dynamic Object by Pyramid Scene Parsing Network
    Long, Xudong
    Zhang, Weiwei
    Zhao, Bo
    IEEE ACCESS, 2020, 8 : 214685 - 214695
  • [22] Hierarchical pyramid attentive network with spatial separable convolution for crowd counting
    Zhang, Shihui
    Zhang, Xiaoxiao
    Li, He
    He, Huan
    Song, Dandan
    Wang, Lei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 108
  • [23] Temporal Pyramid Network for Action Recognition
    Yang, Ceyuan
    Xu, Yinghao
    Shi, Jianping
    Dai, Bo
    Zhou, Bolei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 588 - 597
  • [24] Temporal Pyramid Recurrent Neural Network
    Ma, Qianli
    Lin, Zhenxi
    Chen, Enhuan
    Cottrell, Garrison W.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5061 - 5068
  • [25] Scene Text Detection with Supervised Pyramid Context Network
    Xie, Enze
    Zang, Yuhang
    Shao, Shuai
    Yu, Gang
    Yao, Cong
    Li, Guangyao
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9038 - 9045
  • [26] ATTENTIVE MAX FEATURE MAP AND JOINT TRAINING FOR ACOUSTIC SCENE CLASSIFICATION
    Shim, Hye-jin
    Jung, Jee-weon
    Kim, Ju-ho
    Yu, Ha-Jin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1036 - 1040
  • [27] A New Feature Pyramid Network For Road Scene Segmentation
    Zhan, Wujing
    Chen, Jiaxing
    Fan, Lei
    Ou, Xinqi
    Chen, Long
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 3487 - 3492
  • [28] VIDEO SCENE CLASSIFICATION USING SPATIAL PYRAMID BASED FEATURES
    Sert, Mustafa
    Ergun, Hilal
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 1946 - 1949
  • [29] DSTSPYN: a dynamic spatial-temporal similarity pyramid network for traffic flow prediction
    Wang, Xing
    Chen, Feifei
    Jin, Biao
    Lin, Mingwei
    Zou, Fumin
    Zeng, Ruihao
    APPLIED INTELLIGENCE, 2025, 55 (03)
  • [30] AGFP-Net: Attentive geometric feature pyramid network for land cover classification using airborne multispectral LiDAR data
    Li, Dilong
    Shen, Xin
    Guan, Haiyan
    Yu, Yongtao
    Wang, Hanyun
    Zhang, Guo
    Li, Jonathan
    Li, Deren
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 108