Attentive Temporal Pyramid Network for Dynamic Scene Classification

Cited by: 0
Authors
Huang, Yuanjun [1 ,2 ,3 ,4 ]
Cao, Xianbin [1 ,3 ,4 ]
Zhen, Xiantong [1 ,3 ,4 ]
Han, Jungong [2 ]
Affiliations
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Univ Lancaster, Lancaster LA1 4YW, England
[3] Beihang Univ, Minist Ind & Informat Technol China, Key Lab Adv Technol Near Space Informat Syst, Beijing, Peoples R China
[4] Beijing Adv Innovat Ctr Big Data Based Precis Med, Beijing, Peoples R China
Source
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019
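Funding
National Science Foundation (USA);
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes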
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Dynamic scene classification is an important yet challenging problem, especially in the presence of defective or irrelevant frames caused by unconstrained imaging conditions such as illumination, camera motion, and irrelevant background. In this paper, we propose the attentive temporal pyramid network (ATP-Net) to establish effective representations of dynamic scenes by extracting and aggregating the most informative and discriminative features. The proposed ATP-Net uses a temporal pyramid structure with an incorporated attention mechanism to detect the features of frames that carry the information most relevant to the scene. These frame features are then fused, by a newly designed kernel aggregation layer based on kernel approximation, into a discriminative holistic representation of the dynamic scene. ATP-Net thus combines the strength of the attention mechanism in selecting the most relevant frame features with the ability of kernels to achieve optimal feature fusion. Extensive experiments and comparisons on three benchmark datasets show that ATP-Net outperforms state-of-the-art methods on all three.
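The abstract gives no implementation details, so the following Python sketch is only a rough illustration of the two ideas it names: attention-weighted pooling of frame features over a temporal pyramid, followed by aggregation in an approximate kernel feature space. Everything specific here is an assumption made for illustration and not the ATP-Net design: the function names, dot-product attention against a fixed query vector, splitting level l into 2^l segments, and the use of random Fourier features as the kernel approximation.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attentive_pool(frames, query):
    """Attention-weighted average of frame features.

    Assumption: attention scores are dot products with a hypothetical
    query vector; a trained network would learn this scoring function.
    """
    scores = [sum(f_i * q_i for f_i, q_i in zip(f, query)) for f in frames]
    weights = softmax(scores)
    dim = len(frames[0])
    return [sum(w * f[d] for w, f in zip(weights, frames)) for d in range(dim)]


def temporal_pyramid(frames, query, levels=3):
    """Pool the sequence at several temporal scales.

    Assumption: level l splits the frames into 2**l contiguous segments,
    and each non-empty segment is attentively pooled to one vector.
    """
    n = len(frames)
    pooled = []
    for level in range(levels):
        parts = 2 ** level
        for p in range(parts):
            start, end = p * n // parts, (p + 1) * n // parts
            if end > start:
                pooled.append(attentive_pool(frames[start:end], query))
    return pooled


def random_fourier_features(x, omegas, biases):
    """Random Fourier feature map approximating an RBF kernel:
    z(x) = sqrt(2 / D) * cos(omega . x + b)."""
    scale = math.sqrt(2.0 / len(omegas))
    return [
        scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
        for w, b in zip(omegas, biases)
    ]


def kernel_aggregate(segment_feats, omegas, biases):
    """Average the kernel-mapped segment features into one video vector."""
    mapped = [random_fourier_features(s, omegas, biases) for s in segment_feats]
    return [sum(m[d] for m in mapped) / len(mapped) for d in range(len(omegas))]


# Toy usage with random stand-ins for CNN frame features.
rng = random.Random(0)
dim, n_frames, n_rff = 4, 8, 16
frames = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_frames)]
query = [rng.gauss(0, 1) for _ in range(dim)]
omegas = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_rff)]
biases = [rng.uniform(0, 2 * math.pi) for _ in range(n_rff)]

segments = temporal_pyramid(frames, query, levels=3)    # 1 + 2 + 4 segments
video_repr = kernel_aggregate(segments, omegas, biases)  # one vector per video
```

In this sketch the attention step down-weights frames whose features score poorly against the query, which mirrors the stated goal of suppressing defective or irrelevant frames, while the kernel map lets a plain average act as a nonlinear fusion of the segment features.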
Pages: 8497-8504
Number of pages: 8