Attentive Temporal Pyramid Network for Dynamic Scene Classification

Cited by: 0

Authors:

Huang, Yuanjun [1,2,3,4]

Cao, Xianbin [1,3,4]

Zhen, Xiantong [1,3,4]

Han, Jungong [2]

Affiliations:

[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China

[2] Univ Lancaster, Lancaster LA1 4YW, England

[3] Beihang Univ, Minist Ind & Informat Technol China, Key Lab Adv Technol Near Space Informat Syst, Beijing, Peoples R China

[4] Beijing Adv Innovat Ctr Big Data Based Precis Med, Beijing, Peoples R China

Source:

THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019

Funding:

National Science Foundation (United States);


DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Dynamic scene classification is an important yet challenging problem, especially in the presence of defective or irrelevant frames caused by unconstrained imaging conditions such as illumination, camera motion, and irrelevant background. In this paper, we propose the attentive temporal pyramid network (ATP-Net), which establishes effective representations of dynamic scenes by extracting and aggregating the most informative and discriminative features. ATP-Net uses a temporal pyramid structure with an incorporated attention mechanism to detect the informative features of the frames that are most relevant to the scene. A newly designed kernel aggregation layer, based on kernel approximation, then fuses these frame features into a discriminative holistic representation of the dynamic scene. ATP-Net thus combines the strength of the attention mechanism in selecting the most relevant frame features with the ability of kernels to achieve optimal feature fusion for discriminative representations of dynamic scenes. Extensive experiments and comparisons on three benchmark datasets show that ATP-Net outperforms state-of-the-art methods on all three.
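
The abstract does not give architectural details, so the following is only an illustrative sketch of the general idea (attention pooling over temporal pyramid segments, followed by a kernel-approximation feature map), not the authors' ATP-Net. Everything in it is an assumption made for illustration: the function names, the linear attention scoring vector w, the three pyramid levels, and the use of random Fourier features as the kernel approximation. The parameters are random here, whereas a real network would learn them.

```python
import math
import random

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(segment, w):
    """Attention-weighted average of the frame vectors in one segment.
    w is a hypothetical scoring vector, not taken from the paper."""
    scores = [sum(wi * xi for wi, xi in zip(w, frame)) for frame in segment]
    alpha = softmax(scores)
    dim = len(segment[0])
    return [sum(a * frame[d] for a, frame in zip(alpha, segment)) for d in range(dim)]

def temporal_pyramid(frames, w, levels=3):
    """Level l splits the sequence into 2**l equal segments and pools each one.
    Assumes len(frames) >= 2**(levels - 1); leftover trailing frames are dropped."""
    pooled = []
    for level in range(levels):
        n_seg = 2 ** level
        size = len(frames) // n_seg
        for s in range(n_seg):
            pooled.append(attend(frames[s * size:(s + 1) * size], w))
    return pooled

def rff(x, omega, b):
    """Random Fourier feature map, a standard approximation of an RBF kernel
    (used here as a stand-in for the paper's kernel aggregation layer)."""
    scale = math.sqrt(2.0 / len(omega))
    return [scale * math.cos(sum(o * xi for o, xi in zip(row, x)) + bi)
            for row, bi in zip(omega, b)]

random.seed(0)
T, D, M = 16, 8, 32  # frames, feature dimension, number of random features
frames = [[random.gauss(0, 1) for _ in range(D)] for _ in range(T)]
w = [random.gauss(0, 1) for _ in range(D)]
omega = [[random.gauss(0, 1) for _ in range(D)] for _ in range(M)]
b = [random.uniform(0, 2 * math.pi) for _ in range(M)]

segments = temporal_pyramid(frames, w)           # 1 + 2 + 4 pooled segments
mapped = [rff(s, omega, b) for s in segments]    # kernel feature map per segment
video_repr = [sum(col) / len(mapped) for col in zip(*mapped)]  # average over segments
print(len(segments), len(video_repr))            # prints: 7 32
```

Averaging the mapped segment features means that the inner product of two video representations approximates the mean pairwise kernel value between their segments, which is one simple way a kernel approximation can turn a set of frame-level features into a single holistic vector.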


Pages: 8497-8504

Page count: 8
