DR-TANet: Dynamic Receptive Temporal Attention Network for Street Scene Change Detection

被引:18
作者
Chen, Shuo [1 ]
Yang, Kailun [1 ]
Stiefelhagen, Rainer [1 ]
机构
[1] Karlsruhe Inst Technol, Inst Anthropomat & Robot, D-76131 Karlsruhe, Germany
来源
2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV) | 2021年
关键词
D O I
10.1109/IV48863.2021.9575362
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Street scene change detection continues to capture researchers' interests in the computer vision community. It aims to identify the changed regions of the paired street-view images captured at different times. The state-of-the-art network based on the encoder-decoder architecture leverages the feature maps at the corresponding level between two channels to gain sufficient information of changes. Still, the efficiency of feature extraction, feature correlation calculation, even the whole network requires further improvement. This paper proposes the temporal attention and explores the impact of the dependency-scope size of temporal attention on the performance of change detection. In addition, based on the Temporal Attention Module (TAM), we introduce a more efficient and light-weight version - Dynamic Receptive Temporal Attention Module (DRTAM) and propose the Concurrent Horizontal and Vertical Attention (CHVA) to improve the accuracy of the network on specific challenging entities. On street scene datasets `GSV', 'TSUNAMI' and `VL-CMU-CD', our approach gains excellent performance. establishing new state of the art scores without bells and whistles, while maintaining high efficiency applicable in autonomous vehicles.
引用
收藏
页码:502 / 509
页数:8
相关论文
共 24 条
[1]  
Alcantarilla P.F., 2016, Robotics: Science and Systems RSS
[2]  
[Anonymous], INT C LEARNING REPRE
[3]  
[Anonymous], 2014, INT C LEARN REPRESEN
[4]   Cars Can't Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks [J].
Choi, Sungha ;
Kim, Joanne T. ;
Choo, Jaegul .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9370-9380
[5]  
Daudt RC, 2018, IEEE IMAGE PROC, P4063, DOI 10.1109/ICIP.2018.8451652
[6]  
Guo E., 2018, ARXIV PREPRINT ARXIV
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
Ho Jonathan, 2019, Axial attention in multidimensional transformers
[9]   Local Relation Networks for Image Recognition [J].
Hu, Han ;
Zhang, Zheng ;
Xie, Zhenda ;
Lin, Stephen .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3463-3472
[10]   CCNet: Criss-Cross Attention for Semantic Segmentation [J].
Huang, Zilong ;
Wang, Xinggang ;
Huang, Lichao ;
Huang, Chang ;
Wei, Yunchao ;
Liu, Wenyu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :603-612