DarkLight Networks for Action Recognition in the Dark

被引:11
作者
Chen, Rui [1 ]
Chen, Jiajun [1 ]
Liang, Zixi [1 ]
Gao, Huaien [1 ]
Lin, Shan [1 ]
机构
[1] Guangzhou Xi Ma Informat Technol Co, 101 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 | 2021年
关键词
D O I
10.1109/CVPRW53098.2021.00094
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human action recognition in the dark is a significant task with various applications, e.g., night surveillance and self-driving at night. However, the lack of video datasets for human actions in the dark hinders its development. Recently, a public dataset ARID has been introduced to stimulate progress for the task of human action recognition in dark videos. Currently, there are multiple models that perform well for action recognition in videos shot under normal illumination. However, research shows that these methods may not be effective in recognizing actions in dark videos. In this paper, we construct a novel neural network architecture: DarkLight Networks, which involves (i) a dual-pathway structure where both dark videos and its brightened counterpart are utilized for effective video representation; and (ii) a self-attention mechanism, which fuses and extracts corresponding and complementary features from the two pathways. Our approach achieves state-of-the-art results on ARID.
引用
收藏
页码:846 / 852
页数:7
相关论文
共 36 条
[1]  
[Anonymous], 2015, BIORESOUR BIOPROCESS, DOI DOI 10.1371/J0URNAL.P0NE.0142446
[2]  
Bertasius Gedas, 2021, arXiv
[3]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[4]   Seeing Motion in the Dark [J].
Chen, Chen ;
Chen, Qifeng ;
Do, Minh N. ;
Koltun, Vladlen .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3184-3193
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]  
Dosovitskiy A, 2021, ICLR 2021 9 INT C LE
[7]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[8]   Large-scale weakly-supervised pre-training for video action recognition [J].
Ghadiyaram, Deepti ;
Du Tran ;
Mahajan, Dhruv .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12038-12047
[9]   Video Action Transformer Network [J].
Girdhar, Rohit ;
Carreira, Joao ;
Doersch, Carl ;
Zisserman, Andrew .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :244-253
[10]  
Gowda S. N., 2020, ARXIV201210671