DarkLight Networks for Action Recognition in the Dark

被引：11

作者：

Chen, Rui ^{[1
]}

Chen, Jiajun ^{[1
]}

Liang, Zixi ^{[1
]}

Gao, Huaien ^{[1
]}

Lin, Shan ^{[1
]}

机构：

[1] Guangzhou Xi Ma Informat Technol Co, 101 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 | 2021年

关键词：

D O I：

10.1109/CVPRW53098.2021.00094

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human action recognition in the dark is a significant task with various applications, e.g., night surveillance and self-driving at night. However, the lack of video datasets for human actions in the dark hinders its development. Recently, a public dataset ARID has been introduced to stimulate progress for the task of human action recognition in dark videos. Currently, there are multiple models that perform well for action recognition in videos shot under normal illumination. However, research shows that these methods may not be effective in recognizing actions in dark videos. In this paper, we construct a novel neural network architecture: DarkLight Networks, which involves (i) a dual-pathway structure where both dark videos and its brightened counterpart are utilized for effective video representation; and (ii) a self-attention mechanism, which fuses and extracts corresponding and complementary features from the two pathways. Our approach achieves state-of-the-art results on ARID.

引用

页码：846 / 852

页数：7

共 36 条

[11] LIME: Low-Light Image Enhancement via Illumination Map Estimation [J].

Guo, Xiaojie ;

Li, Yu ;

Ling, Haibin .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (02) :982-993

[12] Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? [J].

Hara, Kensho ;

Kataoka, Hirokatsu ;

Satoh, Yutaka .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6546-6555

[13] Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition [J].

Hara, Kensho ;

Kataoka, Hirokatsu ;

Satoh, Yutaka .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :3154-3160

[14] 3D Convolutional Neural Networks for Human Action Recognition [J].

Ji, Shuiwang ;

Xu, Wei ;

Yang, Ming ;

Yu, Kai .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :221-231

[15] Learning to See Moving Objects in the Dark [J].

Jiang, Haiyang ;

Zheng, Yinqiang .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7323-7332

[16]

Kalfaoglu M. Esat, 2020, Proceedings of the 16th European Conference on Computer Vision (ECCV 2020) Workshops. Lecture Notes in Computer Science (LNCS 12539), P731, DOI 10.1007/978-3-030-68238-5_48

[17] SVAS: Surveillance Video Analysis System [J].

Kardas, Karani ;

Cicekli, Nihan Kesim .

EXPERT SYSTEMS WITH APPLICATIONS, 2017, 89 :343-361

[18] Identifying multiuser activity with overlapping acoustic data for mobile decision making in smart home environments [J].

Lee, Jonathan S. ;

Choi, Sukjae ;

Kwon, Ohbyung .

EXPERT SYSTEMS WITH APPLICATIONS, 2017, 81 :299-308

[19]

Li B, 2019, AAAI CONF ARTIF INTE, P8561

[20]

Li Junnan, 2018, Advances in Neural Information Processing Systems (Neurips)

← 1 2 3 4 →