End-to-end video background subtraction with 3d convolutional neural networks

被引：0

作者：

Dimitrios Sakkos

Heng Liu

Jungong Han

Ling Shao

机构：

[1] Northumbria University,Department of Computer and Information Sciences

[2] Anhui University of Technology,School of Computer Science and of Technology

[3] Lancaster University,School of Computing and Communications

[4] University of East Anglia,School of Computer Sciences

来源：

Multimedia Tools and Applications | 2018年 / 77卷

关键词：

Computer vision; Deep learning; Fully convolutional networks; Background subtraction; Video segmentation; 3D convolutional networks;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Background subtraction in videos is a highly challenging task by definition, as it lays on a pixel-wise classification level. Therefore, great attention to detail is essential. In this paper, we follow the success of Deep Learning in Computer Vision and present an end-to-end system for background subtraction in videos. Our model is able to track temporal changes in a video sequence by applying 3D convolutions to the most recent frames of the video. Thus, no background model is needed to be retained and updated. In addition, it can handle multiple scenes without further fine-tuning on each scene individually. We evaluate our system on the largest dataset for change detection, CDnet, with over 50 videos which span across 11 categories. Further evaluation is performed in the ESI dataset which features extreme and sudden illumination changes. Our model surpasses the state-of-the-art on both datasets according to the average ranking of the models over a wide range of metrics.

引用

页码：23023 / 23041

页数：18

共 50 条

[1] End-to-end video background subtraction with 3d convolutional neural networks
Sakkos, Dimitrios
Liu, Heng
Han, Jungong
Shao, Ling
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 23023 - 23041
[2] End-to-end Prediction of Driver Intention using 3D Convolutional Neural Networks
Gebert, Patrick
Roitberg, Alina
Haurilet, Monica
Stiefelhagen, Rainer
2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 969 - 974
[3] Background Subtraction via 3D Convolutional Neural Networks
Gao, Yongqiang
Cai, Huayue
Zhang, Xiang
Lan, Long
Luo, Zhigang
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1271 - 1276
[4] End-to-end 3D face reconstruction with deep neural networks
Dou, Pengfei
Shah, Shishir K.
Kakadiaris, Ioannis A.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1503 - 1512
[5] An end-to-end 3D convolutional neural network for decoding attentive mental state
Zhang, Yangsong
Cai, Huan
Nie, Li
Xu, Peng
Zhao, Sirui
Guan, Cuntai
NEURAL NETWORKS, 2021, 144 : 129 - 137
[6] End-to-End Text Recognition with Convolutional Neural Networks
Wang, Tao
Wu, David J.
Coates, Adam
Ng, Andrew Y.
2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3304 - 3308
[7] An end-to-end convolutional neural network for automated failure localisation and characterisation of 3D interconnects
Paulachan, Priya
Siegert, Jorg
Wiesler, Ingo
Brunner, Roland
SCIENTIFIC REPORTS, 2023, 13 (01)
[8] END-TO-END LEARNING OF DEEP CONVOLUTIONAL NEURAL NETWORK FOR 3D HUMAN ACTION RECOGNITION
Li, Chao
Sun, Shouqian
Min, Xin
Lin, Wenqian
Nie, Binling
Zhang, Xianfu
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
[9] An end-to-end convolutional neural network for automated failure localisation and characterisation of 3D interconnects
Priya Paulachan
Jörg Siegert
Ingo Wiesler
Roland Brunner
Scientific Reports, 13
[10] 3DVSD: An end-to-end 3D convolutional object detection network for video smoke detection
Huo, Yinuo
Zhang, Qixing
Zhang, Yongming
Zhu, Jiping
Wang, Jinjun
FIRE SAFETY JOURNAL, 2022, 134

← 1 2 3 4 5 →