End-to-end video background subtraction with 3d convolutional neural networks

被引:0
|
作者
Dimitrios Sakkos
Heng Liu
Jungong Han
Ling Shao
机构
[1] Northumbria University,Department of Computer and Information Sciences
[2] Anhui University of Technology,School of Computer Science and of Technology
[3] Lancaster University,School of Computing and Communications
[4] University of East Anglia,School of Computer Sciences
来源
Multimedia Tools and Applications | 2018年 / 77卷
关键词
Computer vision; Deep learning; Fully convolutional networks; Background subtraction; Video segmentation; 3D convolutional networks;
D O I
暂无
中图分类号
学科分类号
摘要
Background subtraction in videos is a highly challenging task by definition, as it lays on a pixel-wise classification level. Therefore, great attention to detail is essential. In this paper, we follow the success of Deep Learning in Computer Vision and present an end-to-end system for background subtraction in videos. Our model is able to track temporal changes in a video sequence by applying 3D convolutions to the most recent frames of the video. Thus, no background model is needed to be retained and updated. In addition, it can handle multiple scenes without further fine-tuning on each scene individually. We evaluate our system on the largest dataset for change detection, CDnet, with over 50 videos which span across 11 categories. Further evaluation is performed in the ESI dataset which features extreme and sudden illumination changes. Our model surpasses the state-of-the-art on both datasets according to the average ranking of the models over a wide range of metrics.
引用
收藏
页码:23023 / 23041
页数:18
相关论文
共 50 条
  • [1] End-to-end video background subtraction with 3d convolutional neural networks
    Sakkos, Dimitrios
    Liu, Heng
    Han, Jungong
    Shao, Ling
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 23023 - 23041
  • [2] End-to-end Prediction of Driver Intention using 3D Convolutional Neural Networks
    Gebert, Patrick
    Roitberg, Alina
    Haurilet, Monica
    Stiefelhagen, Rainer
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 969 - 974
  • [3] Background Subtraction via 3D Convolutional Neural Networks
    Gao, Yongqiang
    Cai, Huayue
    Zhang, Xiang
    Lan, Long
    Luo, Zhigang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1271 - 1276
  • [4] End-to-end 3D face reconstruction with deep neural networks
    Dou, Pengfei
    Shah, Shishir K.
    Kakadiaris, Ioannis A.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1503 - 1512
  • [5] An end-to-end 3D convolutional neural network for decoding attentive mental state
    Zhang, Yangsong
    Cai, Huan
    Nie, Li
    Xu, Peng
    Zhao, Sirui
    Guan, Cuntai
    NEURAL NETWORKS, 2021, 144 : 129 - 137
  • [6] End-to-End Text Recognition with Convolutional Neural Networks
    Wang, Tao
    Wu, David J.
    Coates, Adam
    Ng, Andrew Y.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3304 - 3308
  • [7] An end-to-end convolutional neural network for automated failure localisation and characterisation of 3D interconnects
    Paulachan, Priya
    Siegert, Jorg
    Wiesler, Ingo
    Brunner, Roland
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [8] END-TO-END LEARNING OF DEEP CONVOLUTIONAL NEURAL NETWORK FOR 3D HUMAN ACTION RECOGNITION
    Li, Chao
    Sun, Shouqian
    Min, Xin
    Lin, Wenqian
    Nie, Binling
    Zhang, Xianfu
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [9] An end-to-end convolutional neural network for automated failure localisation and characterisation of 3D interconnects
    Priya Paulachan
    Jörg Siegert
    Ingo Wiesler
    Roland Brunner
    Scientific Reports, 13
  • [10] 3DVSD: An end-to-end 3D convolutional object detection network for video smoke detection
    Huo, Yinuo
    Zhang, Qixing
    Zhang, Yongming
    Zhu, Jiping
    Wang, Jinjun
    FIRE SAFETY JOURNAL, 2022, 134