Motion U-Net: Multi-cue Encoder-Decoder Network for Motion Segmentation

被引:37
作者
Rahmon, Gani [1 ]
Bunyak, Filiz [1 ]
Seetharaman, Guna [2 ]
Palaniappan, Kannappan [1 ]
机构
[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[2] US Naval Res Lab, Washington, DC USA
来源
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年
关键词
moving object; change detection; background subtraction; flux tensor; convolutional neural networks; U-Net;
D O I
10.1109/ICPR48806.2021.9413211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detection of moving objects is a critical component of many computer vision tasks. Recently, deep learning architectures have been developed for supervised learning based moving object change detection. Some top performing architectures, like FgSegNet arc single frame spatial appearance cue-based detection and tend to overfit to the training videos. We propose a novel compact multi-cue autoencoder deep architecture, Motion U-Net (MU-Net) for robust moving object detection that generalizes much better than FgSegNet and requires nearly 30 times fewer weight parameters. Motion and change cues are estimated using a multi-modal background subtraction module combined with flux tensor motion estimation. MU-Net was trained and evaluated on the CDnet-2014 change detection challenge video sequences and had an overall F-measure of 0.9369. We used the unseen SBI-2015 video dataset to assess generalization capacity where MU-Net had an F-measure of 0.7625 while FgSegNet_v2 was 0.3519, less than half the MU-Net accuracy. The source code of the Motion U-Net is available at hlips://github.com/CIVA-Lab/Motion-U-Net.
引用
收藏
页码:8125 / 8132
页数:8
相关论文
共 30 条
[1]  
[Anonymous], 2015, MEDICAL IMAGE COMPUT
[2]  
Antonio P., 2020, ARXIV200105566
[3]   A deep convolutional neural network for video sequence background subtraction [J].
Babaee, Mohammadreza ;
Duc Tung Dinh ;
Rigoll, Gerhard .
PATTERN RECOGNITION, 2018, 76 :635-649
[4]   ViBe: A Universal Background Subtraction Algorithm for Video Sequences [J].
Barnich, Olivier ;
Van Droogenbroeck, Marc .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (06) :1709-1724
[5]   Review and Evaluation of Commonly-Implemented Background Subtraction Algorithms [J].
Benezeth, Y. ;
Jodoin, P. M. ;
Emile, B. ;
Laurent, H. ;
Rosenberger, C. .
19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, :237-+
[6]   How Far Can You Get by Combining Change Detection Algorithms? [J].
Bianco, Simone ;
Ciocca, Gianluigi ;
Schettini, Raimondo .
IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 :96-107
[7]  
Brutzer S, 2011, PROC CVPR IEEE
[8]   Flux tensor constrained geodesic active contours with sensor fusion for persistent object tracking [J].
Bunyak, Filiz ;
Palaniappan, Kannappan ;
Nath, Sumit Kumar ;
Seetharaman, Gunasekaran .
Journal of Multimedia, 2007, 2 (04) :20-33
[9]  
CDnet, CDNET VIDEO DATABASE
[10]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848