Deep Learning Markov Random Field for Semantic Segmentation

Cited by: 116
Authors
Liu, Ziwei [1]
Li, Xiaoxiao [1]
Luo, Ping [1]
Loy, Chen Change [1]
Tang, Xiaoou [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Informat Engn, Shatin, Hong Kong, Peoples R China
Keywords
Semantic image/video segmentation; Markov random field; convolutional neural network; VIDEO; TRACKING
DOI
10.1109/TPAMI.2017.2737535
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic segmentation tasks can be well modeled by a Markov Random Field (MRF). This paper addresses semantic segmentation by incorporating high-order relations and a mixture of label contexts into the MRF. Unlike previous works that optimized MRFs using iterative algorithms, we solve the MRF with a Convolutional Neural Network (CNN), the Deep Parsing Network (DPN), which enables deterministic end-to-end computation in a single forward pass. Specifically, DPN extends a contemporary CNN to model the unary terms, and additional layers are devised to approximate the mean field (MF) algorithm for the pairwise terms. It has several appealing properties. First, unlike recent works that required many iterations of MF during back-propagation, DPN achieves high performance by approximating one iteration of MF. Second, DPN represents various types of pairwise terms, so many existing models are its special cases. Furthermore, the pairwise terms in DPN provide a unified framework for encoding rich contextual information in high-dimensional data such as images and videos. Third, DPN makes MF easier to parallelize and speed up, thus enabling efficient inference. DPN is thoroughly evaluated on standard semantic image/video segmentation benchmarks, where a single DPN model yields state-of-the-art segmentation accuracy on PASCAL VOC 2012, Cityscapes, and CamVid.
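
To make the phrase "one iteration of MF" concrete, the following is a minimal sketch of a single generic mean-field update on a 4-connected pixel grid. It is not the DPN architecture from the paper: the function name `mean_field_step`, the label-compatibility matrix `compat`, the scalar `weight`, and the plain 4-neighbor message sum are all assumptions chosen for illustration. The point it illustrates is that the update reduces to a local neighborhood aggregation followed by a per-pixel softmax, which is the kind of computation convolutional layers can express.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the label axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def mean_field_step(unary, compat, weight=1.0):
    """One generic mean-field update (illustrative, not the paper's layers).

    unary:  (H, W, K) unary energies, lower means more likely.
    compat: (K, K) label-compatibility cost; compat[u, v] is the penalty
            for label u at a pixel when a neighbor takes label v.
    weight: hypothetical scalar strength of the pairwise term.
    Returns (H, W, K) updated label marginals.
    """
    # Initial marginals come from the unary term alone.
    q = softmax(-unary)
    # Sum the neighbors' marginals over the 4-neighborhood (zero padding).
    p = np.pad(q, ((1, 1), (1, 1), (0, 0)))
    msg = p[:-2, 1:-1] + p[2:, 1:-1] + p[1:-1, :-2] + p[1:-1, 2:]
    # pairwise[..., u] = sum_v compat[u, v] * msg[..., v]
    pairwise = msg @ compat.T
    # Combine unary and pairwise energies, then renormalize per pixel.
    return softmax(-(unary + weight * pairwise))

# Usage: a 3x3 grid with 2 labels. Every pixel prefers label 0 except the
# center, which weakly prefers label 1; a Potts cost penalizes disagreement.
unary = np.tile(np.array([0.0, 2.0]), (3, 3, 1))
unary[1, 1] = [0.5, 0.0]
potts = 1.0 - np.eye(2)
q_new = mean_field_step(unary, potts)
print(q_new.argmax(axis=-1))
```

In this toy case the center pixel starts out favoring label 1 under the unary term alone and is pulled to label 0 by its neighbors after the single update.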
Pages: 1814-1828
Page count: 15