Combining belief networks and neural networks for scene segmentation

被引:50
作者
Feng, XJ
Williams, CKI
Felderhof, SN
机构
[1] Natl Inst Biol Stand & Controls, Informat Lab, Potters Bar EN6 3QG, Herts, England
[2] Univ Edinburgh, Div Informat, Edinburgh EH1 2QL, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
tree-structured belief network (TSBN); hierarchical modeling; Markov random field (MRF); neural network; scaled-likelihood method; conditional maximum-liklihood training; Gaussian mixture model; expectation-maximization (EM);
D O I
10.1109/34.993555
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We are concerned with the problem of image segmentation, in which each pixel is assigned to one of a predefined finite number of labels. In Bayesian image analysis, this requires fusing together local predictions for the class labels with a prior model of label images. Following the work of [5], we consider the use of tree-structured belief networks (TSBNs) as prior models. The parameters in the TSBN are trained using a maximum-likelihood objective function with the EM algorithm and the resulting model is evaluated by calculating how efficiently it codes label images. A number of authors have used Gaussian mixture models to connect the label field to the image data. In this paper, we compare this approach to the scaled-likelihood method of [42], [31], where local predictions of pixel classification from neural networks are fused with the TSBN prior. Our results show a higher performance is obtained with the neural networks. We evaluate the classification results obtained and emphasize not only the maximum a posteriori segmentation, but also the uncertainty, as evidenced e.g., by the pixelwise posterior marginal entropies. We also investigate the use of conditional maximum-likelihood training for the TSBN and find that this gives rise to improved classification performance over the ML-trained TSBN.
引用
收藏
页码:467 / 483
页数:17
相关论文
共 51 条
[1]  
[Anonymous], KNOWLEDGE BASED INTE
[2]  
Ballard D.H., 1982, Computer Vision
[3]   MODELING AND ESTIMATION OF MULTIRESOLUTION STOCHASTIC-PROCESSES [J].
BASSEVILLE, M ;
BENVENISTE, A ;
CHOU, KC ;
GOLDEN, SA ;
NIKOUKHAH, R ;
WILLSKY, AS .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) :766-784
[4]  
BESAG J, 1986, J R STAT SOC B, V48, P259
[5]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[6]   A MULTISCALE RANDOM-FIELD MODEL FOR BAYESIAN IMAGE SEGMENTATION [J].
BOUMAN, CA ;
SHAPIRO, M .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1994, 3 (02) :162-177
[7]  
Cheng H, 1998, 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, P610, DOI 10.1109/ICIP.1998.723575
[8]   Wavelet-based statistical signal processing using hidden Markov models [J].
Crouse, MS ;
Nowak, RD ;
Baraniuk, RG .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1998, 46 (04) :886-902
[9]   THE HELMHOLTZ MACHINE [J].
DAYAN, P ;
HINTON, GE ;
NEAL, RM ;
ZEMEL, RS .
NEURAL COMPUTATION, 1995, 7 (05) :889-904
[10]  
De Bonet JS, 1998, ADV NEUR IN, V10, P773