Combining belief networks and neural networks for scene segmentation

被引：50

作者：

Feng, XJ

Williams, CKI

Felderhof, SN

机构：

[1] Natl Inst Biol Stand & Controls, Informat Lab, Potters Bar EN6 3QG, Herts, England

[2] Univ Edinburgh, Div Informat, Edinburgh EH1 2QL, Midlothian, Scotland

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2002年 / 24卷 / 04期

基金：

英国工程与自然科学研究理事会;

关键词：

tree-structured belief network (TSBN); hierarchical modeling; Markov random field (MRF); neural network; scaled-likelihood method; conditional maximum-liklihood training; Gaussian mixture model; expectation-maximization (EM);

D O I：

10.1109/34.993555

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We are concerned with the problem of image segmentation, in which each pixel is assigned to one of a predefined finite number of labels. In Bayesian image analysis, this requires fusing together local predictions for the class labels with a prior model of label images. Following the work of [5], we consider the use of tree-structured belief networks (TSBNs) as prior models. The parameters in the TSBN are trained using a maximum-likelihood objective function with the EM algorithm and the resulting model is evaluated by calculating how efficiently it codes label images. A number of authors have used Gaussian mixture models to connect the label field to the image data. In this paper, we compare this approach to the scaled-likelihood method of [42], [31], where local predictions of pixel classification from neural networks are fused with the TSBN prior. Our results show a higher performance is obtained with the neural networks. We evaluate the classification results obtained and emphasize not only the maximum a posteriori segmentation, but also the uncertainty, as evidenced e.g., by the pixelwise posterior marginal entropies. We also investigate the use of conditional maximum-likelihood training for the TSBN and find that this gives rise to improved classification performance over the ML-trained TSBN.

引用

页码：467 / 483

页数：17

共 51 条

[1]

[Anonymous], KNOWLEDGE BASED INTE

[2]

Ballard D.H., 1982, Computer Vision

[3] MODELING AND ESTIMATION OF MULTIRESOLUTION STOCHASTIC-PROCESSES [J].

BASSEVILLE, M ;

BENVENISTE, A ;

CHOU, KC ;

GOLDEN, SA ;

NIKOUKHAH, R ;

WILLSKY, AS .

IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) :766-784

[4]

BESAG J, 1986, J R STAT SOC B, V48, P259

[5]

Bishop C. M., 1995, NEURAL NETWORKS PATT

[6] A MULTISCALE RANDOM-FIELD MODEL FOR BAYESIAN IMAGE SEGMENTATION [J].

BOUMAN, CA ;

SHAPIRO, M .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 1994, 3 (02) :162-177

[7]

Cheng H, 1998, 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, P610, DOI 10.1109/ICIP.1998.723575

[8] Wavelet-based statistical signal processing using hidden Markov models [J].

Crouse, MS ;

Nowak, RD ;

Baraniuk, RG .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1998, 46 (04) :886-902

[9] THE HELMHOLTZ MACHINE [J].

DAYAN, P ;

HINTON, GE ;

NEAL, RM ;

ZEMEL, RS .

NEURAL COMPUTATION, 1995, 7 (05) :889-904

[10]

De Bonet JS, 1998, ADV NEUR IN, V10, P773

← 1 2 3 4 5 6 →