Classification and Automatic Annotation Extension of Images Using Bayesian Network

被引:0
作者
Barrat, Sabine [1 ]
Tabbone, Salvatore [1 ]
机构
[1] Univ Nancy 2, LORIA, UMR 7503, F-54506 Vandoeuvre Les Nancy, France
来源
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION | 2008年 / 5342卷
关键词
probabilistic graphical models; Bayesian networks; image classification; image annotation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many vision problems, instead of having fully annotated training data, it is easier to obtain just a subset, of data with annotations. because it, is less restrictive for the user. Per this reason, in this paper, we consider especially the problem of classifying weakly-annotated images, where just a, small subset of the database is annotated with keywords. In this paper we present and evaluate a new method which improves the effectiveness of content-based image classification, by integrating semantic concepts extracted from text, and by automatically extending annotations to the images with missing keywords. Our model is inspired from the probabilistic graphical model theory: we propose a hierarchical mixture model which enables to handle missing values. Results of visual-textual classification, reported on a database of images collected from the Web, partially and manually annotated, show an improvement by 32.3% in terms of recognition rate against only visual information classification. Besides the automatic annotation extension with our model For images with missing keywords outperforms the visual-textual classification by 6.8%. Finally the proposed method is experimentally competitive with the state-of-art classifiers.
引用
收藏
页码:937 / 946
页数:10
相关论文
共 15 条
[1]  
[Anonymous], IJCAI
[2]  
[Anonymous], CVPR
[3]   Matching words and pictures [J].
Barnard, K ;
Duygulu, P ;
Forsyth, D ;
de Freitas, N ;
Blei, DM ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1107-1135
[4]   Perceptual knowledge construction from annotated image collections [J].
Benitez, AB ;
Chang, SF .
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, :189-192
[5]  
Blei D., 2003, P 26 ANN INT ACM SIG, P127, DOI DOI 10.1145/860435.860460
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]  
Feng SL, 2004, PROC CVPR IEEE, P1002
[8]  
Gao Y., 2006, MULTI-MEDIA '06, P901
[9]  
GROSKY WI, 2001, LECT NOTES COMPUTER, V2234, P33
[10]   Combining visual features with semantics for a more effective image retrieval [J].
Kherfi, ML ;
Brahmi, D ;
Ziou, D .
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, :961-964