Building and using fuzzy multimedia ontologies for semantic image annotation

被引:24
作者
Bannour, Hichem [1 ]
Hudelot, Celine [1 ]
机构
[1] Ecole Cent Paris, MAS Lab, F-92295 Chatenay Malabry, France
关键词
Image annotation; Multimedia ontology; Ontology building; Ontological reasoning; Fuzzy DL; Spatial information; Contextual information; RETRIEVAL;
D O I
10.1007/s11042-013-1491-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a methodology for building fuzzy multimedia ontologies dedicated to image annotation. The built ontology incorporates visual, conceptual, contextual and spatial knowledge about image concepts in order to model image semantics in an effective way. Indeed, our approach uses visual and conceptual information to build a semantic hierarchy that will serve as a backbone of our ontology. Contextual and spatial information about image concepts are then computed and incorporated in the ontology in order to model richer semantic relationships between these concepts. Fuzzy description logics are used as a formalism to represent our ontology and the inherent uncertainty and imprecision of this kind of information. Subsequently, we propose a new approach for image annotation based on hierarchical image classification and a multi-stage reasoning framework for reasoning about the consistency of the produced annotation. In this approach, fuzzy ontological reasoning is used in order to achieve a semantically relevant decision on the belonging of a given image to the set of concepts from the annotation vocabulary. An empirical evaluation of our approach on Pascal VOC'2009 and Pascal VOC'2010 datasets has shown a significant improvement on the average precision results.
引用
收藏
页码:2107 / 2141
页数:35
相关论文
共 47 条
[1]  
[Anonymous], 2003, DESCRIPTION LOGIC HD
[2]  
[Anonymous], 2009, P CVPR
[3]  
[Anonymous], 1999, P 7 IEEE INT C COMPU
[4]  
[Anonymous], 2010, PASCAL VISUAL OBJECT
[5]  
Bannour H., 2012, P 21 ACM INT C INFOR, P2431
[6]  
Bannour H, 2011, CONT BAS MULT IND CB
[7]  
Bannour H, 2012, LECT NOTES COMPUT SC, V7131, P4
[8]   Matching words and pictures [J].
Barnard, K ;
Duygulu, P ;
Forsyth, D ;
de Freitas, N ;
Blei, DM ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1107-1135
[9]  
Bart E, 2008, COMPUTER VISION PATT, P1
[10]   Fuzzy spatial relationships for image processing and interpretation: a review [J].
Bloch, I .
IMAGE AND VISION COMPUTING, 2005, 23 (02) :89-110