Image annotation via graph learning

被引:132
作者
Liu, Jing [1 ]
Li, Mingjing [2 ]
Liu, Qingshan [1 ]
Lu, Hanqing [1 ]
Ma, Songde [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100080, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph learning; Image annotation; Image similarity; Word correlation;
D O I
10.1016/j.patcog.2008.04.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image annotation has been an active research topic in recent years due to its potential impact on both image understanding and web image search. In this paper, we propose a graph learning framework for image annotation. First, the image-based graph learning is performed to obtain the candidate annotations for each image. In order to capture the complex distribution of image data, we propose a Nearest Spanning Chain (NSC) method to construct the image-based graph, whose edge-weights are derived from the chain-wise statistical information instead of the traditional pairwise similarities. Second, the word-based graph learning is developed to refine the relationships between images and words to get final annotations for each image. To enrich the representation of the word-based graph, we design two types of word correlations based on web search results besides the word co-occurrence in the training set. The effectiveness of the proposed Solution is demonstrated from the experiments on the Corel dataset and a web image dataset. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:218 / 228
页数:11
相关论文
共 25 条
[1]  
[Anonymous], P ADV NEUTR INF PROC
[2]  
[Anonymous], 1997, P 10 RES COMP LING I
[3]  
[Anonymous], 2003, VIPS VISION BASED PA
[4]  
BLEI D, 2003, P 26 INT C RES DEV I
[5]   Supervised learning of semantic classes for image annotation and retrieval [J].
Carneiro, Gustavo ;
Chan, Antoni B. ;
Moreno, Pedro J. ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (03) :394-410
[6]   CBSA: Content-based soft annotation for multimodal image retrieval using Bayes point machines [J].
Chang, E ;
Goh, K ;
Sychay, G ;
Wu, G .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (01) :26-38
[7]  
CUSANO C, 2003, SPIE, V5304, P330
[8]  
Duygulu P, 2002, LECT NOTES COMPUT SC, V2353, P97
[9]  
Feng SL, 2004, PROC CVPR IEEE, P1002
[10]  
Hanghang Tong, 2005, 13th Annual ACM International Conference on Multimedia, P862, DOI 10.1145/1101149.1101337