Web and Personal Image Annotation by Mining Label Correlation With Relaxed Visual Graph Embedding

被引:88
|
作者
Yang, Yi [1 ]
Wu, Fei [2 ]
Nie, Feiping [3 ]
Shen, Heng Tao [4 ]
Zhuang, Yueting [2 ]
Hauptmann, Alexander G. [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
[3] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[4] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
基金
美国国家科学基金会;
关键词
Label correlation mining; multilabel learning; personal album labeling; semisupervised learning; web image annotation; FRAMEWORK; RETRIEVAL;
D O I
10.1109/TIP.2011.2169269
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The number of digital images rapidly increases, and it becomes an important challenge to organize these resources effectively. As a way to facilitate image categorization and retrieval, automatic image annotation has received much research attention. Considering that there are a great number of unlabeled images available, it is beneficial to develop an effective mechanism to leverage unlabeled images for large-scale image annotation. Meanwhile, a single image is usually associated with multiple labels, which are inherently correlated to each other. A straightforward method of image annotation is to decompose the problem into multiple independent single-label problems, but this ignores the underlying correlations among different labels. In this paper, we propose a new inductive algorithm for image annotation by integrating label correlation mining and visual similarity mining into a joint framework. We first construct a graph model according to image visual features. A multilabel classifier is then trained by simultaneously uncovering the shared structure common to different labels and the visual graph embedded label prediction matrix for image annotation. We show that the globally optimal solution of the proposed framework can be obtained by performing generalized eigen-decomposition. We apply the proposed framework to both web image annotation and personal album labeling using the NUS-WIDE, MSRA MM 2.0, and Kodak image data sets, and the AUC evaluation metric. Extensive experiments on large-scale image databases collected from the web and personal album show that the proposed algorithm is capable of utilizing both labeled and unlabeled data for image annotation and outperforms other algorithms.
引用
收藏
页码:1339 / 1351
页数:13
相关论文
共 50 条
  • [1] Automatic image annotation by visual topic discovering and web mining
    Liu, Zheng
    Ma, Jun
    Journal of Information and Computational Science, 2010, 7 (02): : 365 - 371
  • [2] Automatic image annotation by mining the Web
    Gong, Zhiguo
    Liu, Qian
    Zhang, Jingbai
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 449 - 458
  • [3] Adaptive Graph Guided Embedding for Multi-label Annotation
    Wang, Lichen
    Ding, Zhengming
    Fu, Yun
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2798 - 2804
  • [4] Mining web data for image semantic annotation
    Basili, Roberto
    Petitti, Riccardo
    Saracino, Dario
    AI(ASTERISK)IA 2007: ARTIFICIAL INTELLIGENCE AND HUMAN-ORIENTED COMPUTING, 2007, 4733 : 674 - +
  • [5] A Renewed Image Annotation Baseline by Image Embedding and Tag Correlation
    Liu, Rujie
    Wang, Yuchong
    Yu, Tao
    Naoi, Satoshi
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3216 - 3219
  • [6] Multimedia automatic annotation by mining label set correlation
    Feng Tian
    Xukun Shen
    Xianmei Liu
    Multimedia Tools and Applications, 2018, 77 : 3473 - 3491
  • [7] Multimedia automatic annotation by mining label set correlation
    Tian, Feng
    Shen, Xukun
    Liu, Xianmei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 3473 - 3491
  • [8] Label Embedding Based on Interclass Correlation Mining for Remote Sensing Image Scene Classification
    He, Jiang
    Zhang, Yao
    Gao, Peng
    Tian, Jinwen
    Tian, Tian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [9] SLED: Semantic Label Embedding Dictionary Representation for Multilabel Image Annotation
    Cao, Xiaochun
    Zhang, Hua
    Guo, Xiaojie
    Liu, Si
    Meng, Dan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (09) : 2746 - 2759
  • [10] Mining multiple visual appearances of semantics for image annotation
    Tan, Hung-Khoon
    Ngo, Chong-Wah
    ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 269 - 278