A two-stage hybrid probabilistic topic model for refining image annotation

被引:0
作者
Dongping Tian
Zhongzhi Shi
机构
[1] Baoji University of Arts and Sciences,Institute of Computer Software
[2] Chinese Academy of Sciences,Key Laboratory of Intelligent Information Processing, Institute of Computing Technology
来源
International Journal of Machine Learning and Cybernetics | 2020年 / 11卷
关键词
Refining image annotation; Semantic gap; Expectation–maximization; PLSA; Max-bisection; Image retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Refining image annotation has become one of the core research topics in computer vision and pattern recognition due to its great potentials in image retrieval. However, it is still in its infancy and is not sophisticated enough to extract perfect semantic concepts just according to the image low-level features. In this paper, we propose a two-stage hybrid probabilistic topic model to improve the quality of automatic image annotation. To start with, a probabilistic latent semantic analysis model with asymmetric modalities is learned to estimate the posterior probabilities of each annotation keyword, during which the image-to-word relation can be well established. Next, a label similarity graph is constructed by a weighted linear combination of label similarity and visual similarity of images associated with the corresponding labels. By this way, the information from image low-level visual features and high-level semantic concepts can be seamlessly integrated by fully taking into account the word-to-word and image-to-image relations. Finally, the rank-two relaxation heuristics is exploited to further mine the correlation of the candidate annotations so as to capture the refining results, which plays a critical role in semantic based image retrieval. Extensive experiments show that the proposed model achieves not only superior annotation accuracy but also better retrieval performance.
引用
收藏
页码:417 / 431
页数:14
相关论文
共 92 条
  • [1] Bhagat P(2018)Image annotation: then and now Image Vis Comput 80 1-23
  • [2] Choudhary P(2013)Enhanced representation and multi-task learning for image annotation Comput Vis Image Underst 117 466-478
  • [3] Binder A(2007)Correlated topic models Ann Appl Stat 1 17-35
  • [4] Samek W(2012)Probabilistic topic models Commun ACM 55 77-84
  • [5] Müller K(2006)Scene classification via PLSA Proc 9th Eur Conf Comput Vis (ECCV’06) 3954 517-530
  • [6] Blei D(2002)Rank-two relaxation heuristics for max-cut and other binary quadratic programs SIAM J Optim 12 503-521
  • [7] Lafferty J(2007)Supervised learning of semantic classes for image annotation and retrieval IEEE Trans Pattern Anal Mach Intell 29 394-410
  • [8] Blei D(2012)An adaptive recognition model for image annotation IEEE Trans Syst Man Cybern Part C 42 1120-1127
  • [9] Bosch A(2013)Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and PLSA Int J Remote Sens 34 45-59
  • [10] Zisserman A(2007)The google similarity distance IEEE Trans Knowl Data Eng 19 370-383