Learning multi-task local metrics for image annotation

被引:0
|
作者
Xing Xu
Atsushi Shimada
Hajime Nagahara
Rin-ichiro Taniguchi
机构
[1] Kyushu University,Department of Advanced Information and Technology
来源
Multimedia Tools and Applications | 2016年 / 75卷
关键词
Image annotation; Label prediction; Metric learning; Local metric; Multi-task learning;
D O I
暂无
中图分类号
学科分类号
摘要
The goal of image annotation is to automatically assign a set of textual labels to an image to describe the visual contents thereof. Recently, with the rapid increase in the number of web images, nearest neighbor (NN) based methods have become more attractive and have shown exciting results for image annotation. One of the key challenges of these methods is to define an appropriate similarity measure between images for neighbor selection. Several distance metric learning (DML) algorithms derived from traditional image classification problems have been applied to annotation tasks. However, a fundamental limitation of applying DML to image annotation is that it learns a single global distance metric over the entire image collection and measures the distance between image pairs in the image-level. For multi-label annotation problems, it may be more reasonable to measure similarity of image pairs in the label-level. In this paper, we develop a novel label prediction scheme utilizing multiple label-specific local metrics for label-level similarity measure, and propose two different local metric learning methods in a multi-task learning (MTL) framework. Extensive experimental results on two challenging annotation datasets demonstrate that 1) utilizing multiple local distance metrics to learn label-level distances is superior to using a single global metric in label prediction, and 2) the proposed methods using the MTL framework to learn multiple local metrics simultaneously can model the commonalities of labels, thereby facilitating label prediction results to achieve state-of-the-art annotation performance.
引用
收藏
页码:2203 / 2231
页数:28
相关论文
共 50 条
  • [1] Learning multi-task local metrics for image annotation
    Xu, Xing
    Shimada, Atsushi
    Nagahara, Hajime
    Taniguchi, Rin-ichiro
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (04) : 2203 - 2231
  • [2] Enhanced representation and multi-task learning for image annotation
    Binder, Alexander
    Samek, Wojciech
    Mueller, Klaus-Robert
    Kawanabe, Motoaki
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (05) : 466 - 478
  • [3] Asymmetric Multi-Task Learning with Local Transference
    Oliveira, Saullo H. G.
    Goncalves, Andre R.
    Von Zuben, Fernando J.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (05)
  • [4] Multi-task Deep Learning for Image Understanding
    Yu, Bo
    Lane, Ian
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 37 - 42
  • [5] A Hierarchical Multi-Task Learning Framework for Semantic Annotation in Tabular Data
    Wu, Jie
    Hou, Mengshu
    ENTROPY, 2024, 26 (08)
  • [6] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [7] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [8] Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-Task Learning
    Wang, Hua
    Joshi, Dhiraj
    Luo, Jiebo
    Huang, Heng
    Park, Minwoo
    2012 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2012, : 69 - 72
  • [9] Deep multi-task learning for malware image classification
    Bensaoud, Ahmed
    Kalita, Jugal
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
  • [10] Multi-task learning with self-learning weight for image denoising
    Xiang, Qian
    Tang, Yong
    Zhou, Xiangyang
    Journal of Engineering and Applied Science, 2024, 71 (01):