Automatic Image Dataset Construction from Click-through Logs Using Deep Neural Network

被引:20
|
作者
Bai, Yalong [1 ]
Yang, Kuiyuan [2 ]
Yu, Wei [1 ]
Xu, Chang [3 ]
Ma, Wei-Ying [2 ]
Zhao, Tiejun [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Microsoft Res, Beijing 100080, Peoples R China
[3] Nankai Univ, Coll Comp & Control Engn, Tianjin 300071, Peoples R China
关键词
Automatic Image Dataset Construction; Image Representation; Word Representation; Deep Learning; DATABASE;
D O I
10.1145/2733373.2806243
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Labelled image datasets are the backbone for high-level image understanding tasks with wide application scenarios, and continuously drive and evaluate the progress of feature designing and supervised learning models. Recently, the million scale labelled image dataset further contributes to the rebirth of deep convolutional neural network and bypass manual designing handcraft features. However, the construction process of image dataset is mainly manual-based and quite labor intensive, which often take years' efforts to construct a million scale dataset with high quality. In this paper, we propose a deep learning based method to construct large scale image dataset in an automatic way. Specifically, word representation and image representation are learned in a deep neural network from large amount of click-through logs, and further used to define word-word similarity and image-word similarity. These two similarities are used to automatize the two labor intensive steps in manual-based image dataset construction: query formation and noisy image removal. With a new proposed cross convolutional filter regularizer, we can construct a million scale image dataset in one week. Finally, two image datasets are constructed to verify the effectiveness of the method. In addition to scale, the automatically constructed dataset has comparable accuracy, diversity and cross-dataset generalization with manually labelled image datasets.
引用
收藏
页码:441 / 450
页数:10
相关论文
共 50 条
  • [1] Fine-Grained Image Recognition from Click-Through Logs Using Deep Siamese Network
    Feng, Wu
    Liu, Dong
    MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 127 - 138
  • [2] Mining Latent Attributes From Click-Through Logs for Image Recognition
    Lu, Yi-Jie
    Yang, Linjun
    Yang, Kuiyuan
    Rui, Yong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (08) : 1213 - 1224
  • [3] Adaptive Deep Neural Network for Click-Through Rate estimation
    Zeng, Wei
    Zhao, Wenhai
    Bai, Xiaoxuan
    Sun, Hongbin
    He, Yixin
    Yong, Wangqianwei
    Luo, Yonggang
    Han, Sanchu
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [4] Click-through rate prediction model based on a deep neural network
    Liu, Hong-Li
    Wu, Sen
    Wei, Gui-Ying
    Li, Xin
    Gao, Xiao-Nan
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2022, 44 (11): : 1917 - 1925
  • [5] Deep Field Relation Neural Network for click-through rate prediction
    Zou, Dafang
    Wang, Zidong
    Zhang, Leimin
    Zou, Jinting
    Li, Qi
    Chen, Yun
    Sheng, Weiguo
    INFORMATION SCIENCES, 2021, 577 : 128 - 139
  • [6] Deep Interest Network for Click-Through Rate Prediction
    Zhou, Guorui
    Zhu, Xiaoqiang
    Song, Chengru
    Fan, Ying
    Zhu, Han
    Ma, Xiao
    Yan, Yanghui
    Jin, Junqi
    Li, Han
    Gai, Kun
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1059 - 1068
  • [7] Deep Interest Context Network for Click-Through Rate
    Yu, Mingting
    Liu, Tingting
    Yin, Jian
    Chai, Peilin
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [8] Automatic Generation of Social Event Storyboard From Image Click-Through Data
    Xu, Jun
    Mei, Tao
    Cai, Rui
    Li, Houqiang
    Rui, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (01) : 242 - 253
  • [9] Deep Pattern Network for Click-Through Rate Prediction
    Zhang, Hengyu
    Pan, Junwei
    Liu, Dapeng
    Jiang, Jie
    Li, Xiu
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1189 - 1199
  • [10] Automatic query recommendation using click-through data
    Dupret, Georges
    Mendoza, Marcelo
    PROFESSIONAL PRACTICE IN ARTIFICIAL INTELLIGENCE, 2006, 218 : 303 - +