Automatic Image Dataset Construction from Click-through Logs Using Deep Neural Network

Cited by: 20
Authors
Bai, Yalong [1]
Yang, Kuiyuan [2]
Yu, Wei [1]
Xu, Chang [3]
Ma, Wei-Ying [2]
Zhao, Tiejun [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Microsoft Res, Beijing 100080, Peoples R China
[3] Nankai Univ, Coll Comp & Control Engn, Tianjin 300071, Peoples R China
Keywords
Automatic Image Dataset Construction; Image Representation; Word Representation; Deep Learning; DATABASE;
DOI
10.1145/2733373.2806243
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Labelled image datasets are the backbone of high-level image understanding tasks with wide application scenarios, and they continuously drive and evaluate progress in feature design and supervised learning models. Recently, million-scale labelled image datasets have further contributed to the rebirth of deep convolutional neural networks, which bypass the manual design of handcrafted features. However, image dataset construction is mainly manual and quite labor intensive, and it often takes years of effort to construct a high-quality million-scale dataset. In this paper, we propose a deep learning based method to construct large-scale image datasets automatically. Specifically, word representations and image representations are learned in a deep neural network from a large amount of click-through logs and are then used to define word-word similarity and image-word similarity. These two similarities are used to automate the two labor-intensive steps in manual image dataset construction: query formation and noisy image removal. With a newly proposed cross convolutional filter regularizer, we can construct a million-scale image dataset in one week. Finally, two image datasets are constructed to verify the effectiveness of the method. In addition to scale, the automatically constructed datasets have accuracy, diversity, and cross-dataset generalization comparable to those of manually labelled image datasets.
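The two automated steps named in the abstract, query formation from word-word similarity and noisy image removal from image-word similarity, can be illustrated with a minimal sketch. The abstract does not say how the similarities are computed, so this sketch assumes cosine similarity over word and image vectors that share one embedding space. The function names, thresholds, and toy vectors are all hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_queries(concept, word_vecs, threshold=0.8):
    """Query formation: keep vocabulary words whose vector is close to the concept word."""
    target = word_vecs[concept]
    return sorted(
        w for w, v in word_vecs.items()
        if w != concept and cosine(target, v) >= threshold
    )


def filter_images(concept, image_vecs, word_vecs, threshold=0.5):
    """Noisy image removal: keep images whose vector is close to the concept word."""
    target = word_vecs[concept]
    return sorted(
        name for name, v in image_vecs.items()
        if cosine(v, target) >= threshold
    )


# Toy 2-D embeddings, made up for illustration only.
word_vecs = {"dog": [1.0, 0.0], "puppy": [0.9, 0.1], "car": [0.0, 1.0]}
image_vecs = {"img_a": [0.8, 0.2], "img_b": [0.1, 0.9]}

queries = expand_queries("dog", word_vecs)        # candidate search queries
kept = filter_images("dog", image_vecs, word_vecs)  # images retained for "dog"
```

In this toy setup, "puppy" is selected as an additional query for the concept "dog", and only `img_a` survives filtering. In the paper's setting both kinds of vectors would come from the network trained on click-through logs rather than from hand-written values.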
Pages: 441 - 450
Page count: 10