Fixation prediction for advertising images: Dataset and benchmark

被引:5
|
作者
Liang, Song [1 ]
Liu, Ruihang [1 ]
Qian, Jiansheng [1 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
关键词
Saliency prediction; Advertising; OCR; Lightweight architecture; SALIENCY DETECTION; VISUAL-ATTENTION; EYE FIXATIONS; PICTORIAL; SCENES; BRAND; TEXT;
D O I
10.1016/j.jvcir.2021.103356
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing saliency prediction methods focus on exploring a universal saliency model for natural images, relatively few on advertising images which typically consists of both textual regions and pictorial regions. To fill this gap, we first build an advertising image database, named ADD1000, recording 57 subjects' eye movement data of 1000 ad images. Compared to natural images, advertising images contain more artificial scenarios and show stronger persuasiveness and deliberateness, while the impact of this scene heterogeneity on visual attention is rarely studied. Moreover, text elements and picture elements express closely related semantic information to highlight product or brand in ad images, while their respective contribution to visual attention is also less known. Motivated by these, we further propose a saliency prediction model for advertising images based on text enhanced learning (TEL-SP), which comprehensively considers the interplay between textual region and pictorial region. Extensive experiments on the ADD1000 database show that the proposed model outperforms existing state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A benchmark GaoFen-7 dataset for building extraction from satellite images
    Peimin Chen
    Huabing Huang
    Feng Ye
    Jinying Liu
    Weijia Li
    Jie Wang
    Zixuan Wang
    Chong Liu
    Ning Zhang
    Scientific Data, 11
  • [22] Perceptual Quality Assessment of Enhanced Colonoscopy Images: A Benchmark Dataset and an Objective Method
    Yue, Guanghui
    Cheng, Di
    Zhou, Tianwei
    Hou, Jingwen
    Liu, Weide
    Xu, Long
    Wang, Tianfu
    Cheng, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5549 - 5561
  • [23] A new dataset of dog breed images and a benchmark for fine-grained classification
    Ding-Nan Zou
    Song-Hai Zhang
    Tai-Jiang Mu
    Min Zhang
    Computational Visual Media, 2020, 6 (04) : 477 - 487
  • [24] An Evolutionary Shadow Correction Network and a Benchmark UAV Dataset for Remote Sensing Images
    Luo, Shuang
    Li, Huifang
    Li, Yiqiu
    Shao, Chenglin
    Shen, Huanfeng
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [25] Epitope Prediction Based on Random Peptide Library Screening: Benchmark Dataset and Prediction Tools Evaluation
    Sun, Pingping
    Chen, Wenhan
    Huang, Yanxin
    Wang, Hongyan
    Ma, Zhiqiang
    Lv, Yinghua
    MOLECULES, 2011, 16 (06): : 4971 - 4993
  • [26] UrbanEV: An Open Benchmark Dataset for Urban Electric Vehicle Charging Demand Prediction
    Li, Han
    Qu, Haohao
    Tan, Xiaojun
    You, Linlin
    Zhu, Rui
    Fan, Wenqi
    SCIENTIFIC DATA, 2025, 12 (01)
  • [27] nablaDFT: Large-Scale Conformational Energy and Hamiltonian Prediction benchmark and dataset
    Khrabrov, Kuzma
    Shenbin, Ilya
    Ryabov, Alexander
    Tsypin, Artem
    Telepov, Alexander
    Alekseev, Anton
    Grishin, Alexander
    Strashnov, Pavel
    Zhilyaev, Petr
    Nikolenko, Sergey
    Kadurin, Artur
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2022, 24 (42) : 25853 - 25863
  • [28] A Churn Prediction Dataset from the Telecom Sector: A New Benchmark for Uplift Modeling
    Verhelst, Theo
    Mercier, Denis
    Shestha, Jeevan
    Bontempi, Gianluca
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT IV, 2025, 2136 : 292 - 299
  • [29] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
    Hua, Yuansheng
    Mou, Lichao
    Jin, Pu
    Zhu, Xiao Xiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [30] SID4VAM: A Benchmark Dataset with Synthetic Images for Visual Attention Modeling
    Berga, David
    Fdez-Vidal, Xose R.
    Otazu, Xavier
    Pardo, Xose M.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8788 - 8797