Fixation prediction for advertising images: Dataset and benchmark

被引：5

作者：

Liang, Song ^{[1
]}

Liu, Ruihang ^{[1
]}

Qian, Jiansheng ^{[1
]}

机构：

[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2021年 / 81卷

关键词：

Saliency prediction; Advertising; OCR; Lightweight architecture; SALIENCY DETECTION; VISUAL-ATTENTION; EYE FIXATIONS; PICTORIAL; SCENES; BRAND; TEXT;

D O I：

10.1016/j.jvcir.2021.103356

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing saliency prediction methods focus on exploring a universal saliency model for natural images, relatively few on advertising images which typically consists of both textual regions and pictorial regions. To fill this gap, we first build an advertising image database, named ADD1000, recording 57 subjects' eye movement data of 1000 ad images. Compared to natural images, advertising images contain more artificial scenarios and show stronger persuasiveness and deliberateness, while the impact of this scene heterogeneity on visual attention is rarely studied. Moreover, text elements and picture elements express closely related semantic information to highlight product or brand in ad images, while their respective contribution to visual attention is also less known. Motivated by these, we further propose a saliency prediction model for advertising images based on text enhanced learning (TEL-SP), which comprehensively considers the interplay between textual region and pictorial region. Extensive experiments on the ADD1000 database show that the proposed model outperforms existing state-of-the-art methods.

引用

页数：14

共 50 条

[21] A benchmark GaoFen-7 dataset for building extraction from satellite images
Peimin Chen
Huabing Huang
Feng Ye
Jinying Liu
Weijia Li
Jie Wang
Zixuan Wang
Chong Liu
Ning Zhang
Scientific Data, 11
[22] Perceptual Quality Assessment of Enhanced Colonoscopy Images: A Benchmark Dataset and an Objective Method
Yue, Guanghui
Cheng, Di
Zhou, Tianwei
Hou, Jingwen
Liu, Weide
Xu, Long
Wang, Tianfu
Cheng, Jun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5549 - 5561
[23] A new dataset of dog breed images and a benchmark for fine-grained classification
Ding-Nan Zou
Song-Hai Zhang
Tai-Jiang Mu
Min Zhang
Computational Visual Media, 2020, 6 (04) : 477 - 487
[24] An Evolutionary Shadow Correction Network and a Benchmark UAV Dataset for Remote Sensing Images
Luo, Shuang
Li, Huifang
Li, Yiqiu
Shao, Chenglin
Shen, Huanfeng
Zhang, Liangpei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[25] Epitope Prediction Based on Random Peptide Library Screening: Benchmark Dataset and Prediction Tools Evaluation
Sun, Pingping
Chen, Wenhan
Huang, Yanxin
Wang, Hongyan
Ma, Zhiqiang
Lv, Yinghua
MOLECULES, 2011, 16 (06): : 4971 - 4993
[26] UrbanEV: An Open Benchmark Dataset for Urban Electric Vehicle Charging Demand Prediction
Li, Han
Qu, Haohao
Tan, Xiaojun
You, Linlin
Zhu, Rui
Fan, Wenqi
SCIENTIFIC DATA, 2025, 12 (01)
[27] nablaDFT: Large-Scale Conformational Energy and Hamiltonian Prediction benchmark and dataset
Khrabrov, Kuzma
Shenbin, Ilya
Ryabov, Alexander
Tsypin, Artem
Telepov, Alexander
Alekseev, Anton
Grishin, Alexander
Strashnov, Pavel
Zhilyaev, Petr
Nikolenko, Sergey
Kadurin, Artur
PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2022, 24 (42) : 25853 - 25863
[28] A Churn Prediction Dataset from the Telecom Sector: A New Benchmark for Uplift Modeling
Verhelst, Theo
Mercier, Denis
Shestha, Jeevan
Bontempi, Gianluca
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT IV, 2025, 2136 : 292 - 299
[29] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
Hua, Yuansheng
Mou, Lichao
Jin, Pu
Zhu, Xiao Xiang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[30] SID4VAM: A Benchmark Dataset with Synthetic Images for Visual Attention Modeling
Berga, David
Fdez-Vidal, Xose R.
Otazu, Xavier
Pardo, Xose M.
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8788 - 8797

← 1 2 3 4 5 →