Pseudo Label Fusion With Uncertainty Estimation for Semi-Supervised Cropping Box Regression

被引：0

作者：

Pan, Zhiyu ^{[1
]}

Cui, Jiahao ^{[1
]}

Wang, Kewei ^{[1
]}

Wu, Yizheng ^{[1
]}

Cao, Zhiguo ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Key Lab Image Proc & Intelligent Control, Sch Artificial Intelligence & Automat, Minist Educ, Wuhan 430074, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

关键词：

Task analysis; Uncertainty; Annotations; Semisupervised learning; Object detection; Data models; Multitasking; Image cropping; cropping box regression; semi-supervised learning; uncertainty estimation;

D O I：

10.1109/TMM.2024.3377125

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cropping box regression algorithms re-frame the images with predicted cropping boxes for better composition quality, which can save considerable manpower and time for massive image retouching work. Yet, recent learning-based cropping box regression algorithms require expert annotations, which makes the scale of training limited. This consequently incurs a performance bottleneck. To address this issue, previous works seek the help from auxiliary datasets of related tasks, e.g., the composition classification. However, the domain gap between related tasks and the likewise restricted scale of auxiliary datasets are still limiting factors. Hence, our work provides a novel semi-supervised framework that can learn better re-framing knowledge with unlimited unlabeled data. We make use of the unlabeled data via pseudo-labeling, where the model learns from the pseudo labels generated from a temporal ensemble version of itself. To prevent the model learns from its own mistakes, a.k.a. the problem of confirmation bias, we propose to rectify the mistakes by fusing multiple candidate pseudo labels into the better ones. The fusion procedure is based on the uncertainty estimation for each boundary of the candidate cropping boxes. The multiple candidates are from the proposed aesthetic region proposal network. Extensive experimental results explain how the uncertainty-based pseudo label fusion procedure overcomes the confirmation bias and demonstrate the superiority of our semi-supervised cropping box regression framework.

引用

页码：8157 / 8171

页数：15

共 65 条

[1] Bennett KP, 1999, ADV NEUR IN, V11, P368
[2] Burns C, 2023, Arxiv, DOI [arXiv:2312.09390, DOI 10.48550/ARXIV.2312.09390]
[3] Label Matching Semi-Supervised Object Detection
Chen, Binbin
Chen, Weijie
Yang, Shicai
Xuan, Yunyi
Song, Jie
Xie, Di
Pu, Shiliang
Song, Mingli
Zhuang, Yueting
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14361 - 14370
[4] Temporal Self-Ensembling Teacher for Semi-Supervised Object Detection
Chen, Cong
Dong, Shouyang
Tian, Ye
Cao, Kunlin
Liu, Li
Guo, Yuanhao
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3679 - 3692
[5] A visual attention model for adapting images on small displays
Chen, LQ
Xie, X
Fan, X
Ma, WY
Zhang, HJ
Zhou, HQ
[J]. MULTIMEDIA SYSTEMS, 2003, 9 (04) : 353 - 364
[6] Learning to Compose with Professional Photographs on the Web
Chen, Yi-Ling
Klopp, Jan
Sun, Min
Chien, Shao-Yi
Ma, Kwan-Liu
[J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 37 - 45
[7] Quantitative Analysis of Automatic Image Cropping Algorithms: A Dataset and Comparative Study
Chen, Yi-Ling
Huang, Tzu-Wei
Chang, Kai-Han
Tsai, Yu-Chen
Chen, Hwann-Tzong
Chen, Bing-Yu
[J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 226 - 234
[8] Cheng B., 2010, P 18 ACM INT C MULT, P291
[9] Graph-based semi-supervised learning: A review
Chong, Yanwen
Ding, Yun
Yan, Qing
Pan, Shaoming
[J]. NEUROCOMPUTING, 2020, 408 (408) : 216 - 230
[10] An overview on semi-supervised support vector machine
Ding, Shifei
Zhu, Zhibin
Zhang, Xiekai
[J]. NEURAL COMPUTING & APPLICATIONS, 2017, 28 (05) : 969 - 978

← 1 2 3 4 5 6 7 →