Pseudo Label Fusion With Uncertainty Estimation for Semi-Supervised Cropping Box Regression

被引:0
作者
Pan, Zhiyu [1 ]
Cui, Jiahao [1 ]
Wang, Kewei [1 ]
Wu, Yizheng [1 ]
Cao, Zhiguo [1 ]
机构
[1] Huazhong Univ Sci & Technol, Key Lab Image Proc & Intelligent Control, Sch Artificial Intelligence & Automat, Minist Educ, Wuhan 430074, Peoples R China
关键词
Task analysis; Uncertainty; Annotations; Semisupervised learning; Object detection; Data models; Multitasking; Image cropping; cropping box regression; semi-supervised learning; uncertainty estimation;
D O I
10.1109/TMM.2024.3377125
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cropping box regression algorithms re-frame the images with predicted cropping boxes for better composition quality, which can save considerable manpower and time for massive image retouching work. Yet, recent learning-based cropping box regression algorithms require expert annotations, which makes the scale of training limited. This consequently incurs a performance bottleneck. To address this issue, previous works seek the help from auxiliary datasets of related tasks, e.g., the composition classification. However, the domain gap between related tasks and the likewise restricted scale of auxiliary datasets are still limiting factors. Hence, our work provides a novel semi-supervised framework that can learn better re-framing knowledge with unlimited unlabeled data. We make use of the unlabeled data via pseudo-labeling, where the model learns from the pseudo labels generated from a temporal ensemble version of itself. To prevent the model learns from its own mistakes, a.k.a. the problem of confirmation bias, we propose to rectify the mistakes by fusing multiple candidate pseudo labels into the better ones. The fusion procedure is based on the uncertainty estimation for each boundary of the candidate cropping boxes. The multiple candidates are from the proposed aesthetic region proposal network. Extensive experimental results explain how the uncertainty-based pseudo label fusion procedure overcomes the confirmation bias and demonstrate the superiority of our semi-supervised cropping box regression framework.
引用
收藏
页码:8157 / 8171
页数:15
相关论文
共 65 条
  • [1] Bennett KP, 1999, ADV NEUR IN, V11, P368
  • [2] Burns C, 2023, Arxiv, DOI [arXiv:2312.09390, DOI 10.48550/ARXIV.2312.09390]
  • [3] Label Matching Semi-Supervised Object Detection
    Chen, Binbin
    Chen, Weijie
    Yang, Shicai
    Xuan, Yunyi
    Song, Jie
    Xie, Di
    Pu, Shiliang
    Song, Mingli
    Zhuang, Yueting
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14361 - 14370
  • [4] Temporal Self-Ensembling Teacher for Semi-Supervised Object Detection
    Chen, Cong
    Dong, Shouyang
    Tian, Ye
    Cao, Kunlin
    Liu, Li
    Guo, Yuanhao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3679 - 3692
  • [5] A visual attention model for adapting images on small displays
    Chen, LQ
    Xie, X
    Fan, X
    Ma, WY
    Zhang, HJ
    Zhou, HQ
    [J]. MULTIMEDIA SYSTEMS, 2003, 9 (04) : 353 - 364
  • [6] Learning to Compose with Professional Photographs on the Web
    Chen, Yi-Ling
    Klopp, Jan
    Sun, Min
    Chien, Shao-Yi
    Ma, Kwan-Liu
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 37 - 45
  • [7] Quantitative Analysis of Automatic Image Cropping Algorithms: A Dataset and Comparative Study
    Chen, Yi-Ling
    Huang, Tzu-Wei
    Chang, Kai-Han
    Tsai, Yu-Chen
    Chen, Hwann-Tzong
    Chen, Bing-Yu
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 226 - 234
  • [8] Cheng B., 2010, P 18 ACM INT C MULT, P291
  • [9] Graph-based semi-supervised learning: A review
    Chong, Yanwen
    Ding, Yun
    Yan, Qing
    Pan, Shaoming
    [J]. NEUROCOMPUTING, 2020, 408 (408) : 216 - 230
  • [10] An overview on semi-supervised support vector machine
    Ding, Shifei
    Zhu, Zhibin
    Zhang, Xiekai
    [J]. NEURAL COMPUTING & APPLICATIONS, 2017, 28 (05) : 969 - 978