Uncertain region mining semi-supervised object detection

被引:0
作者
Yin, Tianxiang [1 ,2 ]
Liu, Ningzhong [1 ,2 ]
Sun, Han [1 ,2 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Jiangsu, Peoples R China
[2] MIIT, Key Lab Pattern Anal & Machine Intelligence, Nanjing 211106, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised; Object detection; Deep learning;
D O I
10.1007/s10489-023-05246-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning uses a small amount of labeled data to guide the model and a large amount of unlabeled data to improve its performance. Most semi-supervised object detection methods build a teacher-student architecture and train the student network with pseudo-labels generated by the teacher. To guarantee the accuracy of pseudo-labels, a high threshold value is always applied to filter out all of the low-scoring labels according to the inference results of the teacher. We argue that these discarded labels with low scores have more information for the current model and should be taken into consideration in the training phase. Further, we propose an uncertain region mining(URM) framework that utilizes these uncertainty low confidence labels. Especially, URM exploits the uncertain labels from two aspects: (1)Recalling the underlying correct labels: URM designs a fusion function that rectifies the outputs of the teacher with the student and recalls the high-quality pseudo-labels. (2)Avoiding the error information of the error labels: URM proposes a negative loss function that utilizes the uncertain labels without introducing error information. For the regression task, a new branch is attached to the detector to predict the localization scores of bounding boxes. Based on the predicted scores, we propose a re-weighting strategy that alleviates the noisy problem from the imprecise localization of the bounding boxes. Experiments on PASCAL-VOC, MS-COCO and DOTA datasets demonstrate the effectiveness of our proposed method.
引用
收藏
页码:2300 / 2313
页数:14
相关论文
共 51 条
  • [1] Ahmed W, 2022, P IEEECVF WINTER C A, ppp1616
  • [2] Ai W., 2023, APPL INTELL, P1
  • [3] Allabadi G, 2023, ARXIV
  • [4] [Anonymous], 2012, VOC2012 RESULTS
  • [5] 5G K-Simulator of Flexible, Open, Modular (FOM) Structure and Web-based 5G K-SimPlatform
    Baek, Jaeuk
    Bae, Jimin
    Kim, Yongjae
    Lim, Jinteak
    Park, Eunhye
    Lee, Jaehyeok
    Lee, Gyujae
    Han, Sang Ik
    Chu, Chol
    Han, Youngnam
    [J]. 2019 16TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2019,
  • [6] Berthelot D, 2019, ADV NEUR IN, V32
  • [7] Boyd S., 2004, Convex Optimization, DOI 10.1017/CBO9780511804441
  • [8] Chen B, 2022, P IEEECVF C COMPUTER
  • [9] Dense Learning based Semi-Supervised Object Detection
    Chen, Binghui
    Li, Pengyu
    Chen, Xiang
    Wang, Biao
    Zhang, Lei
    Hua, Xian-Sheng
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4805 - 4814
  • [10] Negative-ResNet: noisy ambulatory electrocardiogram signal classification scheme
    Chen, Zijiao
    Lin, Zihuai
    Wang, Peng
    Ding, Ming
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (14) : 8857 - 8869