Learning from ambiguous labels for X-Ray security inspection via weakly supervised correction

Cited by: 2
Authors
Wang, Wei [1, 2]
He, Linyang [1 ]
Cheng, Guohua [1 ]
Wen, Ting [1 ]
Tian, Yan [3 ]
Affiliations
[1] Zhejiang PeckerAI Technol Ltd, Hangzhou 310051, Zhejiang, Peoples R China
[2] Southeast Univ, Sch Automat, Sipailou Rd, Nanjing 210000, Jiangsu, Peoples R China
[3] Zhejiang Gongshang Univ, Sch Comp Sci & Technol, Xuezheng Rd, Hangzhou 310018, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Security inspection; Computer vision; Deep learning; Object detection;
DOI
10.1007/s11042-023-15299-9
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
X-ray security inspection has been dominated by supervised learning detectors for several years. Extreme viewing angles, overlapping occlusion, and the diversity of inspected items cause ambiguous objects to appear, which introduces ambiguous labels into the training of supervised networks. It is well known that the training performance of a supervised learning detector depends heavily on the quality of the labels. Because ambiguous objects lose key features, human-annotated labels for them are less reliable and less consistent. As the proportion of unreliable labels increases, contraband detection is strongly degraded. To mitigate this problem, an end-to-end weakly supervised correction (WSC) method with three modules for denoising and rectifying ambiguous labels is proposed. (1) X-ray energy awareness blending (X-Blending) extracts ambiguous images and reliable images during each iteration and mixes them into a single image, which improves the stability and efficiency of ambiguous image training. (2) A weakly supervised head (WSH) is embedded in the supervised detector to rectify the noisy labels of ambiguous objects. (3) An adaptive label corrector (ALC) dynamically combines object similarity and confidence measures to generate credible labels and reweights factors to adjust sample contributions. WSC is the first work to achieve end-to-end ambiguous label rectification in the field of contraband detection. Unlike traditional contraband detection models, WSC incorporates weakly supervised learning to provide more prior knowledge for uncertain-label learning and to obtain effective feature information from ambiguous objects. When applied to Faster R-CNN, experimental validations show that WSC increases the average precision (AP) by 3.3% and 4.5% on the EDXray and PIDray datasets, respectively.
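The abstract gives no formulas, so the following is only an illustrative sketch of two of the ideas it names: mixing an ambiguous image with a reliable one, and combining similarity with confidence to pick a label and a sample weight. Everything here is an assumption made for illustration and not the authors' formulation: the linear (mixup-style) blend, the weighted sum controlled by `alpha`, and the function names `blend` and `correct_label` are all hypothetical.

```python
from typing import Dict, List, Tuple

Image = List[List[float]]


def blend(ambiguous: Image, reliable: Image, lam: float) -> Image:
    """Pixel-wise linear mix of two equally sized images.

    Assumption: a simple mixup-style blend stands in for X-Blending,
    whose actual energy-aware rule is not given in the abstract.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must be in [0, 1]")
    return [
        [lam * a + (1.0 - lam) * r for a, r in zip(row_a, row_r)]
        for row_a, row_r in zip(ambiguous, reliable)
    ]


def correct_label(
    annotated: str,
    confidence: Dict[str, float],
    similarity: Dict[str, float],
    alpha: float = 0.5,
) -> Tuple[str, float]:
    """Return (label, sample_weight) for one ambiguous object.

    confidence: per-class detector scores (hypothetical input).
    similarity: per-class similarity to reliable objects (hypothetical input).
    alpha: assumed mixing coefficient between the two measures.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    classes = set(confidence) | set(similarity)
    combined = {
        c: alpha * confidence.get(c, 0.0) + (1.0 - alpha) * similarity.get(c, 0.0)
        for c in classes
    }
    # Sort first so ties are broken deterministically by class name.
    best = max(sorted(combined), key=lambda c: combined[c])
    # Keep the human annotation unless another class scores strictly higher.
    if combined[best] > combined.get(annotated, 0.0):
        label = best
    else:
        label = annotated
    # Assumed reweighting: the combined score of the chosen label.
    return label, combined.get(label, 0.0)
```

In this sketch the returned weight would scale the loss term of the sample, so objects whose corrected label is weakly supported contribute less to training.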
Pages: 6319-6334
Page count: 16