Learning from ambiguous labels for X-Ray security inspection via weakly supervised correction

Cited by: 2

Authors:

Wang, Wei [1,2]

He, Linyang [1]

Cheng, Guohua [1]

Wen, Ting [1]

Tian, Yan [3]

Affiliations:

[1] Zhejiang PeckerAI Technol Ltd, Hangzhou 310051, Zhejiang, Peoples R China

[2] Southeast Univ, Sch Automat, Sipailou Rd, Nanjing 210000, Jiangsu, Peoples R China

[3] Zhejiang Gongshang Univ, Sch Comp Sci & Technol, Xuezheng Rd, Hangzhou 310018, Zhejiang, Peoples R China

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2023 / Volume 83 / Issue 2

Funding:

National Natural Science Foundation of China;

Keywords:

Security inspection; Computer vision; Deep learning; Object detection;

DOI:

10.1007/s11042-023-15299-9

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

X-ray security inspection has been dominated by supervised learning detectors for several years. The extreme angles, overlapping occlusion, and diversity of inspected items cause ambiguous objects to appear, bringing ambiguous labels to the training processes of the supervised learning network. It is well known that the training performance of a supervised learning detector is extremely dependent on the quality of the labels. Human-annotated labels are less reliable and more inconsistent due to the loss of key features of ambiguous objects. With the increase in the proportion of unreliable labels, highly negative effects are imposed on contraband detection. To mitigate this problem, an end-to-end weakly supervised correction (WSC) method with three modules for denoising and rectifying ambiguous labels is proposed. (1) X-ray energy awareness blending (X-Blending) extracts ambiguous images and reliable images during each iteration and mixes them into a single image, which improves the stability and efficiency of ambiguous image training. (2) A weakly supervised head (WSH) is embedded in the supervised detector to rectify the noise labels of ambiguous objects. (3) An adaptive label corrector (ALC) dynamically combines object similarity and confidence measures to generate credible labels and reweights factors to adjust sample contributions. WSC is the first work to achieve end-to-end ambiguous label rectification in the field of contraband detection. Different from traditional contraband detection models, WSC innovatively combines weakly supervised learning to provide more prior knowledge for uncertainty label learning and obtain effective feature information from ambiguous objects. When applied to Faster R-CNN, experimental validations show that WSC increases the average precision (AP) by 3.3% and 4.5% on the EDXray and PIDray datasets.


Pages: 6319-6334

Page count: 16

