Accurate and Robust Object Detection via Selective Adversarial Learning With Constraints

被引:0
作者
Chen, Jianpin [1 ]
Li, Heng [1 ]
Gao, Qi [1 ]
Liang, Junling [1 ]
Zhang, Ruipeng [1 ]
Yin, Liping [2 ]
Chai, Xinyu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Biomed Engn, Shanghai 200240, Peoples R China
[2] Tech Ctr Anim Plant & Food Inspection & Quarantine, Shanghai 200002, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Detectors; Training; Robustness; Object detection; Accuracy; Adversarial machine learning; Standards; Pipelines; Perturbation methods; Degradation; image degradation; adversarial training; multitask learning;
D O I
10.1109/TIP.2024.3470529
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
ConvNet-based object detection networks have achieved outstanding performance on clean images. However, many works have shown that these detectors perform poorly on corrupted images caused by noises, blurs, poor weather conditions and so on. With the development of security-sensitive applications, the detector's practicability has raised increasing concerns. Existing approaches improve detector robustness via extra operations (image restoration or training on extra labeled data) or by applying adversarial training at the expense of performance degradation on clean images. In this paper, we present Selective Adversarial Learning with Constraints (SALC) as a universal detector training approach to simultaneously improve the detector's precision and robustness. We first propose a unified formulation of adversarial samples for multitask adversarial learning, which significantly diversifies the obtained adversarial samples when integrated into the adversarial training of the detector. Next, we examine our findings on model bias against adversarial attacks of different strengths and differences in Batch Normalization (BN) statistics among clean images and different adversarial samples. On this basis, we propose a batch local comparison strategy with two BN branches to balance the detector's accuracy and robustness. Furthermore, to avoid performance degradation caused by overwhelming subtask losses, we leverage task-aware ratio thresholds to control the influence of learning in each subtask. The proposed approach can be applied to various detectors without any extra labeled data, inference time costs, or model parameters. Extensive experiments show that our SALC achieves state-of-the-art results on both clean benchmarks (Pascal VOC and MS-COCO) and corruption benchmarks (Pascal VOC-C and MS-COCO-C).
引用
收藏
页码:5593 / 5605
页数:13
相关论文
共 57 条
[1]   Class-Aware Robust Adversarial Training for Object Detection [J].
Chen, Pin-Chun ;
Kung, Bo-Han ;
Chen, Jun-Cheng .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10415-10424
[2]   ShapeShifter: Robust Physical Adversarial Attack on Faster R-CNN Object Detector [J].
Chen, Shang-Tse ;
Cornelius, Cory ;
Martin, Jason ;
Chau, Duen Horng .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I, 2019, 11051 :52-68
[3]   Robust and Accurate Object Detection via Adversarial Learning [J].
Chen, Xiangning ;
Xie, Cihang ;
Tan, Mingxing ;
Zhang, Li ;
Hsieh, Cho-Jui ;
Gong, Boqing .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16617-16626
[4]   Robust Object Detection Fusion Against Deception [J].
Chow, Ka-Ho ;
Liu, Ling .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :2703-2713
[5]   Understanding Object Detection Through an Adversarial Lens [J].
Chow, Ka-Ho ;
Liu, Ling ;
Gursoy, Mehmet Emre ;
Truex, Stacey ;
Wei, Wenqi ;
Wu, Yanzhao .
COMPUTER SECURITY - ESORICS 2020, PT II, 2020, 12309 :460-481
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]  
Ding QQ, 2022, Arxiv, DOI arXiv:2206.01381
[8]   Adversarially-Aware Robust Object Detector [J].
Dong, Ziyi ;
Wei, Pengxu ;
Lin, Liang .
COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 :297-313
[9]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[10]  
Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, 10.48550/arXiv.1704.04861]