Metamorphic Object Insertion for Testing Object Detection Systems

Cited by: 57
Authors
Wang, Shuai [1]
Su, Zhendong [2]
Affiliations
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Swiss Fed Inst Technol, Zurich, Switzerland
Source
2020 35TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2020) | 2020
Keywords
testing; computer vision; object detection; deep neural networks
DOI
10.1145/3324884.3416584
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recent advances in deep neural networks (DNNs) have led to object detectors (ODs) that can rapidly process pictures or videos, and recognize the objects that they contain. Despite the promising progress by industrial manufacturers such as Amazon and Google in commercializing deep learning-based ODs as a standard computer vision service, ODs - similar to traditional software - may still produce incorrect results. These errors, in turn, can lead to severe negative outcomes for the users. For instance, an autonomous driving system that fails to detect pedestrians can cause accidents or even fatalities. However, despite their importance, principled, systematic methods for testing ODs do not yet exist. To fill this critical gap, we introduce the design and realization of METAOD, a metamorphic testing system specifically designed for ODs to effectively uncover erroneous detection results. To this end, we (1) synthesize natural-looking images by inserting extra object instances into background images, and (2) design metamorphic conditions asserting the equivalence of OD results between the original and synthetic images after excluding the prediction results on the inserted objects. METAOD is designed as a streamlined workflow that performs object extraction, selection, and insertion. We develop a set of practical techniques to realize an effective workflow, and generate diverse, natural-looking images for testing. Evaluated on four commercial OD services and four pretrained models provided by the TensorFlow API, METAOD found tens of thousands of detection failures. To further demonstrate the practical usage of METAOD, we use the synthetic images that cause erroneous detection results to retrain the model. Our results show that the model performance is significantly increased, from an mAP score of 9.3 to an mAP score of 10.5.
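The metamorphic condition described in the abstract can be illustrated with a minimal sketch. This is not METAOD's implementation: the `Detection` type, the `relation_holds` function, and the IoU threshold of 0.5 used for matching are all hypothetical choices made for illustration. The sketch drops detections that coincide with the inserted object in the synthetic image, then checks that the remaining detections match those on the original image one-to-one.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Axis-aligned box as (x_min, y_min, x_max, y_max); an assumed convention.
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class Detection:
    """Hypothetical container for one object detector result."""
    label: str
    box: Box


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes; 0.0 if they do not overlap."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def relation_holds(
    original: List[Detection],
    synthetic: List[Detection],
    inserted_box: Box,
    threshold: float = 0.5,  # assumed matching threshold, not from the paper
) -> bool:
    """Return True if the detections agree once the inserted object is excluded."""
    # Exclude predictions that correspond to the inserted object.
    remaining = [d for d in synthetic if iou(d.box, inserted_box) < threshold]
    if len(remaining) != len(original):
        return False
    # Greedily match each original detection to an unused remaining one.
    unmatched = list(remaining)
    for det in original:
        match = next(
            (c for c in unmatched
             if c.label == det.label and iou(c.box, det.box) >= threshold),
            None,
        )
        if match is None:
            return False  # an original object was lost or relabeled
        unmatched.remove(match)
    return True
```

In this sketch, a `False` result flags a candidate detection failure: inserting the object changed what the detector reports for the rest of the image. A real system would also need to decide how to treat confidence scores and near-threshold boxes, which this example leaves out.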
Pages: 1053-1065
Page count: 13