Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination

被引:2
|
作者
Zhao, Chenyang [1 ]
Hsiao, Janet H. [2 ]
Chan, Antoni B. [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Div Social Sci, Hong Kong, Peoples R China
关键词
Detectors; Visualization; Heat maps; Task analysis; Object detection; Predictive models; Transformers; Deep learning; explainable AI; explaining object detection; gradient-based explanation; human eye gaze; instance-level explanation; knowledge distillation; non-maximum suppression; object discrimination; object specification; NMS;
D O I
10.1109/TPAMI.2024.3380604
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visual explanation technique for interpreting the predictions of object detectors. Utilizing the gradients of detector targets flowing into the intermediate feature maps, ODAM produces heat maps that show the influence of regions on the detector's decision for each predicted attribute. Compared to previous works on classification activation maps (CAM), ODAM generates instance-specific explanations rather than class-specific ones. We show that ODAM is applicable to one-stage, two-stage, and transformer-based detectors with different types of detector backbones and heads, and produces higher-quality visual explanations than the state-of-the-art in terms of both effectiveness and efficiency. We discuss two explanation tasks for object detection: 1) object specification: what is the important region for the prediction? 2) object discrimination: which object is detected? Aiming at these two aspects, we present a detailed analysis of the visual explanations of detectors and carry out extensive experiments to validate the effectiveness of the proposed ODAM. Furthermore, we investigate user trust on the explanation maps, how well the visual explanations of object detectors agrees with human explanations, as measured through human eye gaze, and whether this agreement is related with user trust. Finally, we also propose two applications, ODAM-KD and ODAM-NMS, based on these two abilities of ODAM. ODAM-KD utilizes the object specification of ODAM to generate top-down attention for key predictions and instruct the knowledge distillation of object detection. ODAM-NMS considers the location of the model's explanation for each prediction to distinguish the duplicate detected objects. A training scheme, ODAM-Train, is proposed to improve the quality on object discrimination, and help with ODAM-NMS.
引用
收藏
页码:5967 / 5985
页数:19
相关论文
共 50 条
  • [1] Gradient-based sparse voxel attacks on point cloud object detection
    Wu, Junqi
    Yao, Wen
    Jia, Shuai
    Jiang, Tingsong
    Zhou, Weien
    Ma, Chao
    Chen, Xiaoqian
    PATTERN RECOGNITION, 2025, 160
  • [2] The role of the hippocampus in object discrimination based on visual features
    Levcik, David
    Nekovarova, Tereza
    Antosova, Eliska
    Stuchlik, Ales
    Klement, Daniel
    NEUROBIOLOGY OF LEARNING AND MEMORY, 2018, 155 : 127 - 135
  • [3] Gradient-Based Saliency Maps Are Not Trustworthy Visual Explanations of Automated AI Musculoskeletal Diagnoses
    Venkatesh, Kesavan
    Mutasa, Simukayi
    Moore, Fletcher
    Sulam, Jeremias
    Yi, Paul H.
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (05): : 2490 - 2499
  • [4] Weakly Supervised Object Detection via Object-Specific Pixel Gradient
    Shen, Yunhang
    Ji, Rongrong
    Wang, Changhu
    Li, Xi
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) : 5960 - 5970
  • [5] Improving Interpretability for Computer-Aided Diagnosis Tools on Whole Slide Imaging with Multiple Instance Learning and Gradient-Based Explanations
    Pirovano, Antoine
    Heuberger, Hippolyte
    Berlemont, Sylvain
    Ladjal, Said
    Bloch, Isabelle
    INTERPRETABLE AND ANNOTATION-EFFICIENT LEARNING FOR MEDICAL IMAGE COMPUTING, IMIMIC 2020, MIL3ID 2020, LABELS 2020, 2020, 12446 : 43 - 53
  • [6] Training-Based Methods for Comparison of Object Detection Methods for Visual Object Tracking
    Delforouzi, Ahmad
    Pamarthi, Bhargav
    Grzegorzek, Marcin
    SENSORS, 2018, 18 (11)
  • [7] Review of object instance segmentation based on deep learning
    Tian, Di
    Han, Yi
    Wang, Biyao
    Guan, Tian
    Gu, Hengzhi
    Wei, Wei
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [8] Object Detection via Gradient-Based Mask R-CNN Using Machine Learning Algorithms
    Xavier, Alphonse Inbaraj
    Villavicencio, Charlyn
    Macrohon, Julio Jerison
    Jeng, Jyh-Horng
    Hsieh, Jer-Guang
    MACHINES, 2022, 10 (05)
  • [9] Object localization, discrimination, and grasping with the optic nerve visual prosthesis
    Duret, Florence
    Brelen, Marten E.
    Lambert, Valerie
    Gerard, Benoit
    Delbeke, Jean
    Veraart, Claude
    RESTORATIVE NEUROLOGY AND NEUROSCIENCE, 2006, 24 (01) : 31 - 40
  • [10] Camouflaged Object Detection via Complementary Information-Selected Network Based on Visual and Semantic Separation
    Yin, Chao
    Yang, Kequan
    Li, Jide
    Li, Xiaoqiang
    Wu, Yifan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (11) : 12871 - 12881