Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination

Cited by: 2
Authors
Zhao, Chenyang [1]
Hsiao, Janet H. [2]
Chan, Antoni B. [1]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Div Social Sci, Hong Kong, Peoples R China
Keywords
Detectors; Visualization; Heat maps; Task analysis; Object detection; Predictive models; Transformers; Deep learning; explainable AI; explaining object detection; gradient-based explanation; human eye gaze; instance-level explanation; knowledge distillation; non-maximum suppression; object discrimination; object specification; NMS;
DOI
10.1109/TPAMI.2024.3380604
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visual explanation technique for interpreting the predictions of object detectors. Utilizing the gradients of detector targets flowing into the intermediate feature maps, ODAM produces heat maps that show the influence of regions on the detector's decision for each predicted attribute. Compared to previous works on classification activation maps (CAM), ODAM generates instance-specific explanations rather than class-specific ones. We show that ODAM is applicable to one-stage, two-stage, and transformer-based detectors with different types of detector backbones and heads, and produces higher-quality visual explanations than the state-of-the-art in terms of both effectiveness and efficiency. We discuss two explanation tasks for object detection: 1) object specification: what is the important region for the prediction? 2) object discrimination: which object is detected? Aiming at these two aspects, we present a detailed analysis of the visual explanations of detectors and carry out extensive experiments to validate the effectiveness of the proposed ODAM. Furthermore, we investigate user trust in the explanation maps, how well the visual explanations of object detectors agree with human explanations, as measured through human eye gaze, and whether this agreement is related to user trust. Finally, we also propose two applications, ODAM-KD and ODAM-NMS, based on these two abilities of ODAM. ODAM-KD utilizes the object specification of ODAM to generate top-down attention for key predictions and instruct the knowledge distillation of object detection. ODAM-NMS considers the location of the model's explanation for each prediction to distinguish the duplicate detected objects. A training scheme, ODAM-Train, is proposed to improve the quality of object discrimination and to help with ODAM-NMS.
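The abstract describes heat maps built from the gradients of a detector target with respect to intermediate feature maps, but it does not give the formula. The following is a minimal sketch of one way such a gradient-weighted map could be computed. It assumes an element-wise product of gradients and features, summed over channels and passed through a ReLU; this combination rule is an assumption, not a statement of the authors' method. The function name `gradient_weighted_heatmap`, the toy feature tensor, and the linear "detector score" are all hypothetical and exist only for illustration.

```python
import numpy as np


def gradient_weighted_heatmap(features, grads):
    """Combine feature maps and gradients into a single heat map.

    features, grads: arrays of shape (K, H, W), where grads holds the
    gradient of one scalar detector target (for one predicted instance)
    with respect to each feature-map element.
    Returns an (H, W) map scaled to [0, 1].
    """
    if features.shape != grads.shape:
        raise ValueError("features and grads must have the same shape")
    # Assumed combination rule: element-wise weighting, channel sum, ReLU.
    heat = np.maximum((grads * features).sum(axis=0), 0.0)
    peak = heat.max()
    return heat / peak if peak > 0 else heat


# Toy stand-in for a detector (hypothetical): the score of one instance is
# linear in the features, score = sum(weights * features), so the gradient
# of the score with respect to the features is simply `weights`.
rng = np.random.default_rng(0)
features = rng.random((4, 8, 8))      # K=4 channels on an 8x8 grid
weights = np.zeros((4, 8, 8))
weights[:, 2:4, 5:7] = 1.0            # this instance depends on one region only

heatmap = gradient_weighted_heatmap(features, weights)
peak_location = np.unravel_index(heatmap.argmax(), heatmap.shape)
print("peak at (row, col):", peak_location)
```

Because the gradient is taken for the target of a single prediction, the resulting map highlights only the region that prediction depends on, which is the sense in which such a map is instance-specific rather than class-specific. In a real detector the gradients would come from automatic differentiation rather than a hand-written weight mask.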
Pages: 5967-5985
Page count: 19