Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination

被引:2
|
作者
Zhao, Chenyang [1 ]
Hsiao, Janet H. [2 ]
Chan, Antoni B. [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Div Social Sci, Hong Kong, Peoples R China
关键词
Detectors; Visualization; Heat maps; Task analysis; Object detection; Predictive models; Transformers; Deep learning; explainable AI; explaining object detection; gradient-based explanation; human eye gaze; instance-level explanation; knowledge distillation; non-maximum suppression; object discrimination; object specification; NMS;
D O I
10.1109/TPAMI.2024.3380604
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visual explanation technique for interpreting the predictions of object detectors. Utilizing the gradients of detector targets flowing into the intermediate feature maps, ODAM produces heat maps that show the influence of regions on the detector's decision for each predicted attribute. Compared to previous works on classification activation maps (CAM), ODAM generates instance-specific explanations rather than class-specific ones. We show that ODAM is applicable to one-stage, two-stage, and transformer-based detectors with different types of detector backbones and heads, and produces higher-quality visual explanations than the state-of-the-art in terms of both effectiveness and efficiency. We discuss two explanation tasks for object detection: 1) object specification: what is the important region for the prediction? 2) object discrimination: which object is detected? Aiming at these two aspects, we present a detailed analysis of the visual explanations of detectors and carry out extensive experiments to validate the effectiveness of the proposed ODAM. Furthermore, we investigate user trust on the explanation maps, how well the visual explanations of object detectors agrees with human explanations, as measured through human eye gaze, and whether this agreement is related with user trust. Finally, we also propose two applications, ODAM-KD and ODAM-NMS, based on these two abilities of ODAM. ODAM-KD utilizes the object specification of ODAM to generate top-down attention for key predictions and instruct the knowledge distillation of object detection. ODAM-NMS considers the location of the model's explanation for each prediction to distinguish the duplicate detected objects. A training scheme, ODAM-Train, is proposed to improve the quality on object discrimination, and help with ODAM-NMS.
引用
收藏
页码:5967 / 5985
页数:19
相关论文
共 50 条
  • [31] Object Detection Based on Hierarchical Visual perception mechanism
    Dou, Hao
    Deng, Qianqian
    Mao, Jiaxing
    MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
  • [32] Visual Object Detection and Tracking for Internet of Things Devices Based on Spatial Attention Powered Multidomain Network
    Gao, Haining
    Yu, Lei
    Khan, Imran Ali
    Wang, Yinling
    Yang, Yong
    Shen, Hongdan
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (04) : 2811 - 2820
  • [33] Object Detection Model Based on Dual Visual Pathway
    Chen, Ruixuan
    Yao, Xiao
    Zeng, Yifeng
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 615 - 619
  • [34] Visual SLAM in dynamic environments based on object detection
    Ai, Yong-bao
    Rui, Ting
    Yang, Xiao-qiang
    He, Jia-lin
    Fu, Lei
    Li, Jian-bin
    Lu, Ming
    DEFENCE TECHNOLOGY, 2021, 17 (05) : 1712 - 1721
  • [35] Few-Shot Cross-Domain Object Detection With Instance-Level Prototype-Based Meta-Learning
    Zhang, Lin
    Zhang, Bo
    Shi, Botian
    Fan, Jiayuan
    Chen, Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9078 - 9089
  • [36] Positive-gradient-weighted object activation mapping: visual explanation of object detector towards precise colorectal-polyp localisation
    Hayato Itoh
    Masashi Misawa
    Yuichi Mori
    Shin-Ei Kudo
    Masahiro Oda
    Kensaku Mori
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 2051 - 2063
  • [37] Positive-gradient-weighted object activation mapping: visual explanation of object detector towards precise colorectal-polyp localisation
    Itoh, Hayato
    Misawa, Masashi
    Mori, Yuichi
    Kudo, Shin-Ei
    Oda, Masahiro
    Mori, Kensaku
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (11) : 2051 - 2063
  • [38] Unsupervised Object Detection using Patch Based Image Classifier and Gradient Importance Map
    Vanita Jain
    Manu S. Pillai
    Achin Jain
    Arun Kumar Dubey
    International Journal of Information Technology, 2025, 17 (4) : 2407 - 2416
  • [39] Re-identification framework for long term visual object tracking based on object detection and classification
    Nousi, Paraskevi
    Triantafyllidou, Danai
    Tefas, Anastasios
    Pitas, Ioannis
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 88 (88)
  • [40] Interactive Semantic Map Representation for Skill-Based Visual Object Navigation
    Zemskova, Tatiana
    Staroverov, Aleksei
    Muravyev, Kirill
    Yudin, Dmitry A.
    Panov, Aleksandr I.
    IEEE ACCESS, 2024, 12 : 44628 - 44639