Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under Occlusion

Cited by: 51
Authors
Kortylewski, Adam [1]
Liu, Qing [1]
Wang, Angtian [1]
Sun, Yihong [1]
Yuille, Alan [1]
Affiliations
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
Funding
Swiss National Science Foundation;
Keywords
Compositional models; Robustness to partial occlusion; Image classification; Object detection; Out-of-distribution generalization;
DOI
10.1007/s11263-020-01401-3
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Computer vision systems in real-world applications need to be robust to partial occlusion while also being explainable. In this work, we show that black-box deep convolutional neural networks (DCNNs) have only limited robustness to partial occlusion. We overcome these limitations by unifying DCNNs with part-based models into Compositional Convolutional Neural Networks (CompositionalNets), an interpretable deep architecture with innate robustness to partial occlusion. Specifically, we propose to replace the fully connected classification head of DCNNs with a differentiable compositional model that can be trained end-to-end. The structure of the compositional model enables CompositionalNets to decompose images into objects and context, as well as to further decompose object representations in terms of individual parts and the object's pose. The generative nature of our compositional model enables it to localize occluders and to recognize objects based on their non-occluded parts. We conduct extensive experiments on image classification and object detection using images of artificially occluded objects from the PASCAL3D+ and ImageNet datasets, and real images of partially occluded vehicles from the MS-COCO dataset. Our experiments show that CompositionalNets made from several popular DCNN backbones (VGG-16, ResNet50, ResNeXt) improve by a large margin over their non-compositional counterparts at classifying and detecting partially occluded objects. Furthermore, they can localize occluders accurately despite being trained with class-level supervision only. Finally, we demonstrate that CompositionalNets provide human-interpretable predictions, as their individual components can be understood as detecting parts and estimating an object's viewpoint.
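The idea of recognizing an object from its non-occluded parts while localizing occluders can be illustrated with a small sketch. This is a simplified illustration under stated assumptions, not the authors' implementation: it assumes that per-position log-likelihoods under each class model and under a generic occluder model are already available, and the names `classify_under_occlusion`, `object_loglik`, and `occluder_loglik` are hypothetical. At each position the better-fitting of the two models explains the feature, so occluded positions do not penalize the correct class.

```python
from typing import Dict, List, Optional, Tuple


def classify_under_occlusion(
    object_loglik: Dict[str, List[float]],
    occluder_loglik: List[float],
) -> Tuple[Optional[str], List[bool]]:
    """Return the best-scoring class and a per-position occlusion mask.

    object_loglik[c][p] : assumed log-likelihood of the feature at
                          position p under the model of class c
    occluder_loglik[p]  : assumed log-likelihood of the same feature
                          under a generic occluder model
    """
    best_class: Optional[str] = None
    best_score = float("-inf")
    best_mask: List[bool] = []
    for cls, scores in object_loglik.items():
        # Each position is explained by whichever model fits better.
        total = sum(max(s, o) for s, o in zip(scores, occluder_loglik))
        if total > best_score:
            # A position counts as occluded when the occluder model wins.
            mask = [o > s for s, o in zip(scores, occluder_loglik)]
            best_class, best_score, best_mask = cls, total, mask
    return best_class, best_mask


# Toy example with four positions; the last two are covered by an occluder.
object_loglik = {
    "car": [-1.0, -1.2, -9.0, -8.5],
    "bus": [-4.0, -4.5, -9.5, -9.0],
}
occluder_loglik = [-5.0, -5.0, -3.0, -3.0]

label, mask = classify_under_occlusion(object_loglik, occluder_loglik)
print(label, mask)
```

In this toy input the class decision rests on the two visible positions, and the mask marks the two positions that the occluder model explains better.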
Pages: 736 - 760
Number of pages: 25
Related papers
50 in total
  • [1] Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under Occlusion
    Adam Kortylewski
    Qing Liu
    Angtian Wang
    Yihong Sun
    Alan Yuille
    International Journal of Computer Vision, 2021, 129 : 736 - 760
  • [2] Convolutional Neural Networks for Object Recognition on Mobile Devices: a Case Study
    Tobias, Luis
    Ducournau, Aurelien
    Rousseau, Francois
    Mercier, Gregoire
    Fablet, Ronan
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3530 - 3535
  • [3] Object Detection Using Convolutional Neural Networks
    Galvez, Reagan L.
    Bandala, Argel A.
    Dadios, Elmer P.
    Vicerra, Ryan Rhay P.
    Maningo, Jose Martin Z.
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 2023 - 2027
  • [4] Two-Stage Ensemble of Deep Convolutional Neural Networks for Object Recognition
    Uddamvathanak, Rom
    Yang, Feng
    Yang, Xulei
    Das, Ankit Kumar
    Shen, Yan
    Salahuddin, Mohamed
    Hussain, Shaista
    Chawla, Shailey
    2018 INTERNATIONAL CONFERENCE ON INTELLIGENT RAIL TRANSPORTATION (ICIRT), 2018,
  • [5] Parallel Convolutional Neural Networks for Object Detection
    Olugboja, Adedeji
    Wang, Zenghui
    Sun, Yanxia
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2021, 12 (04) : 279 - 286
  • [6] Robust object proposals re-ranking for object detection in autonomous driving using convolutional neural networks
    Pham, Cuong Cao
    Jeon, Jae Wook
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 53 : 110 - 122
  • [7] ARCHITECTURE RECOGNITION BY MEANS OF CONVOLUTIONAL NEURAL NETWORKS
    Andrianaivo, Louis N.
    D'Autilia, Roberto
    Palma, Valerio
    27TH CIPA INTERNATIONAL SYMPOSIUM: DOCUMENTING THE PAST FOR A BETTER FUTURE, 2019, 42-2 (W15): : 77 - 84
  • [8] Object Detection and Recognition in Remote Sensing Images by Employing a Hybrid Generative Adversarial Networks and Convolutional Neural Networks
    Deshmukh, Araddhana Arvind
    Kumari, Mamta
    Krishnaiah, V. V. Jaya Rama
    Bandhekar, Shweta
    Dharani, R.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 621 - 632
  • [9] Driving posture recognition by convolutional neural networks
    Yan, Chao
    Coenen, Frans
    Zhang, Bailing
    IET COMPUTER VISION, 2016, 10 (02) : 103 - 114
  • [10] Ensemble Convolutional Neural Networks for Face Recognition
    Cheng, Wen-Chang
    Wu, Tin-Yu
    Li, Dai-Wei
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,