Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under Occlusion

被引：51

作者：

Kortylewski, Adam ^{[1
]}

Liu, Qing ^{[1
]}

Wang, Angtian ^{[1
]}

Sun, Yihong ^{[1
]}

Yuille, Alan ^{[1
]}

机构：

[1] Johns Hopkins Univ, Baltimore, MD 21218 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2021年 / 129卷 / 03期

基金：

瑞士国家科学基金会;

关键词：

Compositional models; Robustness to partial occlusion; Image classification; Object detection; Out-of-distribution generalization;

D O I：

10.1007/s11263-020-01401-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Computer vision systems in real-world applications need to be robust to partial occlusion while also being explainable. In this work, we show that black-box deep convolutional neural networks (DCNNs) have only limited robustness to partial occlusion. We overcome these limitations by unifying DCNNs with part-based models into Compositional Convolutional Neural Networks (CompositionalNets)-an interpretable deep architecture with innate robustness to partial occlusion. Specifically, we propose to replace the fully connected classification head of DCNNs with a differentiable compositional model that can be trained end-to-end. The structure of the compositional model enables CompositionalNets to decompose images into objects and context, as well as to further decompose object representations in terms of individual parts and the objects' pose. The generative nature of our compositional model enables it to localize occluders and to recognize objects based on their non-occluded parts. We conduct extensive experiments in terms of image classification and object detection on images of artificially occluded objects from the PASCAL3D+ and ImageNet dataset, and real images of partially occluded vehicles from the MS-COCO dataset. Our experiments show that CompositionalNets made from several popular DCNN backbones (VGG-16, ResNet50, ResNext) improve by a large margin over their non-compositional counterparts at classifying and detecting partially occluded objects. Furthermore, they can localize occluders accurately despite being trained with class-level supervision only. Finally, we demonstrate that CompositionalNets provide human interpretable predictions as their individual components can be understood as detecting parts and estimating an objects' viewpoint.

引用

页码：736 / 760

页数：25

共 50 条

[1] Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under Occlusion
Adam Kortylewski
Qing Liu
Angtian Wang
Yihong Sun
Alan Yuille
International Journal of Computer Vision, 2021, 129 : 736 - 760
[2] Convolutional Neural Networks for Object Recognition on Mobile Devices: a Case Study
Tobias, Luis
Ducournau, Aurelien
Rousseau, Francois
Mercier, Gregoire
Fablet, Ronan
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3530 - 3535
[3] Object Detection Using Convolutional Neural Networks
Galvez, Reagan L.
Bandala, Argel A.
Dadios, Elmer P.
Vicerra, Ryan Rhay P.
Maningo, Jose Martin Z.
PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 2023 - 2027
[4] Two-Stage Ensemble of Deep Convolutional Neural Networks for Object Recognition
Uddamvathanak, Rom
Yang, Feng
Yang, Xulei
Das, Ankit Kumar
Shen, Yan
Salahuddin, Mohamed
Hussain, Shaista
Chawla, Shailey
2018 INTERNATIONAL CONFERENCE ON INTELLIGENT RAIL TRANSPORTATION (ICIRT), 2018,
[5] Parallel Convolutional Neural Networks for Object Detection
Olugboja, Adedeji
Wang, Zenghui
Sun, Yanxia
JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2021, 12 (04) : 279 - 286
[6] Robust object proposals re-ranking for object detection in autonomous driving using convolutional neural networks
Pham, Cuong Cao
Jeon, Jae Wook
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 53 : 110 - 122
[7] ARCHITECTURE RECOGNITION BY MEANS OF CONVOLUTIONAL NEURAL NETWORKS
Andrianaivo, Louis N.
D'Autilia, Roberto
Palma, Valerio
27TH CIPA INTERNATIONAL SYMPOSIUM: DOCUMENTING THE PAST FOR A BETTER FUTURE, 2019, 42-2 (W15): : 77 - 84
[8] Object Detection and Recognition in Remote Sensing Images by Employing a Hybrid Generative Adversarial Networks and Convolutional Neural Networks
Deshmukh, Araddhana Arvind
Kumari, Mamta
Krishnaiah, V. V. Jaya Rama
Bandhekar, Shweta
Dharani, R.
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 621 - 632
[9] Driving posture recognition by convolutional neural networks
Yan, Chao
Coenen, Frans
Zhang, Bailing
IET COMPUTER VISION, 2016, 10 (02) : 103 - 114
[10] Ensemble Convolutional Neural Networks for Face Recognition
Cheng, Wen-Chang
Wu, Tin-Yu
Li, Dai-Wei
2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,

← 1 2 3 4 5 →