KNOWLEDGE-BASED REASONING NETWORK FOR OBJECT DETECTION

Cited by: 5

Authors:

Zhang, Huigang [1]

Wang, Liuan [1]

Sun, Jun [1]

Affiliations:

[1] Fujitsu R&D Ctr, Beijing, Peoples R China

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021

Keywords:

Object detection; commonsense knowledge; reasoning; GAT; COCO detection benchmark;

DOI:

10.1109/ICIP42928.2021.9506228

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Mainstream object detection algorithms recognize object instances individually and do not consider the high-level relationships among objects in context. This inevitably biases detection results, because such detectors lack the commonsense knowledge that humans often use to help identify objects. In this paper, we present a novel reasoning module that endows current detection systems with commonsense knowledge. Specifically, we use a graph attention network (GAT) to represent the knowledge among objects, covering both visual and semantic relations. Iterative GAT updates enrich the object features. Experiments on the COCO detection benchmark indicate that our knowledge-based reasoning network consistently improves various CNN detectors: with a ResNet50-FPN backbone, it achieves 1.9 and 1.8 points higher Average Precision (AP) than Faster-RCNN and Mask-RCNN, respectively.
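
The abstract gives no equations, so the following is only a minimal sketch of a generic single-head graph attention update applied iteratively to per-object feature vectors. It is not the authors' implementation. The function names `gat_layer` and `reason`, the toy adjacency matrix, the weight sharing across steps, the `tanh` non-linearity, and the final concatenation of original and refined features are all assumptions made for illustration.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x


def gat_layer(feats, adj, W, a):
    """One single-head graph attention update (illustrative only).

    feats: list of N feature vectors, one per detected object
    adj:   N x N 0/1 matrix; adj[i][j] == 1 if j is a neighbour of i
           (self-loops included, so every node has at least one neighbour)
    W:     dim x dim shared projection matrix
    a:     attention vector of length 2 * dim
    """
    proj = [matvec(W, h) for h in feats]
    out = []
    for i, hi in enumerate(proj):
        nbrs = [j for j in range(len(proj)) if adj[i][j]]
        # Unnormalised scores e_ij = LeakyReLU(a^T [W h_i || W h_j])
        scores = [
            leaky_relu(sum(p * q for p, q in zip(a, hi + proj[j])))
            for j in nbrs
        ]
        # Softmax over the neighbourhood, shifted by the max for stability
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        alphas = [e / z for e in exps]
        # Attention-weighted sum of neighbour features, then tanh (assumed)
        agg = [
            sum(al * proj[j][k] for al, j in zip(alphas, nbrs))
            for k in range(len(hi))
        ]
        out.append([math.tanh(v) for v in agg])
    return out


def reason(feats, adj, W, a, steps=2):
    """Refine features for `steps` rounds, then append them to the originals."""
    h = feats
    for _ in range(steps):
        h = gat_layer(h, adj, W, a)  # same W and a reused each step (assumed)
    return [orig + refined for orig, refined in zip(feats, h)]


# Toy example: three objects with 4-dimensional features and random weights.
random.seed(0)
dim = 4
feats = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(3)]
adj = [[1, 1, 0], [1, 1, 1], [0, 1, 1]]  # hypothetical relation graph
W = [[random.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(dim)]
a = [random.uniform(-0.5, 0.5) for _ in range(2 * dim)]
enriched = reason(feats, adj, W, a, steps=2)
print(len(enriched), len(enriched[0]))
```

In a real detector the adjacency would presumably come from the visual and semantic relations mentioned in the abstract, and the enriched features would feed the classification and box regression heads. Both points are guesses about the design, not details stated in this record.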

Citation

Pages: 1579-1583

Page count: 5

