Knowledge-Embedded Routing Network for Scene Graph Generation

被引：271

作者：

Chen, Tianshui ^{[1
,2
]}

Yu, Weihao ^{[1
]}

Chen, Riquan ^{[1
]}

Lin, Liang ^{[1
,2
]}

机构：

[1] Sun Yat Sen Univ, Guangzhou, Peoples R China

[2] DarkMatter AI Res, Beijing, Peoples R China

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR.2019.00632

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To understand a scene in depth not only involves locating/recognizing individual objects, but also requires to infer the relationships and interactions among them. However, since the distribution of real-world relationships is seriously unbalanced, existing methods perform quite poorly for the less frequent relationships. In this work, we find that the statistical correlations between object pairs and their relationships can effectively regularize semantic space and make prediction less ambiguous, and thus well address the unbalanced distribution issue. To achieve this, we incorporate these statistical correlations into deep neural networks to facilitate scene graph generation by developing a Knowledge-Embedded Routing Network. More specifically, we show that the statistical correlations between objects appearing in images and their relationships, can be explicitly represented by a structured knowledge graph, and a routing mechanism is learned to propagate messages through the graph to explore their interactions. Extensive experiments on the large-scale Visual Genome dataset demonstrate the superiority of the proposed method over current state-of-the-art competitors.

引用

页码：6156 / 6164

页数：9

共 34 条

[1]

[Anonymous], 2017, ARXIV171106640

[2]

[Anonymous], 2017, ARXIV170205448

[3]

Chen T., 2018, TMM

[4]

Chen TS, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P627

[5]

Cho K., 2014, P SSST8 8 WORKSH SYN, P103, DOI 10.3115/v1/w14-4012

[6] Understanding Indoor Scenes using 3D Geometric Phrases [J].

Choi, Wongun ;

Chao, Yu-Wei ;

Pantofaru, Caroline ;

Savarese, Silvio .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :33-40

[7] Detecting Visual Relationships with Deep Relational Networks [J].

Dai, Bo ;

Zhang, Yuqi ;

Lin, Dahua .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3298-3308

[8]

Deng J, 2014, LECT NOTES COMPUT SC, V8689, P48, DOI 10.1007/978-3-319-10590-1_4

[9]

Fang Y, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1661

[10]

Galleguillos C, 2008, PROC CVPR IEEE, P3552

← 1 2 3 4 →