Detecting Visual Relationships with Deep Relational Networks

被引:359
作者
Dai, Bo [1 ]
Zhang, Yuqi [1 ]
Lin, Dahua [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Hong Kong, Peoples R China
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
关键词
LOCATION; OBJECTS;
D O I
10.1109/CVPR.2017.352
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relationships among objects play a crucial role in image understanding. Despite the great success of deep learning techniques in recognizing individual objects, reasoning about the relationships among objects remains a challenging task. Previous methods often treat this as a classification problem, considering each type of relationship (e.g. "ride") or each distinct visual phrase (e.g. "person-ride-horse") as a category. Such approaches are faced with significant difficulties caused by the high diversity of visual appearance for each kind of relationships or the large number of distinct visual phrases. We propose an integrated framework to tackle this problem. At the heart of this framework is the Deep Relational Network, a novel formulation designed specifically for exploiting the statistical dependencies between objects and their relationships. On two large data sets, the proposed method achieves substantial improvement over state-of-the-art.
引用
收藏
页码:3298 / 3308
页数:11
相关论文
共 64 条
[1]  
Aditya Somak, 2015, ARXIV151103292
[2]  
[Anonymous], 2016, VISUAL GENOME CONNEC
[3]  
[Anonymous], ARXIV160800187
[4]  
[Anonymous], EUR C COMP VIS
[5]  
[Anonymous], P NEURAL INFORM PROC
[6]  
[Anonymous], COMP VIS PATT REC CV
[7]  
[Anonymous], ARXIV151106350
[8]  
[Anonymous], 2015, ARXIV151103745
[9]  
[Anonymous], P IEEE INT C COMP VI
[10]  
[Anonymous], 2016, ARXIV PREPRINT ARXIV