Detecting Visual Relationships Using Box Attention

被引：35

作者：

Kolesnikov, Alexander ^{[1
,2
]}

Kuznetsova, Alina ^{[1
]}

Lampert, Christoph H. ^{[2
]}

Ferrari, Vittorio ^{[1
]}

机构：

[1] Google Res, Mountain View, CA 94043 USA

[2] IST Austria, Klosterneuburg, Austria

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019年

关键词：

D O I：

10.1109/ICCVW.2019.00217

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a new modelfor detecting visual relationships, such as "person riding motorcycle" or "bottle on table". This task is an important step towards comprehensive structured image understanding, going beyond detecting individual objects. Our main novelty is a Box Attention mechanism that allows to model pairwise interactions between objects using standard object detection pipelines. The resulting model is conceptually clean, expressive and relies on welljustified training and prediction procedures. Moreover, unlike previously proposed approaches, our model does not introduce any additional complex components or hyperparameters on top of those already required by the underlying detection model. We conduct an experimental evaluation on two datasets, V-COCO and Open Images, demonstrating strong quantitative and qualitative results.

引用

页码：1749 / 1753

页数：5

共 26 条

[1] [Anonymous], EUR C COMP VIS
[2] [Anonymous], 2015, Arxiv.Org, DOI DOI 10.3389/FPSYG.2013.00124
[3] [Anonymous], COMP VIS PATT REC CV
[4] Detecting Visual Relationships with Deep Relational Networks
Dai, Bo
Zhang, Yuqi
Lin, Dahua
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3298 - 3308
[5] Desai Chaitanya., 2012, ECCV
[6] The PASCAL Visual Object Classes Challenge: A Retrospective
Everingham, Mark
Eslami, S. M. Ali
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
[7] Gao C., 2018, P BRIT MACH VIS C
[8] Detecting and Recognizing Human-Object Interactions
Gkioxari, Georgia
Girshick, Ross
Dollar, Piotr
He, Kaiming
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8359 - 8367
[9] Gupta A., 2009, IEEE T PAMI
[10] Gupta Saurabh, 2015, ARXIV150504474

← 1 2 3 →