Three-dimensional feature extraction using local reference frame for detecting human-object interaction

被引：0

作者：

Rezaei, Mansoureh ^{[1
]}

Rezaeian, Mehdi ^{[1
]}

机构：

[1] Yazd Univ, Comp Engn Dept, Yazd, Iran

来源：

JOURNAL OF ELECTRONIC IMAGING | 2022年 / 31卷 / 04期

关键词：

human-object interaction; two-dimensional information; three-dimensional attributes; face transformation; viewing angle; object position;

D O I：

10.1117/1.JEI.31.4.043046

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

To understand the visual world, a device knows not only the instances but also how they interact. Humans are at the center of such interactions. Detection of human-object interaction (HOI) is one of the growing research fields in computer vision. However, identifying HOIs due to the large label space of verbs and their interaction with various object types still needs much research. We focus on HOIs in images, which is necessary for a deeper understanding of the scene. In addition to two-dimensional (2D) information, such as the appearance of humans and objects and their spatial location, three-dimensional (3D) status, especially in the configuration of the human body and object as well as their location and spatial, can play an important role in learning HOI. The mapping of 2D to 3D world adds depth information to the problem. These issues led us to collect 3D information along with the 2D features of the images to provide more accurate results. We show 3D attributes, such as face transformation, the viewing angle, the position of an object, and its related location to the human face, can improve HOI learning. The results of experiments on large-scale data show that our method has been able to improve the outcome of interactions.

引用

页数：11

共 33 条

[1]

[Anonymous], 2018, ADV NEUR IN

[2] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[3] Learning to Detect Human-Object Interactions [J].

Chao, Yu-Wei ;

Liu, Yunfan ;

Liu, Xieyang ;

Zeng, Huayi ;

Deng, Jia .

2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :381-389

[4] HICO: A Benchmark for Recognizing Human-Object Interactions in Images [J].

Chao, Yu-Wei ;

Wang, Zhan ;

He, Yugeng ;

Wang, Jiaxuan ;

Deng, Jia .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1017-1025

[5] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[6]

Delaitre Vincent., 2011, ADV NEURAL INFORM PR

[7]

Desai C, 2012, LECT NOTES COMPUT SC, V7575, P158, DOI 10.1007/978-3-642-33765-9_12

[8]

Gao C., 2018, arXiv

[9] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[10] Detecting and Recognizing Human-Object Interactions [J].

Gkioxari, Georgia ;

Girshick, Ross ;

Dollar, Piotr ;

He, Kaiming .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8359-8367

← 1 2 3 4 →