Scene understanding using natural language description based on 3D semantic graph map

被引：0

作者：

Jiyoun Moon

Beomhee Lee

机构：

[1] Seoul National University,Automation and Systems Research Institute, Department of Electrical Engineering

来源：

Intelligent Service Robotics | 2018年 / 11卷

关键词：

Scene understanding; Natural language description; 3D semantic graph map;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A natural language description for working environment understanding is an important component in human–robot communication. Although 3D semantic graph mappings are widely studied for perceptual aspects of the environment, these approaches hardly apply to the communication issues such as natural language descriptions for a semantic graph map. There are many researches on workspace understanding over images in the field of computer vision, which automatically generate sentences while they usually never utilize multiple scenes and 3D information. In this paper, we introduce a novel natural language description method using 3D semantic graph map. An object-oriented semantic graph map is first constructed using 3D information. A graph convolutional neural network and a recurrent neural network are then used to generate a description of the map. A natural language sentence focusing on objects over 3D semantic graph map can be eventually generated consisting of a single scene or multiple scenes. We validate the proposed method using publicly available dataset and compare it with conventional methods.

引用

页码：347 / 354

页数：7

共 27 条

[1] Scene understanding using natural language description based on 3D semantic graph map
Moon, Jiyoun
Lee, Beomhee
INTELLIGENT SERVICE ROBOTICS, 2018, 11 (04) : 347 - 354
[2] 3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map
Bhattacharyya, Prarthana
Gu, Yanlei
Bao, Jiali
Liu, Xu
Kamijo, Shunsuke
2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017,
[3] Unbiased 3D Semantic Scene Graph Prediction in Point Cloud Using Deep Learning
Han, Chaolin
Li, Hongwei
Xu, Jian
Dong, Bing
Wang, Yalin
Zhou, Xiaowen
Zhao, Shan
APPLIED SCIENCES-BASEL, 2023, 13 (09):
[4] Scene Understanding and Semantic Mapping for Unmanned Ground Vehicles Using 3D Point Clouds
Yan, Fei
He, Guojian
Zhuang, Yan
Chang, Huan
2018 8TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST 2018), 2018, : 341 - 347
[5] Indoor Semantic Scene Understanding Using 2D-3D Fusion
Gopinathan, Muraleekrishna
Truong, Giang
Abu-Khalaf, Jumana
2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 133 - 140
[6] 3D Semantic Scene Completion: A Survey
Luis Roldão
Raoul de Charette
Anne Verroust-Blondet
International Journal of Computer Vision, 2022, 130 : 1978 - 2005
[7] 3D Semantic Scene Completion: A Survey
Roldao, Luis
de Charette, Raoul
Verroust-Blondet, Anne
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (08) : 1978 - 2005
[8] NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
Zhai, Hongjia
Huang, Gan
Hu, Qirui
Li, Guanglin
Bao, Hujun
Zhang, Guofeng
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (11) : 7129 - 7139
[9] 3D scene graph prediction from point clouds
Wu F.
Yan F.
Shi W.
Zhou Z.
Virtual Reality and Intelligent Hardware, 2022, 4 (01): : 76 - 88
[10] Explore Contextual Information for 3D Scene Graph Generation
Liu, Yuanyuan
Long, Chengjiang
Zhang, Zhaoxuan
Liu, Bokai
Zhang, Qiang
Yin, Baocai
Yang, Xin
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (12) : 5556 - 5568

← 1 2 3 →