Scene understanding using natural language description based on 3D semantic graph map

被引:0
|
作者
Jiyoun Moon
Beomhee Lee
机构
[1] Seoul National University,Automation and Systems Research Institute, Department of Electrical Engineering
来源
Intelligent Service Robotics | 2018年 / 11卷
关键词
Scene understanding; Natural language description; 3D semantic graph map;
D O I
暂无
中图分类号
学科分类号
摘要
A natural language description for working environment understanding is an important component in human–robot communication. Although 3D semantic graph mappings are widely studied for perceptual aspects of the environment, these approaches hardly apply to the communication issues such as natural language descriptions for a semantic graph map. There are many researches on workspace understanding over images in the field of computer vision, which automatically generate sentences while they usually never utilize multiple scenes and 3D information. In this paper, we introduce a novel natural language description method using 3D semantic graph map. An object-oriented semantic graph map is first constructed using 3D information. A graph convolutional neural network and a recurrent neural network are then used to generate a description of the map. A natural language sentence focusing on objects over 3D semantic graph map can be eventually generated consisting of a single scene or multiple scenes. We validate the proposed method using publicly available dataset and compare it with conventional methods.
引用
收藏
页码:347 / 354
页数:7
相关论文
共 27 条
  • [1] Scene understanding using natural language description based on 3D semantic graph map
    Moon, Jiyoun
    Lee, Beomhee
    INTELLIGENT SERVICE ROBOTICS, 2018, 11 (04) : 347 - 354
  • [2] 3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map
    Bhattacharyya, Prarthana
    Gu, Yanlei
    Bao, Jiali
    Liu, Xu
    Kamijo, Shunsuke
    2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017,
  • [3] Unbiased 3D Semantic Scene Graph Prediction in Point Cloud Using Deep Learning
    Han, Chaolin
    Li, Hongwei
    Xu, Jian
    Dong, Bing
    Wang, Yalin
    Zhou, Xiaowen
    Zhao, Shan
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [4] Scene Understanding and Semantic Mapping for Unmanned Ground Vehicles Using 3D Point Clouds
    Yan, Fei
    He, Guojian
    Zhuang, Yan
    Chang, Huan
    2018 8TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST 2018), 2018, : 341 - 347
  • [5] Indoor Semantic Scene Understanding Using 2D-3D Fusion
    Gopinathan, Muraleekrishna
    Truong, Giang
    Abu-Khalaf, Jumana
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 133 - 140
  • [6] 3D Semantic Scene Completion: A Survey
    Luis Roldão
    Raoul de Charette
    Anne Verroust-Blondet
    International Journal of Computer Vision, 2022, 130 : 1978 - 2005
  • [7] 3D Semantic Scene Completion: A Survey
    Roldao, Luis
    de Charette, Raoul
    Verroust-Blondet, Anne
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (08) : 1978 - 2005
  • [8] NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
    Zhai, Hongjia
    Huang, Gan
    Hu, Qirui
    Li, Guanglin
    Bao, Hujun
    Zhang, Guofeng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (11) : 7129 - 7139
  • [9] 3D scene graph prediction from point clouds
    Wu F.
    Yan F.
    Shi W.
    Zhou Z.
    Virtual Reality and Intelligent Hardware, 2022, 4 (01): : 76 - 88
  • [10] Explore Contextual Information for 3D Scene Graph Generation
    Liu, Yuanyuan
    Long, Chengjiang
    Zhang, Zhaoxuan
    Liu, Bokai
    Zhang, Qiang
    Yin, Baocai
    Yang, Xin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (12) : 5556 - 5568