Hydra: A Real-time Spatial Perception System for 3D Scene Graph Construction and Optimization

Cited by: 0
Authors
Hughes, Nathan [1]
Chang, Yun [1]
Carlone, Luca [1]
Affiliations
[1] MIT, Lab Informat & Decis Syst, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Keywords
Robot perception; 3D scene graphs; localization and mapping; real-time scene understanding; REPRESENTATIONS;
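DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract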
3D scene graphs have recently emerged as a powerful high-level representation of 3D environments. A 3D scene graph describes the environment as a layered graph where nodes represent spatial concepts at multiple levels of abstraction (from low-level geometry to high-level semantics including objects, places, rooms, buildings, etc.) and edges represent relations between concepts. While 3D scene graphs can serve as an advanced "mental model" for robots, how to build such a rich representation in real time is still uncharted territory. This paper describes a real-time Spatial Perception System, a suite of algorithms to build a 3D scene graph from sensor data in real time. Our first contribution is to develop real-time algorithms to incrementally construct the layers of a scene graph as the robot explores the environment; these algorithms build a local Euclidean Signed Distance Function (ESDF) around the current robot location, extract a topological map of places from the ESDF, and then segment the places into rooms using an approach inspired by community-detection techniques. Our second contribution is to investigate loop closure detection and optimization in 3D scene graphs. We show that 3D scene graphs allow defining hierarchical descriptors for loop closure detection; our descriptors capture statistics across layers in the scene graph, ranging from low-level visual appearance to summary statistics about objects and places. We then propose the first algorithm to optimize a 3D scene graph in response to loop closures; our approach relies on embedded deformation graphs to simultaneously correct all layers of the scene graph. We implement the proposed Spatial Perception System in a highly parallelized architecture, named Hydra, that combines fast early and mid-level perception processes (e.g., local mapping) with slower high-level perception (e.g., global optimization of the scene graph). We evaluate Hydra on simulated and real data and show it is able to reconstruct 3D scene graphs with an accuracy comparable to batch offline methods despite running online.
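The layered-graph idea in the abstract can be illustrated with a minimal Python sketch. This is not Hydra's implementation or API: the class `SceneGraph`, its methods, the node ids, and the exact layer names are hypothetical and chosen only to mirror the layers the abstract lists (geometry, objects, places, rooms, buildings).

```python
from collections import defaultdict
from dataclasses import dataclass, field

# Layer names follow the abstract; everything else here is an illustrative assumption.
LAYERS = ("geometry", "objects", "places", "rooms", "buildings")


@dataclass
class SceneGraph:
    """Toy layered graph: each node belongs to one layer, edges are undirected."""
    nodes: dict = field(default_factory=dict)  # node id -> layer name
    edges: dict = field(default_factory=lambda: defaultdict(set))  # node id -> neighbor ids

    def add_node(self, node_id, layer):
        if layer not in LAYERS:
            raise ValueError(f"unknown layer: {layer}")
        self.nodes[node_id] = layer

    def add_edge(self, a, b):
        # Edges may connect nodes within a layer or across layers.
        if a not in self.nodes or b not in self.nodes:
            raise KeyError("both endpoints must be added first")
        self.edges[a].add(b)
        self.edges[b].add(a)

    def neighbors_in_layer(self, node_id, layer):
        # Return the neighbors of node_id that live in the given layer, sorted by id.
        return sorted(n for n in self.edges.get(node_id, ()) if self.nodes[n] == layer)


graph = SceneGraph()
graph.add_node("room_0", "rooms")
graph.add_node("place_0", "places")
graph.add_node("place_1", "places")
graph.add_node("chair_0", "objects")
graph.add_edge("place_0", "place_1")  # intra-layer edge (traversability between places)
graph.add_edge("room_0", "place_0")   # inter-layer edge (room contains place)
graph.add_edge("room_0", "place_1")
graph.add_edge("place_0", "chair_0")  # inter-layer edge (object near place)

print(graph.neighbors_in_layer("room_0", "places"))
```

Storing the layer on each node keeps cross-layer queries, such as "which places belong to this room", to a simple filter over a node's neighbors.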
Pages: 13
Related papers
50 items in total
  • [41] Near Real-Time 3D Reconstruction and Quality 3D Point Cloud for Time-Critical Construction Monitoring
    Liu, Zuguang
    Kim, Daeho
    Lee, Sanghyun
    Zhou, Li
    An, Xuehui
    Liu, Meiyin
    BUILDINGS, 2023, 13 (02)
  • [42] Complementary spatial transformer network for real-time 3D object recognition
    Krishna Kumar, K. P.
    Paul, Varghese
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (05)
  • [43] Real-Time Spatial 3D Audio Synthesis on FPGAs for Blind Sailing
    Singhani, Anish
    Morrow, Alexander
    2020 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA '20), 2020, : 104 - 110
  • [44] Study and Implementation of 3D Scene Management and Real-Time Rendering Technology Based on OSG
    Qun, Wei
    Lie, Xu
    ELECTRONIC INFORMATION AND ELECTRICAL ENGINEERING, 2012, 19 : 529 - 532
  • [45] Virtual marionettes: a system and paradigm for real-time 3D animation
    Bar-Lev, A
    Bruckstein, AM
    Elber, G
    VISUAL COMPUTER, 2005, 21 (07): : 488 - 501
  • [46] Real-time 3D microscope system incorporating optical tweezers
    Hirano, Akinari
    Shimada, Naoya
    Ikeuchi, Masashi
    Ikuta, Koji
    Transactions of Japanese Society for Medical and Biological Engineering, 2014, 52
  • [47] Onboard real-time system for 3D urban environment reconstruction
    Abuhadrous, I
    Nashashibi, F
    Laurgeau, C
    Goulette, F
    IEEE IV2003: INTELLIGENT VEHICLES SYMPOSIUM, PROCEEDINGS, 2003, : 479 - 483
  • [48] A real-time analysis of 3D scene from monocular images by observing known background
    Kondo, K
    Kobashi, S
    Hata, Y
    ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 5 - 8
  • [49] Monocular 3D Vision Using Real-Time Generated Scene with Depth of Field Effect
    Hosomi, Takashi
    Sakamoto, Kunio
    ENTERTAINMENT COMPUTING - ICEC 2009, 2009, 5709 : 284 - 285
  • [50] C/S based real-time 3D graphics system
    Tan, JW
    Jing, YC
    SYSTEM SIMULATION AND SCIENTIFIC COMPUTING (SHANGHAI), VOLS I AND II, 2002, : 398 - 402