Long-Term Human Trajectory Prediction Using 3D Dynamic Scene Graphs

被引:1
|
作者
Gorlo, Nicolas [1 ]
Schmid, Lukas [1 ]
Carlone, Luca [1 ]
机构
[1] MIT, MIT SPARK Lab, Cambridge, MA 02139 USA
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 12期
基金
瑞士国家科学基金会; 芬兰科学院;
关键词
Trajectory; Probabilistic logic; Three-dimensional displays; Predictive models; Indoor environment; Planning; Cognition; Annotations; Service robots; Legged locomotion; AI-enabled robotics; human-centered robotics; service robotics; datasets for human motion; modeling and simulating humans; NAVIGATION;
D O I
10.1109/LRA.2024.3482169
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We present a novel approach for long-term human trajectory prediction in indoor human-centric environments, which is essential for long-horizon robot planning in these environments. State-of-the-art human trajectory prediction methods are limited by their focus on collision avoidance and short-term planning, and their inability to model complex interactions of humans with the environment. In contrast, our approach overcomes these limitations by predicting sequences of human interactions with the environment and using this information to guide trajectory predictions over a horizon of up to 60s . We leverage Large Language Models (LLMs) to predict interactions with the environment by conditioning the LLM prediction on rich contextual information about the scene. This information is given as a 3D Dynamic Scene Graph that encodes the geometry, semantics, and traversability of the environment into a hierarchical representation. We then ground these interaction sequences into multi-modal spatio-temporal distributions over human positions using a probabilistic approach based on continuous-time Markov Chains. To evaluate our approach, we introduce a new semi-synthetic dataset of long-term human trajectories in complex indoor environments, which also includes annotations of human-object interactions. We show in thorough experimental evaluations that our approach achieves a 54% lower average negative log-likelihood and a 26.5% lower Best-of-20 displacement error compared to the best non-privileged (i.e., evaluated in a zero-shot fashion on the dataset) baselines for a time horizon of 60 s .
引用
收藏
页码:10978 / 10985
页数:8
相关论文
共 50 条
  • [31] Managing Sets of Flying Base Stations Using Energy Efficient 3D Trajectory Planning in Cellular Networks
    Sobouti, Mohammad Javad
    Mohajerzadeh, Amir Hossein
    Seno, Seyed Amin Hosseini
    Yanikomeroglu, Halim
    IEEE SENSORS JOURNAL, 2023, 23 (10) : 10983 - 10997
  • [32] A Novel Spatio-Temporal 3D Convolutional Encoder-Decoder Network for Dynamic Saliency Prediction
    Li, Hao
    Qi, Fei
    Shi, Guangming
    IEEE ACCESS, 2021, 9 : 36328 - 36341
  • [33] Safe Walking Route Recommender Based on Fall Risk Calculation Using a Digital Human Model on a 3D Map
    Minakata, Mayuko
    Maruyama, Tsubasa
    Tada, Mitsunori
    Ramasamy, Priyanka
    Das, Swagata
    Kurita, Yuichi
    IEEE ACCESS, 2022, 10 : 8424 - 8433
  • [34] Human Tracking by a Mobile Robot using 3D Features
    Ali, Badar
    Qureshi, Ahmed Hussain
    Iqbal, Khawaja Fahad
    Ayaz, Yasar
    Gilani, Syed Omer
    Jamil, Mohsin
    Muhammad, Naveed
    Ahmed, Faizan
    Muhammad, Mannan Saeed
    Kim, Whoi-Yul
    Ra, Moonsoo
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 2464 - 2469
  • [35] Constructing 3D Maps for Dynamic Environments using Autonomous UAVs
    Ahmed, Ahmed Abdelmoamen
    Olumide, Abel
    Akinwa, Adeoluwa
    Chouikha, Mohamed
    PROCEEDINGS OF THE 16TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS'19), 2019, : 504 - 513
  • [36] Pose-Driven Compression for Dynamic 3D Human via Human Prior Models
    Yan, Ruoke
    Yin, Qian
    Zhang, Xinfeng
    Zhang, Qi
    Zhang, Gai
    Ma, Siwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5820 - 5834
  • [37] Toward Realistic 3D Human Motion Prediction With a Spatio-Temporal Cross- Transformer Approach
    Yu, Hua
    Fan, Xuanzhe
    Hou, Yaqing
    Pei, Wenbin
    Ge, Hongwei
    Yang, Xin
    Zhou, Dongsheng
    Zhang, Qiang
    Zhang, Mengjie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5707 - 5720
  • [38] Vehicle Position Prediction Using Particle Filtering Based on 3D CNN-LSTM Model
    Wang, Jiaqin
    Liu, Kai
    Gong, Yi
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (04) : 2992 - 3004
  • [39] Dynamic Scene Understanding for Autonomous Driving Using 2D-3D Convolution With Voxel Key Points
    Liu, Kunhua
    Zheng, Yi
    Xie, Junkun
    Xie, Yuting
    Wang, Feiyang
    Ma, Longyan
    Dai, Chenggang
    Lu, Tao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
  • [40] Advanced 3D Motion Prediction for Video-Based Dynamic Point Cloud Compression
    Li, Li
    Li, Zhu
    Zakharchenko, Vladyslav
    Chen, Jianle
    Li, Houqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 289 - 302