Long-Term Human Trajectory Prediction Using 3D Dynamic Scene Graphs

被引：1

作者：

Gorlo, Nicolas ^{[1
]}

Schmid, Lukas ^{[1
]}

Carlone, Luca ^{[1
]}

机构：

[1] MIT, MIT SPARK Lab, Cambridge, MA 02139 USA

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 12期

基金：

瑞士国家科学基金会; 芬兰科学院;

关键词：

Trajectory; Probabilistic logic; Three-dimensional displays; Predictive models; Indoor environment; Planning; Cognition; Annotations; Service robots; Legged locomotion; AI-enabled robotics; human-centered robotics; service robotics; datasets for human motion; modeling and simulating humans; NAVIGATION;

D O I：

10.1109/LRA.2024.3482169

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

We present a novel approach for long-term human trajectory prediction in indoor human-centric environments, which is essential for long-horizon robot planning in these environments. State-of-the-art human trajectory prediction methods are limited by their focus on collision avoidance and short-term planning, and their inability to model complex interactions of humans with the environment. In contrast, our approach overcomes these limitations by predicting sequences of human interactions with the environment and using this information to guide trajectory predictions over a horizon of up to 60s . We leverage Large Language Models (LLMs) to predict interactions with the environment by conditioning the LLM prediction on rich contextual information about the scene. This information is given as a 3D Dynamic Scene Graph that encodes the geometry, semantics, and traversability of the environment into a hierarchical representation. We then ground these interaction sequences into multi-modal spatio-temporal distributions over human positions using a probabilistic approach based on continuous-time Markov Chains. To evaluate our approach, we introduce a new semi-synthetic dataset of long-term human trajectories in complex indoor environments, which also includes annotations of human-object interactions. We show in thorough experimental evaluations that our approach achieves a 54% lower average negative log-likelihood and a 26.5% lower Best-of-20 displacement error compared to the best non-privileged (i.e., evaluated in a zero-shot fashion on the dataset) baselines for a time horizon of 60 s .

引用

页码：10978 / 10985

页数：8

共 50 条

[31] Managing Sets of Flying Base Stations Using Energy Efficient 3D Trajectory Planning in Cellular Networks
Sobouti, Mohammad Javad
Mohajerzadeh, Amir Hossein
Seno, Seyed Amin Hosseini
Yanikomeroglu, Halim
IEEE SENSORS JOURNAL, 2023, 23 (10) : 10983 - 10997
[32] A Novel Spatio-Temporal 3D Convolutional Encoder-Decoder Network for Dynamic Saliency Prediction
Li, Hao
Qi, Fei
Shi, Guangming
IEEE ACCESS, 2021, 9 : 36328 - 36341
[33] Safe Walking Route Recommender Based on Fall Risk Calculation Using a Digital Human Model on a 3D Map
Minakata, Mayuko
Maruyama, Tsubasa
Tada, Mitsunori
Ramasamy, Priyanka
Das, Swagata
Kurita, Yuichi
IEEE ACCESS, 2022, 10 : 8424 - 8433
[34] Human Tracking by a Mobile Robot using 3D Features
Ali, Badar
Qureshi, Ahmed Hussain
Iqbal, Khawaja Fahad
Ayaz, Yasar
Gilani, Syed Omer
Jamil, Mohsin
Muhammad, Naveed
Ahmed, Faizan
Muhammad, Mannan Saeed
Kim, Whoi-Yul
Ra, Moonsoo
2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 2464 - 2469
[35] Constructing 3D Maps for Dynamic Environments using Autonomous UAVs
Ahmed, Ahmed Abdelmoamen
Olumide, Abel
Akinwa, Adeoluwa
Chouikha, Mohamed
PROCEEDINGS OF THE 16TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS'19), 2019, : 504 - 513
[36] Pose-Driven Compression for Dynamic 3D Human via Human Prior Models
Yan, Ruoke
Yin, Qian
Zhang, Xinfeng
Zhang, Qi
Zhang, Gai
Ma, Siwei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5820 - 5834
[37] Toward Realistic 3D Human Motion Prediction With a Spatio-Temporal Cross- Transformer Approach
Yu, Hua
Fan, Xuanzhe
Hou, Yaqing
Pei, Wenbin
Ge, Hongwei
Yang, Xin
Zhou, Dongsheng
Zhang, Qiang
Zhang, Mengjie
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5707 - 5720
[38] Vehicle Position Prediction Using Particle Filtering Based on 3D CNN-LSTM Model
Wang, Jiaqin
Liu, Kai
Gong, Yi
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (04) : 2992 - 3004
[39] Dynamic Scene Understanding for Autonomous Driving Using 2D-3D Convolution With Voxel Key Points
Liu, Kunhua
Zheng, Yi
Xie, Junkun
Xie, Yuting
Wang, Feiyang
Ma, Longyan
Dai, Chenggang
Lu, Tao
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
[40] Advanced 3D Motion Prediction for Video-Based Dynamic Point Cloud Compression
Li, Li
Li, Zhu
Zakharchenko, Vladyslav
Chen, Jianle
Li, Houqiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 289 - 302

← 1 2 3 4 5 →