A Survey of Embodied AI: From Simulators to Research Tasks

Cited by: 101
Authors
Duan, Jiafei [1]
Yu, Samson [2]
Tan, Hui Li [3]
Zhu, Hongyuan [3]
Tan, Cheston [3]
Affiliations
[1] Nanyang Technol Univ Singapore, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Singapore Univ Technol & Design, Singapore 487372, Singapore
[3] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2022, Vol. 6, No. 2
Funding
National Research Foundation, Singapore
Keywords
Artificial intelligence; Task analysis; Navigation; Physics; Three-dimensional displays; Visualization; Solid modeling; Embodied AI; computer vision; 3D simulators
DOI
10.1109/TETCI.2022.3141105
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
There has been an emerging paradigm shift from the era of "internet AI" to "embodied AI," where AI algorithms and agents no longer learn from datasets of images, videos or text curated primarily from the internet. Instead, they learn through interactions with their environments from an egocentric perception similar to humans. Consequently, there has been substantial growth in the demand for embodied AI simulators to support various embodied AI research tasks. This growing interest in embodied AI is beneficial to the greater pursuit of Artificial General Intelligence (AGI), but there has not been a contemporary and comprehensive survey of this field. This paper aims to provide an encyclopedic survey for the field of embodied AI, from its simulators to its research. By evaluating nine current embodied AI simulators with our proposed seven features, this paper aims to understand the simulators in their provision for use in embodied AI research and their limitations. This paper then surveys the three main research tasks in embodied AI - visual exploration, visual navigation and embodied question answering (QA), covering the state-of-the-art approaches, evaluation metrics and datasets. Finally, with the new insights revealed through surveying the field, the paper will provide suggestions for simulator-for-task selections and recommendations for the future directions of the field.
Pages: 230-244
Page count: 15