A Survey of Embodied AI: From Simulators to Research Tasks

Cited by: 101
Authors
Duan, Jiafei [1]
Yu, Samson [2]
Tan, Hui Li [3]
Zhu, Hongyuan [3]
Tan, Cheston [3]
Affiliations
[1] Nanyang Technol Univ Singapore, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Singapore Univ Technol & Design, Singapore 487372, Singapore
[3] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2022 / Vol. 6 / No. 02
Funding
National Research Foundation, Singapore
Keywords
Artificial intelligence; Task analysis; Navigation; Physics; Three-dimensional displays; Visualization; Solid modeling; Embodied AI; computer vision; 3D simulators
DOI
10.1109/TETCI.2022.3141105
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
There has been an emerging paradigm shift from the era of "internet AI" to "embodied AI," where AI algorithms and agents no longer learn from datasets of images, videos or text curated primarily from the internet. Instead, they learn through interactions with their environments from an egocentric perspective, as humans do. Consequently, there has been substantial growth in the demand for embodied AI simulators to support various embodied AI research tasks. This growing interest in embodied AI is beneficial to the greater pursuit of Artificial General Intelligence (AGI), but there has not been a contemporary and comprehensive survey of this field. This paper aims to provide an encyclopedic survey for the field of embodied AI, from its simulators to its research. By evaluating nine current embodied AI simulators against our proposed seven features, this paper aims to understand how well the simulators support embodied AI research and where their limitations lie. This paper then surveys the three main research tasks in embodied AI: visual exploration, visual navigation and embodied question answering (QA), covering the state-of-the-art approaches, evaluation metrics and datasets. Finally, with the new insights revealed through surveying the field, the paper provides suggestions for simulator-for-task selections and recommendations for the future directions of the field.
Pages: 230-244
Number of pages: 15