LEVIOSA: Natural Language-Based Uncrewed Aerial Vehicle Trajectory Generation

被引:0
|
作者
Aikins, Godwyll [1 ]
Dao, Mawaba Pascal [2 ]
Moukpe, Koboyo Josias [2 ]
Eskridge, Thomas C. [2 ]
Nguyen, Kim-Doang [1 ]
机构
[1] Florida Inst Technol, Dept Mech & Civil Engn, Melbourne, FL 32901 USA
[2] Florida Inst Technol, Dept Elect Engn & Comp Sci, Melbourne, FL 32901 USA
基金
美国国家科学基金会;
关键词
multi-robot coordination; vision-language model; human-robot interaction; waypoint generation; drone choreography; program synthesis; multi-UAVs;
D O I
10.3390/electronics13224508
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents LEVIOSA, a novel framework for text- and speech-based uncrewed aerial vehicle (UAV) trajectory generation. By leveraging multimodal large language models (LLMs) to interpret natural language commands, the system converts text and audio inputs into executable flight paths for UAV swarms. The approach aims to simplify the complex task of multi-UAV trajectory generation, which has significant applications in fields such as search and rescue, agriculture, infrastructure inspection, and entertainment. The framework involves two key innovations: a multi-critic consensus mechanism to evaluate trajectory quality and a hierarchical prompt structuring for improved task execution. The innovations ensure fidelity to user goals. The framework integrates several multimodal LLMs for high-level planning, converting natural language inputs into 3D waypoints that guide UAV movements and per-UAV low-level controllers to control each UAV in executing its assigned 3D waypoint path based on the high-level plan. The methodology was tested on various trajectory types with promising accuracy, synchronization, and collision avoidance results. The findings pave the way for more intuitive human-robot interactions and advanced multi-UAV coordination.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] Interactive Natural Language-Based Person Search
    Shree, Vikram
    Chao, Wei-Lun
    Campbell, Mark
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02): : 1851 - 1858
  • [22] DAKRS: Domain Adaptive Knowledge-Based Retrieval System for Natural Language-Based Vehicle Retrieval
    Ha, Synh Viet-Uyen
    Le, Huy Dinh-Anh
    Nguyen, Quang Qui-Vinh
    Chung, Nhat Minh
    IEEE ACCESS, 2023, 11 : 90951 - 90965
  • [23] Language-based generation and evaluation of NIDS signatures
    Rubin, S
    Jha, S
    Miller, BP
    2005 IEEE Symposium on Security and Privacy, Proceedings, 2005, : 3 - 17
  • [24] DUN: Dual-path Temporal Matching Network for Natural Language-based Vehicle Retrieval
    Sun, Ziruo
    Liu, Xinfang
    Bi, Xiaopeng
    Nie, Xiushan
    Yin, Yilong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4056 - 4062
  • [25] Speech-based Natural Language Interface for UAV Trajectory Generation
    Meszaros, Erica L.
    Chandarana, Meghan
    Trujillo, Anna
    Allen, B. Danette
    2017 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS'17), 2017, : 46 - 55
  • [26] A natural language-based interface for querying a video database
    Kuecuektunc, Onur
    Gueduekbay, Ugur
    Ulusoy, Oezgur
    IEEE MULTIMEDIA, 2007, 14 (01) : 83 - 89
  • [27] Natural Language-Based Knowledge Extraction in Healthcare Domain
    Rencis, Edgars
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND DATA MINING (ICISDM 2019), 2019, : 138 - 142
  • [28] Natural Language-Based Automatic Programming for Industrial Robots
    Haiyang Hu
    Jie Chen
    Hanwen Liu
    Zhongjin Li
    Liguo Huang
    Journal of Grid Computing, 2022, 20
  • [29] Relational interface for natural language-based information sources
    Kawasaki, Z
    Shibata, K
    Tajima, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (06) : 1139 - 1143
  • [30] Natural language-based automatic qualitative description of clusters
    Sanchez-Hernandez, German
    Agell, Nuria
    Carlos Aguado, Juan
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE OF THE CATALAN ASSOCIATION FOR ARTIFICIAL INTELLIGENCE, 2013, 256 : 225 - 234