Geospatial large language model trained with a simulated environment for generating tool-use chains autonomously

Cited by: 2
Authors
Zhang, Yifan [1]
Li, Jingxuan [1]
Wang, Zhiyun [1]
He, Zhengting [1]
Guan, Qingfeng [1]
Lin, Jianfeng [2]
Yu, Wenhao [1]
Affiliations
[1] China Univ Geosci, Sch Geog & Informat Engn, Wuhan, Peoples R China
[2] Meituan, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
GIS; Large language model; Tool-use chain
DOI
10.1016/j.jag.2024.104312
Chinese Library Classification number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract
Solving geospatial tasks generally requires multiple geospatial tools and steps, i.e., tool-use chains. Automating the geospatial task-solving process can effectively enhance the efficiency of GIS users. Traditionally, researchers have tended to design rule-based systems to autonomously solve similar geospatial tasks, but such systems are inflexible and difficult to adapt to different tasks. With the development of Large Language Models (LLMs), some research suggests that LLMs have the potential for intelligent task solving through their tool-use ability, meaning that LLMs can invoke externally provided tools for specific tasks. However, most studies rely on closed-source commercial LLMs like ChatGPT and GPT-4, whose limited API accessibility restricts their deployment on local private devices. Some researchers in the general domain have proposed using instruction tuning to improve the tool-use ability of open-source LLMs. However, solving geospatial tasks requires tool-use chains that include multiple data input and output processes, which poses challenges for collecting effective instruction tuning data. To address these challenges, we propose a framework for training a Geospatial large language model to generate Tool-use Chains autonomously (GTChain). Specifically, we design a seed task-guided self-instruct strategy to generate a geospatial tool-use instruction tuning dataset within a simulated environment, encompassing diverse geospatial task production and corresponding tool-use chain generation. Subsequently, an open-source general-domain LLM, LLaMA-2-7B, is fine-tuned on the collected instruction data to understand geospatial tasks and learn how to generate geospatial tool-use chains. Finally, we also collect an evaluation dataset to serve as a benchmark for assessing the geospatial tool-use ability of LLMs.
Experimental results on the evaluation dataset demonstrate that the fine-tuned GTChain can effectively solve geospatial tasks using the provided tools, achieving 32.5% and 27.5% higher accuracy in the percentage of correctly solved tasks compared to GPT-4 and Gemini 1.5 Pro, respectively.
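As an illustration of what a tool-use chain with multiple data input and output steps might look like, the following is a minimal sketch and not the GTChain implementation. The tool names (load_points, filter_within, count), the "$name" convention for referring to an earlier step's output, and the toy coordinates are all assumptions made for this example.

```python
from typing import Any, Callable, Dict, List, Tuple

Point = Tuple[float, float]

# Hypothetical toy tools; a real system would wrap actual GIS operations.
def load_points(name: str) -> List[Point]:
    """Return a small in-memory point layer (made-up coordinates)."""
    layers = {"restaurants": [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]}
    return layers[name]

def filter_within(points: List[Point], center: Point, radius: float) -> List[Point]:
    """Keep points whose Euclidean distance to center is at most radius."""
    cx, cy = center
    return [(x, y) for x, y in points
            if (x - cx) ** 2 + (y - cy) ** 2 <= radius ** 2]

def count(points: List[Point]) -> int:
    """Count the points in a layer."""
    return len(points)

TOOLS: Dict[str, Callable[..., Any]] = {
    "load_points": load_points,
    "filter_within": filter_within,
    "count": count,
}

def run_chain(chain: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Execute steps in order; string arguments starting with '$' are
    replaced by the stored output of an earlier step."""
    results: Dict[str, Any] = {}
    for step in chain:
        args = {
            key: results[val[1:]] if isinstance(val, str) and val.startswith("$") else val
            for key, val in step["args"].items()
        }
        results[step["output"]] = TOOLS[step["tool"]](**args)
    return results

# A chain such as a model might emit for the task
# "How many restaurants lie within 2 units of the origin?"
chain = [
    {"tool": "load_points", "args": {"name": "restaurants"}, "output": "pts"},
    {"tool": "filter_within",
     "args": {"points": "$pts", "center": (0.0, 0.0), "radius": 2.0},
     "output": "near"},
    {"tool": "count", "args": {"points": "$near"}, "output": "n"},
]

results = run_chain(chain)
print(results["n"])
```

Each step consumes the output of an earlier one, which is the data-passing property the abstract identifies as making geospatial instruction data hard to collect.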
Pages: 16