WaterGPT: Training a Large Language Model to Become a Hydrology Expert

Cited by: 5

Authors:

Ren, Yi [1]

Zhang, Tianyi [2]

Dong, Xurong [3]

Li, Weibin [1]

Wang, Zhiyang [4]

He, Jie [1]

Zhang, Hanzhi [1]

Jiao, Licheng [2]

Affiliations:

[1] Xidian Univ, Lab Artificial Intelligence, Hangzhou Inst Technol, Hangzhou 311231, Peoples R China

[2] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China

[3] Shaanxi Prov Inst Water Resources & Elect Power In, Xian 710048, Peoples R China

[4] Shaanxi Water Dev Grp Co Ltd, Xian 710018, Peoples R China

Source:

Water | 2024 / Volume 16 / Issue 21

Funding:

National Natural Science Foundation of China

Keywords:

WaterGPT; large language model; agent; prompt words

DOI:

10.3390/w16213075

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science]

Discipline classification codes:

08; 0830

Abstract:

This paper introduces WaterGPT, a language model designed for complex multimodal tasks in hydrology. WaterGPT is applied in three main areas: (1) processing and analyzing data such as images and text in water resources, (2) supporting intelligent decision-making for hydrological tasks, and (3) enabling interdisciplinary information integration and knowledge-based Q&A. The model has achieved promising results. One core aspect of WaterGPT involves the meticulous segmentation of training data for the supervised fine-tuning phase, sourced from real-world data and annotated with high quality using both manual methods and GPT-series model annotations. These data are carefully categorized into four types: knowledge-based, task-oriented, negative samples, and multi-turn dialogues. Additionally, another key component is the development of a multi-agent framework called Water_Agent, which enables WaterGPT to intelligently invoke various tools to solve complex tasks in the field of water resources. This framework handles multimodal data, including text and images, allowing for deep understanding and analysis of complex hydrological environments. Based on this framework, WaterGPT has achieved over a 90% success rate in tasks such as object detection and waterbody extraction. For the waterbody extraction task, using Dice and mIoU metrics, WaterGPT's performance on high-resolution images from 2013 to 2022 has remained stable, with accuracy exceeding 90%. Moreover, we have constructed a high-quality water resources evaluation dataset, EvalWater, which covers 21 categories and approximately 10,000 questions. Using this dataset, WaterGPT achieved the highest accuracy to date in the field of water resources, reaching 83.09%, which is about 17.83 points higher than GPT-4.
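The abstract reports waterbody extraction quality using the Dice and mIoU metrics. As an illustration of what those two scores measure, here is a minimal sketch for binary masks in plain Python. It is not the authors' evaluation code: the function names, the nested-list mask representation, the label convention (1 = water, 0 = background), and the choice to score an empty class as 1.0 are all assumptions made for this example.

```python
from typing import Sequence

# Assumed representation: a 2-D mask as nested sequences of integer labels,
# with 1 = water and 0 = background (hypothetical convention).
Mask = Sequence[Sequence[int]]


def _counts(pred: Mask, truth: Mask, cls: int):
    """Return (overlap, predicted pixels, ground-truth pixels) for one label."""
    inter = p_cnt = t_cnt = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            p_hit, t_hit = p == cls, t == cls
            inter += int(p_hit and t_hit)
            p_cnt += int(p_hit)
            t_cnt += int(t_hit)
    return inter, p_cnt, t_cnt


def dice(pred: Mask, truth: Mask, cls: int = 1) -> float:
    """Dice coefficient: 2|A ∩ B| / (|A| + |B|)."""
    inter, p_cnt, t_cnt = _counts(pred, truth, cls)
    denom = p_cnt + t_cnt
    # Assumption: if the class is absent from both masks, treat it as a perfect match.
    return 1.0 if denom == 0 else 2.0 * inter / denom


def iou(pred: Mask, truth: Mask, cls: int = 1) -> float:
    """Intersection over union: |A ∩ B| / |A ∪ B|."""
    inter, p_cnt, t_cnt = _counts(pred, truth, cls)
    union = p_cnt + t_cnt - inter
    return 1.0 if union == 0 else inter / union


def mean_iou(pred: Mask, truth: Mask, classes: Sequence[int] = (0, 1)) -> float:
    """mIoU: the per-class IoU averaged over the listed classes."""
    return sum(iou(pred, truth, c) for c in classes) / len(classes)


# Toy 2x3 example: two of the three predicted water pixels are correct.
pred = [[1, 1, 0],
        [0, 1, 0]]
truth = [[1, 0, 0],
         [0, 1, 1]]

water_dice = dice(pred, truth)   # 2*2 / (3 + 3)
water_iou = iou(pred, truth)     # 2 / (3 + 3 - 2)
miou = mean_iou(pred, truth)     # mean of water IoU and background IoU
```

Dice weights the overlap twice, so for the same pair of masks it is never lower than IoU; this is worth keeping in mind when comparing numbers reported under the two metrics.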

Pages: 22
