A comprehensive evaluation of large language models in mining gene relations and pathway knowledge

被引：3

作者：

Azam, Muhammad ^{[1
,2
]}

Chen, Yibo ^{[1
,2
,3
]}

Arowolo, Micheal Olaolu ^{[1
,2
]}

Liu, Haowang ^{[1
,2
]}

Popescu, Mihail ^{[1
,3
,4
]}

Xu, Dong ^{[1
,2
,3
]}

机构：

[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65201 USA

[2] Univ Missouri, Bond Life Sci Ctr, Columbia, MO 65201 USA

[3] Univ Missouri, Inst Data Sci & Informat, Columbia, MO 65201 USA

[4] Univ Missouri, Dept Biomed Informat Biostat & Med Epidemiol, Columbia, MO USA

来源：

QUANTITATIVE BIOLOGY | 2024年 / 12卷 / 04期

关键词：

biomedical text mining; gene-gene interaction; KEGG pathway; large language model;

D O I：

10.1002/qub2.57

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Understanding complex biological pathways, including gene-gene interactions and gene regulatory networks, is critical for exploring disease mechanisms and drug development. Manual literature curation of biological pathways cannot keep up with the exponential growth of new discoveries in the literature. Large-scale language models (LLMs) trained on extensive text corpora contain rich biological information, and they can be mined as a biological knowledge graph. This study assesses 21 LLMs, including both application programming interface (API)-based models and open-source models in their capacities of retrieving biological knowledge. The evaluation focuses on predicting gene regulatory relations (activation, inhibition, and phosphorylation) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway components. Results indicated a significant disparity in model performance. API-based models GPT-4 and Claude-Pro showed superior performance, with an F1 score of 0.4448 and 0.4386 for the gene regulatory relation prediction, and a Jaccard similarity index of 0.2778 and 0.2657 for the KEGG pathway prediction, respectively. Open-source models lagged behind their API-based counterparts, whereas Falcon-180b and llama2-7b had the highest F1 scores of 0.2787 and 0.1923 in gene regulatory relations, respectively. The KEGG pathway recognition had a Jaccard similarity index of 0.2237 for Falcon-180b and 0.2207 for llama2-7b. Our study suggests that LLMs are informative in gene network analysis and pathway mapping, but their effectiveness varies, necessitating careful model selection. This work also provides a case study and insight into using LLMs das knowledge graphs. Our code is publicly available at the website of GitHub (Muh-aza).

引用

页码：360 / 374

页数：15

共 50 条

[1] Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond
Xu, Fangzhi
Lin, Qika
Han, Jiawei
Zhao, Tianzhe
Liu, Jun
Cambria, Erik
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (04) : 1620 - 1634
[2] A comprehensive survey of large language models and multimodal large models in medicine
Xiao, Hanguang
Zhou, Feizhong
Liu, Xingyue
Liu, Tianqi
Li, Zhipeng
Liu, Xin
Huang, Xiaoxuan
INFORMATION FUSION, 2025, 117
[3] Large Language Models: A Comprehensive Guide for Radiologists
Kim, Sunkyu
Lee, Choong-kun
Kim, Seung-seob
JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY, 2024, 85 (05): : 861 - 882
[4] LVLM-EHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Xu, Peng
Shao, Wenqi
Zhang, Kaipeng
Gao, Peng
Liu, Shuo
Lei, Meng
Meng, Fanqing
Huang, Siyuan
Qiao, Yu
Luo, Ping
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1877 - 1893
[5] Quo Vadis ChatGPT? From large language models to Large Knowledge Models
Venkatasubramanian, Venkat
Chakraborty, Arijit
COMPUTERS & CHEMICAL ENGINEERING, 2025, 192
[6] Benchmarking Biomedical Relation Knowledge in Large Language Models
Zhang, Fenghui
Yang, Kuo
Zhao, Chenqian
Li, Haixu
Dong, Xin
Tian, Haoyu
Zhou, Xuezhong
BIOINFORMATICS RESEARCH AND APPLICATIONS, PT II, ISBRA 2024, 2024, 14955 : 482 - 495
[7] Workshop on Enterprise Knowledge Graphs using Large Language Models
Gupta, Rajeev
Srinivasa, Srinath
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5271 - 5272
[8] Unlocking the Black Box? A Comprehensive Exploration of Large Language Models in Rehabilitation
Bonnechere, Bruno
AMERICAN JOURNAL OF PHYSICAL MEDICINE & REHABILITATION, 2024, 103 (06) : 532 - 537
[9] Are large language models qualified reviewers in originality evaluation?
Huang, Shengzhi
Huang, Yong
Liu, Yinpeng
Luo, Zhuoran
Lu, Wei
INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)
[10] Towards Generating Executable Metamorphic Relations Using Large Language Models
Shin, Seung Yeob
Pastore, Fabrizio
Bianculli, Domenico
Baicoianu, Alexandra
QUALITY OF INFORMATION AND COMMUNICATIONS TECHNOLOGY, QUATIC 2024, 2024, 2178 : 126 - 141

← 1 2 3 4 5 →