A comprehensive evaluation of large language models in mining gene relations and pathway knowledge

被引:3
|
作者
Azam, Muhammad [1 ,2 ]
Chen, Yibo [1 ,2 ,3 ]
Arowolo, Micheal Olaolu [1 ,2 ]
Liu, Haowang [1 ,2 ]
Popescu, Mihail [1 ,3 ,4 ]
Xu, Dong [1 ,2 ,3 ]
机构
[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65201 USA
[2] Univ Missouri, Bond Life Sci Ctr, Columbia, MO 65201 USA
[3] Univ Missouri, Inst Data Sci & Informat, Columbia, MO 65201 USA
[4] Univ Missouri, Dept Biomed Informat Biostat & Med Epidemiol, Columbia, MO USA
关键词
biomedical text mining; gene-gene interaction; KEGG pathway; large language model;
D O I
10.1002/qub2.57
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Understanding complex biological pathways, including gene-gene interactions and gene regulatory networks, is critical for exploring disease mechanisms and drug development. Manual literature curation of biological pathways cannot keep up with the exponential growth of new discoveries in the literature. Large-scale language models (LLMs) trained on extensive text corpora contain rich biological information, and they can be mined as a biological knowledge graph. This study assesses 21 LLMs, including both application programming interface (API)-based models and open-source models in their capacities of retrieving biological knowledge. The evaluation focuses on predicting gene regulatory relations (activation, inhibition, and phosphorylation) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway components. Results indicated a significant disparity in model performance. API-based models GPT-4 and Claude-Pro showed superior performance, with an F1 score of 0.4448 and 0.4386 for the gene regulatory relation prediction, and a Jaccard similarity index of 0.2778 and 0.2657 for the KEGG pathway prediction, respectively. Open-source models lagged behind their API-based counterparts, whereas Falcon-180b and llama2-7b had the highest F1 scores of 0.2787 and 0.1923 in gene regulatory relations, respectively. The KEGG pathway recognition had a Jaccard similarity index of 0.2237 for Falcon-180b and 0.2207 for llama2-7b. Our study suggests that LLMs are informative in gene network analysis and pathway mapping, but their effectiveness varies, necessitating careful model selection. This work also provides a case study and insight into using LLMs das knowledge graphs. Our code is publicly available at the website of GitHub (Muh-aza).
引用
收藏
页码:360 / 374
页数:15
相关论文
共 50 条
  • [1] Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond
    Xu, Fangzhi
    Lin, Qika
    Han, Jiawei
    Zhao, Tianzhe
    Liu, Jun
    Cambria, Erik
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (04) : 1620 - 1634
  • [2] A comprehensive survey of large language models and multimodal large models in medicine
    Xiao, Hanguang
    Zhou, Feizhong
    Liu, Xingyue
    Liu, Tianqi
    Li, Zhipeng
    Liu, Xin
    Huang, Xiaoxuan
    INFORMATION FUSION, 2025, 117
  • [3] Large Language Models: A Comprehensive Guide for Radiologists
    Kim, Sunkyu
    Lee, Choong-kun
    Kim, Seung-seob
    JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY, 2024, 85 (05): : 861 - 882
  • [4] LVLM-EHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
    Xu, Peng
    Shao, Wenqi
    Zhang, Kaipeng
    Gao, Peng
    Liu, Shuo
    Lei, Meng
    Meng, Fanqing
    Huang, Siyuan
    Qiao, Yu
    Luo, Ping
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1877 - 1893
  • [5] Quo Vadis ChatGPT? From large language models to Large Knowledge Models
    Venkatasubramanian, Venkat
    Chakraborty, Arijit
    COMPUTERS & CHEMICAL ENGINEERING, 2025, 192
  • [6] Benchmarking Biomedical Relation Knowledge in Large Language Models
    Zhang, Fenghui
    Yang, Kuo
    Zhao, Chenqian
    Li, Haixu
    Dong, Xin
    Tian, Haoyu
    Zhou, Xuezhong
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT II, ISBRA 2024, 2024, 14955 : 482 - 495
  • [7] Workshop on Enterprise Knowledge Graphs using Large Language Models
    Gupta, Rajeev
    Srinivasa, Srinath
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5271 - 5272
  • [8] Unlocking the Black Box? A Comprehensive Exploration of Large Language Models in Rehabilitation
    Bonnechere, Bruno
    AMERICAN JOURNAL OF PHYSICAL MEDICINE & REHABILITATION, 2024, 103 (06) : 532 - 537
  • [9] Are large language models qualified reviewers in originality evaluation?
    Huang, Shengzhi
    Huang, Yong
    Liu, Yinpeng
    Luo, Zhuoran
    Lu, Wei
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)
  • [10] Towards Generating Executable Metamorphic Relations Using Large Language Models
    Shin, Seung Yeob
    Pastore, Fabrizio
    Bianculli, Domenico
    Baicoianu, Alexandra
    QUALITY OF INFORMATION AND COMMUNICATIONS TECHNOLOGY, QUATIC 2024, 2024, 2178 : 126 - 141