Large language models and their applications in bioinformatics

被引:2
|
作者
Sarumi, Oluwafemi A. [1 ,2 ]
Heider, Dominik [1 ,2 ]
机构
[1] Univ Munster, Inst Med Informat, Albert Schweitzer Campus, D-48149 Munster, Germany
[2] Heinrich Heine Univ Duesseldorf, Inst Comp Sci, Graf Adolf Str 63, D-40215 Dusseldorf, Germany
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2024年 / 23卷
关键词
Bioinformatics; Large language models; Natural language processing; Omics data;
D O I
10.1016/j.csbj.2024.09.031
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recent advancements in Natural Language Processing (NLP) have been significantly driven by the development of Large Language Models (LLMs), representing a substantial leap in language-based technology capabilities. These models, built on sophisticated deep learning architectures, typically transformers, are characterized by billions of parameters and extensive training data, enabling them to achieve high accuracy across various tasks. The transformer architecture of LLMs allows them to effectively handle context and sequential information, which is crucial for understanding and generating human language. Beyond traditional NLP applications, LLMs have shown significant promise in bioinformatics, transforming the field by addressing challenges associated with large and complex biological datasets. In genomics, proteomics, and personalized medicine, LLMs facilitate identifying patterns, predicting protein structures, or understanding genetic variations. This capability is crucial, e.g., for advancing drug discovery, where accurate prediction of molecular interactions is essential. This review discusses the current trends in LLMs research and their potential to revolutionize the field of bioinformatics and accelerate novel discoveries in the life sciences.
引用
收藏
页码:3498 / 3505
页数:8
相关论文
共 50 条
  • [1] BioCoder: a benchmark for bioinformatics code generation with large language models
    Tang, Xiangru
    Qian, Bill
    Gao, Rick
    Chen, Jiakang
    Chen, Xinyun
    Gerstein, Mark B.
    BIOINFORMATICS, 2024, 40 : i266 - i276
  • [2] Industrial applications of large language models
    Mubashar Raza
    Zarmina Jahangir
    Muhammad Bilal Riaz
    Muhammad Jasim Saeed
    Muhammad Awais Sattar
    Scientific Reports, 15 (1)
  • [3] Large language models for oncological applications
    Vera Sorin
    Yiftach Barash
    Eli Konen
    Eyal Klang
    Journal of Cancer Research and Clinical Oncology, 2023, 149 : 9505 - 9508
  • [4] Applications of transformer-based language models in bioinformatics: a survey
    Zhang, Shuang
    Fan, Rui
    Liu, Yuti
    Chen, Shuang
    Liu, Qiao
    Zeng, Wanwen
    NEURO-ONCOLOGY ADVANCES, 2023, 5 (01)
  • [5] Large language models for oncological applications
    Sorin, Vera
    Barash, Yiftach
    Konen, Eli
    Klang, Eyal
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2023, 149 (11) : 9505 - 9508
  • [6] Applications of Large Language Models in Pathology
    Cheng, Jerome
    BIOENGINEERING-BASEL, 2024, 11 (04):
  • [7] Applications of large language models in oncology
    Loefflert, Chiara M.
    Bressem, Keno K.
    Truhn, Daniel
    ONKOLOGIE, 2024, 30 (05): : 388 - 393
  • [8] A journey into the Generative AI and large language models: From NLP to BioInformatics
    Elnaggar, Ahmed
    INTERNATIONAL CONFERENCE ON GRAMMATICAL INFERENCE, VOL 217, 2023, 217 : 7 - 7
  • [9] Large language models: a primer and gastroenterology applications
    Shahab, Omer
    El Kurdi, Bara
    Shaukat, Aasma
    Nadkarni, Girish
    Soroush, Ali
    THERAPEUTIC ADVANCES IN GASTROENTEROLOGY, 2024, 17
  • [10] Looking to Future Applications of Large Language Models
    Liu, Xichong
    Rubin, Samuel J. S.
    Rogalla, Stephan
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2023, 118 (12): : 2306 - 2306