Advancing Chinese biomedical text mining with community challenges

被引:0
|
作者
Zong, Hui [1 ,2 ]
Wu, Rongrong [1 ,2 ]
Cha, Jiaxue [3 ]
Feng, Weizhe [1 ,2 ]
Wu, Erman [1 ,2 ]
Li, Jiakun [1 ,2 ,4 ]
Shao, Aibin [1 ,2 ]
Tao, Liang [5 ]
Li, Zuofeng [6 ]
Tang, Buzhou [7 ]
Shen, Bairong [1 ,2 ]
机构
[1] Sichuan Univ, West China Hosp, Joint Lab Artificial Intelligence Crit Care Med, Chengdu 610041, Peoples R China
[2] Sichuan Univ, West China Hosp, Inst Syst Genet, Frontiers Sci Ctr Dis Related Mol Network, Chengdu 610041, Peoples R China
[3] Tongji Univ, Shanghai Key Lab Signaling & Dis Res, Collaborat Innovat Ctr Brain Sci, Lab Receptor Based Biomed,Sch Life Sci & Technol, Shanghai 200092, Peoples R China
[4] Sichuan Univ, West China Hosp, Dept Urol, Chengdu 610041, Peoples R China
[5] Shanghai Business Sch, Fac Business Informat, Shanghai 201400, Peoples R China
[6] Takeda Co Ltd, Shanghai 200040, Peoples R China
[7] Harbin Inst Technol, Dept Comp Sci, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Biomedical text mining; Health information processing; Natural language processing; Artificial intelligence; Large language model; NAMED ENTITY RECOGNITION; EXTRACTION; PREDICTION; DISCOVERY; DATABASE; CORPUS;
D O I
10.1016/j.jbi.2024.104716
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: This study aims to review the recent advances in community challenges for biomedical text mining in China. Methods: We collected information of evaluation tasks released in community challenges of biomedical text mining, including task description, dataset description, data source, task type and related links. A systematic summary and comparative analysis were conducted on various biomedical natural language processing tasks, such as named entity recognition, entity normalization, attribute extraction, relation extraction, event extraction, text classification, text similarity, knowledge graph construction, question answering, text generation, and large language model evaluation. Results: We identified 39 evaluation tasks from 6 community challenges that spanned from 2017 to 2023. Our analysis revealed the diverse range of evaluation task types and data sources in biomedical text mining. We explored the potential clinical applications of these community challenge tasks from a translational biomedical informatics perspective. We compared with their English counterparts, and discussed the contributions, limitations, lessons and guidelines of these community challenges, while highlighting future directions in the era of large language models. Conclusion: Community challenge evaluation competitions have played a crucial role in promoting technology innovation and fostering interdisciplinary collaboration in the field of biomedical text mining. These challenges provide valuable platforms for researchers to develop state-of-the-art solutions.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Community challenges in biomedical text mining over 10 years: success, failure and the future
    Huang, Chung-Chi
    Lu, Zhiyong
    BRIEFINGS IN BIOINFORMATICS, 2016, 17 (01) : 132 - 144
  • [2] Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges
    Singhal, Ayush
    Leaman, Robert
    Catlett, Natalie
    Lemberger, Thomas
    McEntyre, Johanna
    Polson, Shawn
    Xenarios, Ioannis
    Arighi, Cecilia
    Lu, Zhiyong
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
  • [3] Biomedical text mining for research rigor and integrity: tasks, challenges, directions
    Kilicoglu, Halil
    BRIEFINGS IN BIOINFORMATICS, 2018, 19 (06) : 1400 - 1414
  • [4] Text mining the biomedical literature
    Pertsemlidis, A
    BIOPHYSICAL JOURNAL, 2002, 82 (01) : 168A - 168A
  • [6] @Note: A workbench for Biomedical Text Mining
    Lourenco, Analia
    Carreira, Rafael
    Carneiro, Sonia
    Maia, Paulo
    Glez-Pena, Daniel
    Fdez-Riverola, Florentino
    Ferreira, Eugenio C.
    Rocha, Isabel
    Rocha, Miguel
    JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (04) : 710 - 720
  • [7] Biomedical Text Mining and Its Applications
    Rodriguez-Esteban, Raul
    PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (12)
  • [8] Text mining patents for biomedical knowledge
    Rodriguez-Esteban, Raul
    Bundschus, Markus
    DRUG DISCOVERY TODAY, 2016, 21 (06) : 997 - 1002
  • [9] New frontiers in biomedical text mining
    Zweigenbaum, Pierre
    Demner-Fushman, Dina
    Yu, Hong
    Cohen, K. Bretonnel
    Pacific Symposium on Biocomputing 2007, 2007, : 205 - 208
  • [10] TEXT AND DATA MINING FOR BIOMEDICAL DISCOVERY
    Gonzalez, Graciela
    Cohen, Kevin Bretonnel
    Leaman, Robert
    Greene, Casey S.
    Shah, Nigam
    Kann, Maricel G.
    Ye, Jieping
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2014, 2014, : 312 - 315