ProtoCode: Leveraging large language models (LLMs) for automated generation of machine-readable PCR protocols from scientific publications

被引:4
作者
Jiang, Shuo [1 ]
Evans-Yamamoto, Daniel [2 ]
Bersenev, Dennis [1 ]
Palaniappan, Sucheendra K. [2 ]
Yachie-Kinoshita, Ayako [1 ,2 ]
机构
[1] SBX BioSci Inc, 1600-925 West Georgia St, Vancouver, BC V6C 3L2, Canada
[2] Syst Biol Inst, Saisei Ikedayama Bldg,5-10-25,Higashi Gotanda,Shin, Tokyo 1410022, Japan
来源
SLAS TECHNOLOGY | 2024年 / 29卷 / 03期
关键词
Protocol standardization; Text mining; Large language model; Lab automation;
D O I
10.1016/j.slast.2024.100134
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protocol standardization and sharing are crucial for reproducibility in life sciences. In spite of numerous efforts for standardized protocol description, adherence to these standards in literature remains largely inconsistent. Curation of protocols are especially challenging due to the labor intensive process, requiring expert domain knowledge of each experimental procedure. Recent advancements in Large Language Models (LLMs) offer a promising solution to interpret and curate knowledge from complex scientific literature. In this work, we develop ProtoCode, a tool leveraging fine-tune LLMs to curate protocols into intermediate representation formats which can be interpretable by both human and machine interfaces. Our proof-of-concept, focused on polymerase chain reaction (PCR) protocols, retrieves information from PCR protocols at an accuracy ranging 69-100 % depending on the information content. In all tested protocols, we demonstrate that ProtoCode successfully converts literature-based protocols into correct operational files for multiple thermal cycler systems. In conclusion, ProtoCode can alleviate labor intensive curation and standardization of life science protocols to enhance research reproducibility by providing a reliable, automated means to process and standardize protocols. ProtoCode is freely available as a web server at https://curation.taxila.io/ProtoCode/.
引用
收藏
页数:6
相关论文
共 29 条
[1]   Biocoder: A programming language for standardizing and automating biology protocols [J].
Vaishnavi Ananthanarayanan ;
William Thies .
Journal of Biological Engineering, 4 (1)
[2]   The Laboratory Automation Protocol (LAP) Format and Repository: A Platform for Enhancing Workflow Efficiency in Synthetic Biology [J].
Anhel, Ana-Mariya ;
Alejaldre, Lorea ;
Goni-Moreno, Angel .
ACS SYNTHETIC BIOLOGY, 2023, 12 (12) :3514-3520
[3]   Building an Open Representation for Biological Protocols [J].
Bartley, Bryan ;
Beal, Jacob ;
Rogers, Miles ;
Bryce, Daniel ;
Goldman, Robert P. ;
Keller, Benjamin ;
Lee, Peter ;
Biggers, Vanessa ;
Nowak, Joshua ;
Weston, Mark .
ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2023, 19 (03)
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   Enhancing bioreactor arrays for automated measurements and reactive control with ReacSight [J].
Bertaux, Francois ;
Sosa-Carrillo, Sebastian ;
Gross, Viktoriia ;
Fraisse, Achille ;
Aditya, Chetan ;
Furstenheim, Mariela ;
Batt, Gregory .
NATURE COMMUNICATIONS, 2022, 13 (01)
[6]  
Chen JT, 2023, Arxiv, DOI arXiv:2310.06646
[7]   Parallel Nonfunctionalization of CK1δ/ε Kinase Ohnologs Following a Whole-Genome Duplication Event [J].
Evans-Yamamoto, Daniel ;
Dube, Alexandre K. ;
Saha, Gourav ;
Plante, Samuel ;
Bradley, David ;
Gagnon-Arsenault, Isabelle ;
Landry, Christian R. .
MOLECULAR BIOLOGY AND EVOLUTION, 2023, 40 (12)
[8]   Barcode fusion genetics-protein-fragment complementation assay (BFG-PCA): tools and resources that expand the potential for binary protein interaction discovery [J].
Evans-Yamamoto, Daniel ;
Rouleau, Francois D. ;
Nanda, Piyush ;
Makanae, Koji ;
Liu, Yin ;
Despres, Philippe C. ;
Matsuo, Hitoshi ;
Seki, Motoaki ;
Dube, Alexandre K. ;
Ascencio, Diana ;
Yachie, Nozomu ;
Landry, Christian R. .
NUCLEIC ACIDS RESEARCH, 2022, 50 (09) :E54
[9]   A guideline for reporting experimental protocols in life sciences [J].
Giraldo, Olga ;
Garcia, Alexander ;
Corcho, Oscar .
PEERJ, 2018, 6
[10]  
Hu Mengzhou, 2023, Res Sq, DOI 10.21203/rs.3.rs-3270331/v1