GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console

Cited by: 1
Authors
Nath, Anindita [1]
Mwesigwa, Savannah [1]
Dai, Yulin [1]
Jiang, Xiaoqian [2]
Zhao, Zhongming [1, 3]
Affiliations
[1] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, McWilliams Sch Biomed Informat, 7000 Fannin St Suite 600, Houston, TX 77030 USA
[2] Univ Texas Hlth Sci Ctr Houston, McWilliams Sch Biomed Informat, Dept Hlth Data Sci & Artificial Intelligence, Houston, TX 77030 USA
[3] UTHlth Grad Sch Biomed Sci, MD Anderson Canc Ctr, Houston, TX 77030 USA
DOI
10.1093/bioinformatics/btae500
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge. Introducing GENEVIC, an AI-driven chat framework that tackles this challenge by bridging the gap between genetic data generation and biomedical knowledge discovery. Leveraging generative AI, notably ChatGPT, it serves as a biologist's "copilot." It automates the analysis, retrieval, and visualization of customized domain-specific genetic information, and integrates functionalities to generate protein interaction networks, enrich gene sets, and search scientific literature from PubMed, Google Scholar, and arXiv, making it a comprehensive tool for biomedical research. In its pilot phase, GENEVIC is assessed using a curated database that ranks genetic variants associated with Alzheimer's disease, schizophrenia, and cognition, based on their effect weights from the Polygenic Score (PGS) Catalog, thus enabling researchers to prioritize genetic variants in complex diseases. GENEVIC's operation is user-friendly, accessible without any specialized training, secured by Azure OpenAI's HIPAA-compliant infrastructure, and evaluated for its efficacy through real-time query testing. As a prototype, GENEVIC is set to advance genetic research, enabling informed biomedical decisions.

Availability and implementation: GENEVIC is publicly accessible at https://genevicanath2024.streamlit.app. The underlying code is open-source and available via GitHub at https://github.com/bsml320/GENEVIC.git (also at https://github.com/anath2110/GENEVIC.git).
Pages: 5