Evaluating the positive predictive value of code-based identification of cirrhosis and its complications utilizing GPT-4

被引:0
作者
Far, Aryana T. [1 ]
Bastani, Asal [1 ]
Lee, Albert [2 ,3 ]
Gologorskaya, Oksana [2 ,3 ]
Huang, Chiung-Yu [4 ]
Pletcher, Mark J. [4 ]
Lai, Jennifer C. [1 ]
Ge, Jin [1 ]
机构
[1] Univ Calif San Francisco, Dept Med, Div Gastroenterol & Hepatol, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Acad Res Serv, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA 94143 USA
[4] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA 94143 USA
关键词
cirrhosis; cohort identification; large language models (LLMs); natural language processing; CHRONIC LIVER-DISEASE; DATABASES; PROGNOSIS; OUTCOMES; CHILD; MODEL;
D O I
10.1097/HEP.0000000000001115
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background and Aims:Diagnosis code classification is a common method for cohort identification in cirrhosis research, but it is often inaccurate and augmented by labor-intensive chart review. Natural language processing using large language models (LLMs) is a potentially more accurate method. To assess LLMs' potential for cirrhosis cohort identification, we compared code-based versus LLM-based classification with chart review as a "gold standard."Approach and Results:We extracted and conducted a limited chart review of 3788 discharge summaries of cirrhosis admissions. We engineered zero-shot prompts using a Generative Pre-trained Transformer 4 to determine whether cirrhosis and its complications were active hospitalization problems. We calculated positive predictive values (PPVs) of LLM-based classification versus limited chart review and PPVs of code-based versus LLM-based classification as a "silver standard" in all 3788 summaries. Compared to gold standard chart review, code-based classification achieved PPVs of 82.2% for identifying cirrhosis, 41.7% for HE, 72.8% for ascites, 59.8% for gastrointestinal bleeding, and 48.8% for spontaneous bacterial peritonitis. Compared to the chart review, Generative Pre-trained Transformer 4 achieved 87.8%-98.8% accuracies for identifying cirrhosis and its complications. Using LLM as a silver standard, code-based classification achieved PPVs of 79.8% for identifying cirrhosis, 53.9% for HE, 55.3% for ascites, 67.6% for gastrointestinal bleeding, and 65.5% for spontaneous bacterial peritonitis.Conclusions:LLM-based classification was highly accurate versus manual chart review in identifying cirrhosis and its complications. This allowed us to assess the performance of code-based classification at scale using LLMs as a silver standard. These results suggest LLMs could augment or replace code-based cohort classification and raise questions regarding the necessity of chart review.
引用
收藏
页数:12
相关论文
共 38 条
  • [31] Pennsylvania State University, 2024, Normal approximation formulas
  • [32] Validation of a hierarchical algorithm to define chronic liver disease and cirrhosis etiology in administrative healthcare data
    Philip, George
    Djerboua, Maya
    Carlone, David
    Flemming, Jennifer A.
    [J]. PLOS ONE, 2020, 15 (02):
  • [33] ATHENA: Automatic Text Height Extraction for the Analysis of Text Lines in Old Handwritten Manuscripts
    Pintus, Ruggero
    Yang, Ying
    Rushmeier, Holly
    [J]. ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2015, 8 (01): : 1
  • [34] Powers D. M. W., 2011, J MACH LEARN TECHNOL, V2, P37, DOI DOI 10.48550/ARXIV.2010.16061
  • [35] TRANSECTION OF ESOPHAGUS FOR BLEEDING ESOPHAGEAL VARICES
    PUGH, RNH
    MURRAYLY.IM
    DAWSON, JL
    PIETRONI, MC
    WILLIAMS, R
    [J]. BRITISH JOURNAL OF SURGERY, 1973, 60 (08) : 646 - 649
  • [36] Python.org, Python Release Python 3.10.0
  • [37] Reback Jeff, 2021, Zenodo, DOI [10.5281/ZENODO.5501881, 10.5281/ZENODO.3509134, 10.5281/ZENODO.5574486, 10.5281/ZENODO.5060318, 10.5281/ZENODO.5203279]
  • [38] From Child-Pugh to MELD score and beyond: Taking a walk down memory lane
    Ruf, Andres
    Dirchwolf, Melisa
    Freeman, Richard B.
    [J]. ANNALS OF HEPATOLOGY, 2022, 27 (01)