Evaluating the positive predictive value of code-based identification of cirrhosis and its complications utilizing GPT-4

Authors:

Far, Aryana T. [1]

Bastani, Asal [1]

Lee, Albert [2,3]

Gologorskaya, Oksana [2,3]

Huang, Chiung-Yu [4]

Pletcher, Mark J. [4]

Lai, Jennifer C. [1]

Ge, Jin [1]

Affiliations:

[1] Univ Calif San Francisco, Dept Med, Div Gastroenterol & Hepatol, San Francisco, CA 94143 USA

[2] Univ Calif San Francisco, Acad Res Serv, San Francisco, CA 94143 USA

[3] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA 94143 USA

[4] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA 94143 USA

Source:

HEPATOLOGY | 2024

Keywords:

cirrhosis; cohort identification; large language models (LLMs); natural language processing; CHRONIC LIVER-DISEASE; DATABASES; PROGNOSIS; OUTCOMES; CHILD; MODEL;

DOI:

10.1097/HEP.0000000000001115

Chinese Library Classification (CLC) number:

R57 [Diseases of the digestive system and abdomen]

Abstract:

Background and Aims: Diagnosis code classification is a common method for cohort identification in cirrhosis research, but it is often inaccurate and augmented by labor-intensive chart review. Natural language processing using large language models (LLMs) is a potentially more accurate method. To assess LLMs' potential for cirrhosis cohort identification, we compared code-based versus LLM-based classification with chart review as a "gold standard."

Approach and Results: We extracted and conducted a limited chart review of 3788 discharge summaries of cirrhosis admissions. We engineered zero-shot prompts using Generative Pre-trained Transformer 4 (GPT-4) to determine whether cirrhosis and its complications were active hospitalization problems. We calculated positive predictive values (PPVs) of LLM-based classification versus limited chart review and PPVs of code-based versus LLM-based classification as a "silver standard" in all 3788 summaries. Compared to gold standard chart review, code-based classification achieved PPVs of 82.2% for identifying cirrhosis, 41.7% for hepatic encephalopathy (HE), 72.8% for ascites, 59.8% for gastrointestinal bleeding, and 48.8% for spontaneous bacterial peritonitis. Compared to the chart review, GPT-4 achieved accuracies of 87.8%-98.8% for identifying cirrhosis and its complications. Using the LLM as a silver standard, code-based classification achieved PPVs of 79.8% for identifying cirrhosis, 53.9% for HE, 55.3% for ascites, 67.6% for gastrointestinal bleeding, and 65.5% for spontaneous bacterial peritonitis.

Conclusions: LLM-based classification was highly accurate versus manual chart review in identifying cirrhosis and its complications. This allowed us to assess the performance of code-based classification at scale using LLMs as a silver standard. These results suggest LLMs could augment or replace code-based cohort classification and raise questions regarding the necessity of chart review.
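For readers unfamiliar with the metric, the PPV reported in the abstract is the share of code-flagged admissions that the reference standard (chart review or the LLM) also marks as positive: PPV = TP / (TP + FP). The following is a minimal sketch of that calculation only, not the authors' analysis code; the function name `positive_predictive_value` and the toy labels are hypothetical and do not come from the study.

```python
from typing import Sequence


def positive_predictive_value(flagged: Sequence[bool], reference: Sequence[bool]) -> float:
    """Return PPV = TP / (TP + FP) for paired per-record labels.

    flagged:   labels from the method under evaluation (e.g. diagnosis codes)
    reference: labels from the reference standard (e.g. chart review or an LLM)
    """
    if len(flagged) != len(reference):
        raise ValueError("label sequences must have the same length")
    # True positives: flagged by the method and confirmed by the reference.
    tp = sum(1 for f, r in zip(flagged, reference) if f and r)
    # False positives: flagged by the method but not confirmed by the reference.
    fp = sum(1 for f, r in zip(flagged, reference) if f and not r)
    if tp + fp == 0:
        raise ValueError("PPV is undefined when nothing is flagged positive")
    return tp / (tp + fp)


# Hypothetical toy labels for six discharge summaries (not study data).
code_based = [True, True, True, True, False, False]
chart_review = [True, True, True, False, True, False]

ppv = positive_predictive_value(code_based, chart_review)
print(f"PPV = {ppv:.1%}")  # PPV = 75.0%
```

Records that the code-based method does not flag never enter the calculation, which is why PPV alone says nothing about cases the codes miss.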

Pages: 12
