Improving the accuracy of automated gout flare ascertainment using natural language processing of electronic health records and linked Medicare claims data

被引：2

作者：

Yoshida, Kazuki ^{[1
,2
]}

Cai, Tianrun ^{[1
,2
]}

Bessette, Lily G. ^{[3
]}

Kim, Erin ^{[3
]}

Lee, Su Been ^{[3
]}

Zabotka, Luke E. ^{[3
]}

Sun, Alec ^{[3
]}

Mastrorilli, Julianna M. ^{[3
]}

Oduol, Theresa A. ^{[3
]}

Liu, Jun ^{[3
]}

Solomon, Daniel H. ^{[1
,2
,3
]}

Kim, Seoyoung C. ^{[1
,2
,3
]}

Desai, Rishi J. ^{[2
,3
]}

Liao, Katherine P. ^{[1
,2
,4
]}

机构：

[1] Brigham & Womens Hosp, Dept Med, Div Rheumatol Inflammat & Immun, 75 Francis St, Boston, MA 02115 USA

[2] Harvard Med Sch, Dept Med, Boston, MA 02115 USA

[3] Brigham & Womens Hosp, Dept Med, Div Pharmacoepidemiol & Pharmacoecon, 75 Francis St, Boston, MA 02115 USA

[4] Harvard Med Sch, Dept Biomed Informat, Boston, MA 02115 USA

来源：

PHARMACOEPIDEMIOLOGY AND DRUG SAFETY | 2024年 / 33卷 / 01期

关键词：

gout; natural language processing; AMERICAN-COLLEGE; VALIDATION; DEFINITION;

D O I：

10.1002/pds.5684

中图分类号：

R1 [预防医学、卫生学];

学科分类号：

1004 ; 120402 ;

摘要：

Background: We aimed to determine whether integrating concepts from the notes from the electronic health record (EHR) data using natural language processing (NLP) could improve the identification of gout flares. Methods: Using Medicare claims linked with EHR, we selected gout patients who initiated the urate-lowering therapy (ULT). Patients' 12-month baseline period and on treatment follow-up were segmented into 1-month units. We retrieved EHR notes for months with gout diagnosis codes and processed notes for NLP concepts. We selected a random sample of 500 patients and reviewed each of their notes for the presence of a physician-documented gout flare. Months containing at least 1 note mentioning gout flares were considered months with events. We used 60% of patients to train predictive models with LASSO. We evaluated the models by the area under the curve (AUC) in the validation data and examined positive/negative predictive values (P/NPV). Results: We extracted and labeled 839 months of follow-up (280 with gout flares). The claims-only model selected 20 variables (AUC = 0.69). The NLP concept-only model selected 15 (AUC = 0.69). The combined model selected 32 claims variables and 13 NLP concepts (AUC = 0.73). The claims-only model had a PPV of 0.64 [0.50, 0.77] and an NPV of 0.71 [0.65, 0.76], whereas the combined model had a PPV of 0.76 [0.61, 0.88] and an NPV of 0.71 [0.65, 0.76]. Conclusion: Adding NLP concept variables to claims variables resulted in a small improvement in the identification of gout flares. Our data-driven claims-only model and our combined claims/NLP-concept model outperformed existing rule-based claims algorithms reliant on medication use, diagnosis, and procedure codes.

引用

页数：9

共 50 条

[1] Ascertainment of Delirium Status Using Natural Language Processing From Electronic Health Records
Fu, Sunyang
Lopes, Guilherme S.
Pagali, Sandeep R.
Thorsteinsdottir, Bjoerg
LeBrasseur, Nathan K.
Wen, Andrew
Liu, Hongfang
Rocca, Walter A.
Olson, Janet E.
St Sauver, Jennifer
Sohn, Sunghwan
JOURNALS OF GERONTOLOGY SERIES A-BIOLOGICAL SCIENCES AND MEDICAL SCIENCES, 2022, 77 (03): : 524 - 530
[2] Ascertainment of Veterans With Metastatic Prostate Cancer in Electronic Health Records: Demonstrating the Case for Natural Language Processing
Alba, Patrick R.
Gao, Anthony
Lee, Kyung Min
Anglin-Foote, Tori
Robison, Brian
Katsoulakis, Evangelia
Rose, Brent S.
Efimova, Olga
Ferraro, Jeffrey P.
Patterson, Olga V.
Shelton, Jeremy B.
Duvall, Scott L.
Lynch, Julie A.
JCO CLINICAL CANCER INFORMATICS, 2021, 5 : 1005 - 1014
[3] Using Natural Language Processing to Predict Risk in Electronic Health Records
Duy Van Le
Montgomery, James
Kirkby, Kenneth
Scanlan, Joel
MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 574 - 578
[4] Neural Natural Language Processing for unstructured data in electronic health records: A review
Li, Irene
Pan, Jessica
Goldwasser, Jeremy
Verma, Neha
Wong, Wai Pan
Nuzumlali, Muhammed Yavuz
Rosand, Benjamin
Li, Yixin
Zhang, Matthew
Chang, David
Taylor, R. Andrew
Krumholz, Harlan M.
Radev, Dragomir
COMPUTER SCIENCE REVIEW, 2022, 46
[5] Natural Language Processing to Improve Prediction of Incident Atrial Fibrillation Using Electronic Health Records
Ashburner, Jeffrey M.
Chang, Yuchiao
Wang, Xin
Khurshid, Shaan
Anderson, Christopher D.
Dahal, Kumar
Weisenfeld, Dana
Cai, Tianrun
Liao, Katherine P.
Wagholikar, Kavishwar B.
Murphy, Shawn N.
Atlas, Steven J.
Lubitz, Steven A.
Singer, Daniel E.
JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2022, 11 (15):
[6] Using Natural Language Processing on Electronic Health Records to Enhance Detection and Prediction of Psychosis Risk
Irving, Jessica
Patel, Rashmi
Oliver, Dominic
Colling, Craig
Pritchard, Megan
Broadbent, Matthew
Baldwin, Helen
Stahl, Daniel
Stewart, Robert
Fusar-Poli, Paolo
SCHIZOPHRENIA BULLETIN, 2021, 47 (02) : 405 - 414
[7] Automated Extraction of Pain Symptoms: A Natural Language Approach using Electronic Health Records
Dave, Amisha D.
Ruano, Gualberto
Kost, Jonathan
Wang, Xiaoyan
PAIN PHYSICIAN, 2022, 25 (02) : E245 - E254
[8] Natural language processing to identify lupus nephritis phenotype in electronic health records
Deng, Yu
Pacheco, Jennifer A.
Ghosh, Anika
Chung, Anh
Mao, Chengsheng
Smith, Joshua C.
Zhao, Juan
Wei, Wei-Qi
Barnado, April
Dorn, Chad
Weng, Chunhua
Liu, Cong
Cordon, Adam
Yu, Jingzhi
Tedla, Yacob
Kho, Abel
Ramsey-Goldman, Rosalind
Walunas, Theresa
Luo, Yuan
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 22 (SUPPL 2)
[9] Automated derivation of diagnostic criteria for lung cancer using natural language processing on electronic health records: a pilot study
Houston, Andrew
Williams, Sophie
Ricketts, William
Gutteridge, Charles
Tackaberry, Chris
Conibear, John
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
[10] Extracting social determinants of health from electronic health records using natural language processing: a systematic review
Patra, Braja G.
Sharma, Mohit M.
Vekaria, Veer
Adekkanattu, Prakash
Patterson, Olga, V
Glicksberg, Benjamin
Lepow, Lauren A.
Ryu, Euijung
Biernacka, Joanna M.
Furmanchuk, Al'ona
George, Thomas J.
Hogan, William
Wu, Yonghui
Yang, Xi
Bian, Jiang
Weissman, Myrna
Wickramaratne, Priya
Mann, J. John
Olfson, Mark
Campion, Thomas R., Jr.
Weiner, Mark
Pathak, Jyotishman
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (12) : 2716 - 2727

← 1 2 3 4 5 →