Analysis of free text in electronic health records for identification of cancer patient trajectories

被引:85
作者
Jensen, Kasper [1 ,2 ]
Soguero-Ruiz, Cristina [3 ]
Mikalsen, Karl Oyvind [4 ]
Lindsetmo, Rolv-Ole [5 ]
Kouskoumvekaki, Irene [6 ]
Girolami, Mark [2 ,7 ,8 ]
Skrovseth, Stein Olav [1 ,4 ]
Augestad, Knut Magne [9 ]
机构
[1] Univ Hosp North Norway, Norwegian Ctr E Hlth Res, Tromso, Norway
[2] Univ Warwick, Dept Stat, Coventry, W Midlands, England
[3] Univ Rey Juan Carlos, Dept Signal Theory & Commun, Mostoles, Spain
[4] UiT Arctic Univ Norway, Dept Math & Stat, Tromso, Norway
[5] Univ Hosp North Norway, Dept Gastrointestinal Surg, Tromso, Norway
[6] Tech Univ Denmark, Dept Syst Biol, Lyngby, Denmark
[7] Imperial Coll London, Dept Math, Exhibit Rd, London, England
[8] British Lib, Alan Turing Inst, 96 Euston Rd, London, England
[9] Akershus Univ Hosp, Dept Gastrointestinal Surg, Oslo, Norway
关键词
RISK-FACTORS; SURGICAL COMPLICATIONS; CLINICAL-DATA; STRATIFICATION; CLASSIFICATION; READMISSIONS; OUTCOMES;
D O I
10.1038/srep46226
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With an aging patient population and increasing complexity in patient disease trajectories, physicians are often met with complex patient histories from which clinical decisions must be made. Due to the increasing rate of adverse events and hospitals facing financial penalties for readmission, there has never been a greater need to enforce evidence-led medical decision-making using available health care data. In the present work, we studied a cohort of 7,741 patients, of whom 4,080 were diagnosed with cancer, surgically treated at a University Hospital in the years 2004-2012. We have developed a methodology that allows disease trajectories of the cancer patients to be estimated from free text in electronic health records (EHRs). By using these disease trajectories, we predict 80% of patient events ahead in time. By control of confounders from 8326 quantified events, we identified 557 events that constitute high subsequent risks (risk > 20%), including six events for cancer and seven events for metastasis. We believe that the presented methodology and findings could be used to improve clinical decision support and personalize trajectories, thereby decreasing adverse events and optimizing cancer treatment.
引用
收藏
页数:12
相关论文
共 55 条
[1]  
Aasen S. E., 2015, 2015AA UMLS MESH NOR
[2]  
Aigner Wolfgang, 2011, Foundations and Trends in Human-Computer Interaction, V5, P207, DOI 10.1561/1100000039
[3]  
[Anonymous], 2022, International Classification of Diseases
[4]  
[Anonymous], 2010, Text Mining: Applications and Theory
[5]  
ATC/DDD Index 2017, INDEX
[6]   LIMITATIONS OF THE APPLICATION OF FOURFOLD TABLE ANALYSIS TO HOSPITAL DATA [J].
BERKSON, J .
BIOMETRICS BULLETIN, 1946, 2 (03) :47-53
[7]   Frequent item set mining [J].
Borgelt, Christian .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (06) :437-456
[8]   Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection [J].
Botsis, Taxiarchis ;
Nguyen, Michael D. ;
Woo, Emily Jane ;
Markatou, Marianthi ;
Ball, Robert .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :631-638
[9]   CIGARETTE-SMOKING AND LUNG-CANCER - A CONTINUING CONTROVERSY [J].
BURCH, PRJ .
MEDICAL HYPOTHESES, 1982, 9 (03) :293-306
[10]   Essential elements of personalized medicine [J].
Burke, Wylie ;
Trinidad, Susan Brown ;
Press, Nancy A. .
UROLOGIC ONCOLOGY-SEMINARS AND ORIGINAL INVESTIGATIONS, 2014, 32 (02) :193-197