Identification of potentially undiagnosed patients with nontuberculous mycobacteria lung disease using machine learning applied to primary care data in the UK

Cited by: 23
Authors
Doyle, Orla M. [1]
van der Laan, Roald [2]
Obradovic, Marko [2]
McMahon, Peter [3]
Daniels, Flora [4]
Pitcher, Ashley [5]
Loebinger, Michael R. [6, 7]
Affiliations
[1] IQVIA, Real World Analyt Solut, Predict Analyt, London, England
[2] Insmed Utrecht, Utrecht, Netherlands
[3] IQVIA, Real World Insights, London, England
[4] IQVIA, Real World Insights, Basel, Switzerland
[5] IQVIA, Real World Insights, Copenhagen, Denmark
[6] Royal Brompton & Harefield NHS Fdn Trust, London, England
[7] Imperial Coll London, London, England
Keywords
PULMONARY-DISEASE; FUNCTION DECLINE; MORTALITY; INFECTIONS; GERMANY; RISK; COPD;
DOI
10.1183/13993003.00045-2020
Chinese Library Classification (CLC) number
R56 [Diseases of the respiratory system and chest];
Abstract
Nontuberculous mycobacterial lung disease (NTMLD) is a rare lung disease that is often missed because of a low index of suspicion and a nonspecific clinical presentation. This retrospective study was designed to characterise the prediagnosis features of NTMLD patients in primary care and to assess the feasibility of using machine learning to identify undiagnosed NTMLD patients. IQVIA Medical Research Data (incorporating THIN, a Cegedim Database), a UK primary care electronic medical records database, was used. NTMLD patients were identified between 2003 and 2017 by a diagnosis in primary or secondary care or by a record of an NTMLD treatment regimen. Risk factors and treatments were extracted from the prediagnosis period, guided by the literature and expert clinical opinion. The control population was enriched so that every control had at least one of these features. In total, 741 NTMLD patients and 112 784 control patients were selected. The annual prevalence of NTMLD increased from 2.7 to 5.1 per 100 000 between 2006 and 2016. The most common pre-existing diagnoses among NTMLD patients were COPD and asthma, and the most common treatments were penicillin, macrolides and inhaled corticosteroids. Compared with random testing, machine learning improved detection of patients with NTMLD by almost a thousand-fold, with an AUC of 0.94. The total prevalence of diagnosed and undiagnosed NTMLD in 2016 was estimated at between 9 and 16 per 100 000. This study supports the feasibility of applying machine learning to primary care data to screen for undiagnosed NTMLD patients, and the results indicate that there may be a substantial number of undiagnosed cases of NTMLD in the UK.
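The "improvement over random testing" quoted in the abstract can be read as a lift: the hit rate among patients flagged by a model divided by the hit rate expected from testing people at random, which equals the prevalence. The Python sketch below illustrates that arithmetic only. The function names `per_100k` and `lift_over_random` are hypothetical, the precision value is invented for illustration, and using the diagnosed prevalence of 5.1 per 100 000 as the random-testing baseline is an assumption; the abstract does not say how the authors computed their figure.

```python
def per_100k(cases: int, population: int) -> float:
    """Express a case count as a rate per 100 000 people."""
    if population <= 0:
        raise ValueError("population must be positive")
    return cases / population * 100_000


def lift_over_random(precision: float, prevalence: float) -> float:
    """Hit rate among flagged patients divided by the hit rate of random testing.

    Random testing finds true cases at a rate equal to the prevalence,
    so the lift is simply precision / prevalence.
    """
    if not 0 < prevalence <= 1:
        raise ValueError("prevalence must be in (0, 1]")
    if not 0 <= precision <= 1:
        raise ValueError("precision must be in [0, 1]")
    return precision / prevalence


if __name__ == "__main__":
    # Assumption: baseline taken from the abstract's diagnosed prevalence.
    baseline = 5.1 / 100_000
    # Hypothetical precision among the highest-risk flagged patients.
    illustrative_precision = 0.05
    print(f"lift over random: {lift_over_random(illustrative_precision, baseline):.0f}x")
```

Under these assumptions, a lift close to a thousand-fold would correspond to roughly one true NTMLD case per 20 flagged patients. That is an illustration of scale, not a result reported in the abstract.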
Pages: 11