Identification of potentially undiagnosed patients with nontuberculous mycobacteria lung disease using machine learning applied to primary care data in the UK
被引:23
作者:
Doyle, Orla M.
论文数: 0引用数: 0
h-index: 0
机构:
IQVIA, Real World Analyt Solut, Predict Analyt, London, EnglandIQVIA, Real World Analyt Solut, Predict Analyt, London, England
Doyle, Orla M.
[1
]
van der Laan, Roald
论文数: 0引用数: 0
h-index: 0
机构:
Insmed Utrecht, Utrecht, NetherlandsIQVIA, Real World Analyt Solut, Predict Analyt, London, England
van der Laan, Roald
[2
]
Obradovic, Marko
论文数: 0引用数: 0
h-index: 0
机构:
Insmed Utrecht, Utrecht, NetherlandsIQVIA, Real World Analyt Solut, Predict Analyt, London, England
Obradovic, Marko
[2
]
McMahon, Peter
论文数: 0引用数: 0
h-index: 0
机构:
IQVIA, Real World Insights, London, EnglandIQVIA, Real World Analyt Solut, Predict Analyt, London, England
McMahon, Peter
[3
]
Daniels, Flora
论文数: 0引用数: 0
h-index: 0
机构:
IQVIA, Real World Insights, Basel, SwitzerlandIQVIA, Real World Analyt Solut, Predict Analyt, London, England
Daniels, Flora
[4
]
Pitcher, Ashley
论文数: 0引用数: 0
h-index: 0
机构:
IQVIA, Real World Insights, Copenhagen, DenmarkIQVIA, Real World Analyt Solut, Predict Analyt, London, England
Pitcher, Ashley
[5
]
Loebinger, Michael R.
论文数: 0引用数: 0
h-index: 0
机构:
Royal Brompton & Harefield NHS Fdn Trust, London, England
Imperial Coll London, London, EnglandIQVIA, Real World Analyt Solut, Predict Analyt, London, England
Loebinger, Michael R.
[6
,7
]
机构:
[1] IQVIA, Real World Analyt Solut, Predict Analyt, London, England
[2] Insmed Utrecht, Utrecht, Netherlands
[3] IQVIA, Real World Insights, London, England
[4] IQVIA, Real World Insights, Basel, Switzerland
[5] IQVIA, Real World Insights, Copenhagen, Denmark
[6] Royal Brompton & Harefield NHS Fdn Trust, London, England
PULMONARY-DISEASE;
FUNCTION DECLINE;
MORTALITY;
INFECTIONS;
GERMANY;
RISK;
COPD;
D O I:
10.1183/13993003.00045-2020
中图分类号:
R56 [呼吸系及胸部疾病];
学科分类号:
摘要:
Nontuberculous mycobacterial lung disease (NTMLD) is a rare lung disease often missed due to a low index of suspicion and unspecific clinical presentation. This retrospective study was designed to characterise the prediagnosis features of NTMLD patients in primary care and to assess the feasibility of using machine learning to identify undiagnosed NTMLD patients. IQVIA Medical Research Data (incorporating THIN, a Cegedim Database), a UK electronic medical records primary care database was used. NTMLD patients were identified between 2003 and 2017 by diagnosis in primary or secondary care or record of NTMLD treatment regimen. Risk factors and treatments were extracted in the prediagnosis period, guided by literature and expert clinical opinion. The control population was enriched to have at least one of these features. 741 NTMLD and 112 784 control patients were selected. Annual prevalence rates of NTMLD from 2006 to 2016 increased from 2.7 to 5.1 per 100000. The most common pre-existing diagnoses and treatments for NTMLD patients were COPD and asthma and penicillin, macrolides and inhaled corticosteroids. Compared to random testing, machine learning improved detection of patients with NTMLD by almost a thousand-fold with AUC of 0.94. The total prevalence of diagnosed and undiagnosed cases of NTMLD in 2016 was estimated to range between 9 and 16 per 100000. This study supports the feasibility of machine learning applied to primary care data to screen for undiagnosed NTMLD patients, with results indicating that there may be a substantial number of undiagnosed cases of NTMLD in the UK.