Data consistency in the English Hospital Episodes Statistics database

被引:19
作者
Hardy, Flavien [1 ,2 ]
Heyl, Johannes [1 ,2 ]
Tucker, Katie [3 ]
Hopper, Adrian [1 ,4 ]
Marcha, Maria J. [5 ]
Briggs, Tim W. R. [1 ,6 ]
Yates, Jeremy [5 ,7 ]
Day, Jamie [1 ]
Wheeler, Andrew [1 ]
Eve-Jones, Sue [1 ]
Gray, William K. [1 ]
机构
[1] NHS England & NHS Improvement London, Getting It Right First Time, London, England
[2] UCL, Dept Phys & Astron, London, England
[3] Royal Free London NHS Fdn Trust, Innovat & Intelligent Automat Unit, London, England
[4] Guys & St Thomas NHS Fdn Trust, Ageing & Hlth, London, England
[5] Sci & Technol Facil Council Distributed Res Utili, London, England
[6] Royal Natl Orthopaed Hosp NHS Trust, Stanmore, Middx, England
[7] UCL, Dept Comp Sci, London, England
基金
英国科学技术设施理事会;
关键词
information technology; SURGERY; AREAS;
D O I
10.1136/bmjhci-2022-100633
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background To gain maximum insight from large administrative healthcare datasets it is important to understand their data quality. Although a gold standard against which to assess criterion validity rarely exists for such datasets, internal consistency can be evaluated. We aimed to identify inconsistencies in the recording of mandatory International Statistical Classification of Diseases and Related Health Problems, tenth revision (ICD-10) codes within the Hospital Episodes Statistics dataset in England. Methods Three exemplar medical conditions where recording is mandatory once diagnosed were chosen: autism, type II diabetes mellitus and Parkinson's disease dementia. We identified the first occurrence of the condition ICD-10 code for a patient during the period April 2013 to March 2021 and in subsequent hospital spells. We designed and trained random forest classifiers to identify variables strongly associated with recording inconsistencies. Results For autism, diabetes and Parkinson's disease dementia respectively, 43.7%, 8.6% and 31.2% of subsequent spells had inconsistencies. Coding inconsistencies were highly correlated with non-coding of an underlying condition, a change in hospital trust and greater time between the spell with the first coded diagnosis and the subsequent spell. For patients with diabetes or Parkinson's disease dementia, the code recording for spells without an overnight stay were found to have a higher rate of inconsistencies. Conclusions Data inconsistencies are relatively common for the three conditions considered. Where these mandatory diagnoses are not recorded in administrative datasets, and where clinical decisions are made based on such data, there is potential for this to impact patient care.
引用
收藏
页数:8
相关论文
共 30 条
[1]   Big data in digital healthcare: lessons learnt and recommendations for general practice [J].
Agrawal, Raag ;
Prabakaran, Sudhakaran .
HEREDITY, 2020, 124 (04) :525-534
[2]   The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) Statement [J].
Benchimol, Eric I. ;
Smeeth, Liam ;
Guttmann, Astrid ;
Harron, Katie ;
Moher, David ;
Petersen, Irene ;
Sorensen, Henrik T. ;
von Elm, Erik ;
Langan, Sinead M. .
PLOS MEDICINE, 2015, 12 (10)
[3]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[4]  
Flach PA, 2015, ADV NEUR IN, V28
[5]   Development and validation of a Hospital Frailty Risk Score focusing on older people in acute care settings using electronic hospital records: an observational study [J].
Gilbert, Thomas ;
Neuburger, Jenny ;
Kraindler, Joshua ;
Keeble, Eilis ;
Smith, Paul ;
Ariti, Cono ;
Arora, Sandeepa ;
Street, Andrew ;
Parker, Stuart ;
Roberts, Helen C. ;
Bardsley, Martin ;
Conroy, Simon .
LANCET, 2018, 391 (10132) :1775-1782
[6]   Safety of day-case paediatric tonsillectomy in England: an analysis of administrative data for the Getting It Right First Time programme [J].
Gray, W. K. ;
Takhar, A. S. ;
Navaratnam, A., V ;
Day, J. ;
Swart, M. ;
Snowden, C. ;
Briggs, T. W. R. ;
Marshall, A. .
ANAESTHESIA, 2022, 77 (03) :277-285
[7]   Changes in COVID-19 in-hospital mortality in hospitalised adults in England over the first seven months of the pandemic: An observational study using administrative data [J].
Gray, William K. ;
Navaratnam, Annakan, V ;
Day, Jamie ;
Wendon, Julia ;
Briggs, Tim W. R. .
LANCET REGIONAL HEALTH-EUROPE, 2021, 5
[8]   Volume-Outcome Relationships in Elective Abdominal Aortic Aneurysm Surgery: Analysis of the UK Hospital Episodes Statistics Database for the Getting It Right First Time (GIRFT) Programme [J].
Gray, William K. ;
Day, Jamie ;
Horrocks, Michael .
EUROPEAN JOURNAL OF VASCULAR AND ENDOVASCULAR SURGERY, 2020, 60 (04) :509-517
[9]   Collider bias undermines our understanding of COVID-19 disease risk and severity [J].
Griffith, Gareth J. ;
Morris, Tim T. ;
Tudball, Matthew J. ;
Herbert, Annie ;
Mancano, Giulia ;
Pike, Lindsey ;
Sharp, Gemma C. ;
Sterne, Jonathan ;
Palmer, Tom M. ;
Smith, George Davey ;
Tilling, Kate ;
Zuccolo, Luisa ;
Davies, Neil M. ;
Hemani, Gibran .
NATURE COMMUNICATIONS, 2020, 11 (01)
[10]   Data Resource Profile: Hospital Episode Statistics Admitted Patient Care (HES APC) [J].
Herbert, Annie ;
Wijlaars, Linda ;
Zylbersztejn, Ania ;
Cromwell, David ;
Hardelid, Pia .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2017, 46 (04) :1093-+