The MIMIC Code Repository: enabling reproducibility in critical care research

被引:274
作者
Johnson, Alistair E. W. [1 ]
Stone, David J. [2 ]
Celi, Leo A. [1 ,3 ]
Pollard, Tom J. [1 ]
机构
[1] MIT, E25-505,77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Univ Virginia, Sch Med, Charlottesville, VA 22908 USA
[3] Beth Israel Deaconess Med Ctr, Boston, MA 02215 USA
基金
美国国家卫生研究院;
关键词
critical care; reproducibility; mimic-iii; data mining; intensive care; electronic health record; INTERNATIONAL CONSENSUS DEFINITIONS; ACUTE PHYSIOLOGY; UNITED-STATES; SEPSIS; SYSTEM; SCORE; EPIDEMIOLOGY; MORTALITY; FAILURE;
D O I
10.1093/jamia/ocx084
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Lack of reproducibility in medical studies is a barrier to the generation of a robust knowledge base to support clinical decision-making. In this paper we outline the Medical Information Mart for Intensive Care (MIMIC) Code Repository, a centralized code base for generating reproducible studies on an openly available critical care dataset. Code is provided to load the data into a relational structure, create extractions of the data, and reproduce entire analysis plans including research studies. Concepts extracted include severity of illness scores, comorbid status, administrative definitions of sepsis, physiologic criteria for sepsis, organ failure scores, treatment administration, and more. Executable documents are used for tutorials and reproduce published studies end-to-end, providing a template for future researchers to replicate. The repository's issue tracker enables community discussion about the data and concepts, allowing users to collaboratively improve the resource. The centralized repository provides a platform for users of the data to interact directly with the data generators, facilitating greater understanding of the data. It also provides a location for the community to collaborate on necessary concepts for research progress and share them with a larger audience. Consistent application of the same code for underlying concepts is a key step in ensuring that research studies on the MIMIC database are comparable and reproducible. By providing open source code alongside the freely accessible MIMIC-III database, we enable end-to-end reproducible analysis of electronic health records.
引用
收藏
页码:32 / 39
页数:8
相关论文
共 33 条
[1]   Epidemiology of severe sepsis in the United States: Analysis of incidence, outcome, and associated costs of care [J].
Angus, DC ;
Linde-Zwirble, WT ;
Lidicker, J ;
Clermont, G ;
Carcillo, J ;
Pinsky, MR .
CRITICAL CARE MEDICINE, 2001, 29 (07) :1303-1310
[2]  
[Anonymous], 2009, MED CARE
[3]  
Baker M, 2016, NATURE, V533, P452, DOI 10.1038/533452a
[4]   Data Authorship as an Incentive to Data Sharing [J].
Bierer, Barbara E. ;
Crosas, Merce ;
Pierce, Heather H. .
NEW ENGLAND JOURNAL OF MEDICINE, 2017, 376 (17) :1684-1687
[5]   Comorbidity measures for use with administrative data [J].
Elixhauser, A ;
Steiner, C ;
Harris, DR ;
Coffey, RN .
MEDICAL CARE, 1998, 36 (01) :8-27
[6]  
Ghassemi Marzyeh, 2017, AMIA Jt Summits Transl Sci Proc, V2017, P82
[7]  
Gruber WH, 2016, HEALTHCARE
[8]   The Association Between Indwelling Arterial Catheters and Mortality in Hemodynamically Stable Patients With Respiratory Failure A Propensity Score Analysis [J].
Hsu, Douglas J. ;
Feng, Mengling ;
Kothari, Rishi ;
Zhou, Hufeng ;
Chen, Kenneth P. ;
Celi, Leo A. .
CHEST, 2015, 148 (06) :1470-1476
[9]   Identifying Patients With Severe Sepsis Using Administrative Claims Patient-Level Validation of the Angus Implementation of the International Consensus Conference Definition of Severe Sepsis [J].
Iwashyna, Theodore J. ;
Odden, Andrew ;
Rohde, Jeffrey ;
Bonham, Catherine ;
Kuhn, Latoya ;
Malani, Preeti ;
Chen, Lena ;
Flanders, Scott .
MEDICAL CARE, 2014, 52 (06) :E39-E43
[10]   MIMIC-III, a freely accessible critical care database [J].
Johnson, Alistair E. W. ;
Pollard, Tom J. ;
Shen, Lu ;
Lehman, Li-wei H. ;
Feng, Mengling ;
Ghassemi, Mohammad ;
Moody, Benjamin ;
Szolovits, Peter ;
Celi, Leo Anthony ;
Mark, Roger G. .
SCIENTIFIC DATA, 2016, 3