The MIMIC Code Repository: enabling reproducibility in critical care research

被引:274
作者
Johnson, Alistair E. W. [1 ]
Stone, David J. [2 ]
Celi, Leo A. [1 ,3 ]
Pollard, Tom J. [1 ]
机构
[1] MIT, E25-505,77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Univ Virginia, Sch Med, Charlottesville, VA 22908 USA
[3] Beth Israel Deaconess Med Ctr, Boston, MA 02215 USA
基金
美国国家卫生研究院;
关键词
critical care; reproducibility; mimic-iii; data mining; intensive care; electronic health record; INTERNATIONAL CONSENSUS DEFINITIONS; ACUTE PHYSIOLOGY; UNITED-STATES; SEPSIS; SYSTEM; SCORE; EPIDEMIOLOGY; MORTALITY; FAILURE;
D O I
10.1093/jamia/ocx084
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Lack of reproducibility in medical studies is a barrier to the generation of a robust knowledge base to support clinical decision-making. In this paper we outline the Medical Information Mart for Intensive Care (MIMIC) Code Repository, a centralized code base for generating reproducible studies on an openly available critical care dataset. Code is provided to load the data into a relational structure, create extractions of the data, and reproduce entire analysis plans including research studies. Concepts extracted include severity of illness scores, comorbid status, administrative definitions of sepsis, physiologic criteria for sepsis, organ failure scores, treatment administration, and more. Executable documents are used for tutorials and reproduce published studies end-to-end, providing a template for future researchers to replicate. The repository's issue tracker enables community discussion about the data and concepts, allowing users to collaboratively improve the resource. The centralized repository provides a platform for users of the data to interact directly with the data generators, facilitating greater understanding of the data. It also provides a location for the community to collaborate on necessary concepts for research progress and share them with a larger audience. Consistent application of the same code for underlying concepts is a key step in ensuring that research studies on the MIMIC database are comparable and reproducible. By providing open source code alongside the freely accessible MIMIC-III database, we enable end-to-end reproducible analysis of electronic health records.
引用
收藏
页码:32 / 39
页数:8
相关论文
共 33 条
[11]   A New Severity of Illness Scale Using a Subset of Acute Physiology and Chronic Health Evaluation Data Elements Shows Comparable Predictive Accuracy [J].
Johnson, Alistair E. W. ;
Kramer, Andrew A. ;
Clifford, Gari D. .
CRITICAL CARE MEDICINE, 2013, 41 (07) :1711-1718
[12]   The first international consensus conference on continuous renal replacement therapy [J].
Kellum, JA ;
Mehta, RL ;
Angus, DC ;
Palevsky, P ;
Ronco, C .
KIDNEY INTERNATIONAL, 2002, 62 (05) :1855-1863
[13]   Foreword [J].
Eckardt, Kai-Uwe ;
Kasiske, Bertram L. .
KIDNEY INTERNATIONAL, 2009, 76 :S1-S2
[14]   Jupyter Notebooks-a publishing format for reproducible computational workflows [J].
Kluyver, Thomas ;
Ragan-Kelley, Benjamin ;
Perez, Fernando ;
Granger, Brian ;
Bussonnier, Matthias ;
Frederic, Jonathan ;
Kelley, Kyle ;
Hamrick, Jessica ;
Grout, Jason ;
Corlay, Sylvain ;
Ivanov, Paul ;
Avila, Damin ;
Abdalla, Safia ;
Willing, Carol .
POSITIONING AND POWER IN ACADEMIC PUBLISHING: PLAYERS, AGENTS AND AGENDAS, 2016, :87-90
[15]   APACHE 1978-2001: The development of a quality assurance system based on prognosis - Milestones and personal reflections [J].
Knaus, WA .
ARCHIVES OF SURGERY, 2002, 137 (01) :37-41
[16]   THE APACHE-III PROGNOSTIC SYSTEM - RISK PREDICTION OF HOSPITAL MORTALITY FOR CRITICALLY ILL HOSPITALIZED ADULTS [J].
KNAUS, WA ;
WAGNER, DP ;
DRAPER, EA ;
ZIMMERMAN, JE ;
BERGNER, M ;
BASTOS, PG ;
SIRIO, CA ;
MURPHY, DJ ;
LOTRING, T ;
DAMIANO, A ;
HARRELL, FE .
CHEST, 1991, 100 (06) :1619-1636
[17]   The logistic organ dysfunction system - A new way to assess organ dysfunction in the intensive care unit [J].
LeGall, JR ;
Klar, J ;
Lemeshow, S ;
Saulnier, F ;
Alberti, C ;
Artigas, A ;
Teres, D .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1996, 276 (10) :802-810
[18]   A NEW SIMPLIFIED ACUTE PHYSIOLOGY SCORE (SAPS-II) BASED ON A EUROPEAN NORTH-AMERICAN MULTICENTER STUDY [J].
LEGALL, JR ;
LEMESHOW, S ;
SAULNIER, F .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1993, 270 (24) :2957-2963
[19]   Data Sharing [J].
Longo, Dan L. ;
Drazen, Jeffrey M. .
NEW ENGLAND JOURNAL OF MEDICINE, 2016, 374 (03) :276-277
[20]   The epidemiology of sepsis in the United States from 1979 through 2000 [J].
Martin, GS ;
Mannino, DM ;
Eaton, S ;
Moss, M .
NEW ENGLAND JOURNAL OF MEDICINE, 2003, 348 (16) :1546-1554