Toward a Literature-Driven Definition of Big Data in Healthcare

被引:117
作者
Baro, Emilie [1 ]
Degoul, Samuel [1 ]
Beuscart, Regis [1 ]
Chazard, Emmanuel [1 ]
机构
[1] Univ Lille, Dept Publ Hlth, EA 2694, 1 Pl Verdun, F-59045 Lille, France
关键词
DATA SCIENCE; MEDICINE; ERA; INFRASTRUCTURE; OPPORTUNITIES; INFORMATICS; COLLECTIONS; CHALLENGES; MANAGEMENT; FUTURE;
D O I
10.1155/2015/639021
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Objective. The aim of this study was to provide a definition of big data in healthcare. Methods. A systematic search of PubMed literature published until May 9, 2014, was conducted. We noted the number of statistical individuals (n) and the number of variables (p) for all papers describing a dataset. These papers were classified into fields of study. Characteristics attributed to big data by authors were also considered. Based on this analysis, a definition of big data was proposed. Results. A total of 196 papers were included. Big data can be defined as datasets with Log(n * p) >= 7. Properties of big data are its great variety and high velocity. Big data raises challenges on veracity, on all aspects of the workflow, on extracting meaningful information, and on sharing information. Big data requires new computational methods that optimize data management. Related concepts are data reuse, false knowledge discovery, and privacy issues. Conclusion. Big data is defined by volume. Big data should not be confused with data reuse: data can be big without being reused for another purpose, for example, in omics. Inversely, data can be reused without being necessarily big, for example, secondary use of Electronic Medical Records (EMR) data.
引用
收藏
页数:9
相关论文
共 79 条
[1]  
Aji Ablimit, 2012, Proc ACM SIGSPATIAL Int Conf Adv Inf, V2012, P309
[2]  
[Anonymous], 2014, Hosp Health Netw, V88, P61
[3]  
[Anonymous], 2012, Glob Adv Health Med, V1, P5, DOI 10.7453/gahmj.2012.1.3.001
[4]  
[Anonymous], 2014, Healthc Financ Manage, V68, P35
[5]  
[Anonymous], COMPUTATIONAL STRUCT
[6]  
[Anonymous], 2012, R LANG ENV STAT COMP
[7]   From the Journal archives: Improving patient outcomes in the era of Big Data [J].
Ansermino, J. Mark .
CANADIAN JOURNAL OF ANESTHESIA-JOURNAL CANADIEN D ANESTHESIE, 2014, 61 (10) :959-962
[8]  
Berger ML, 2014, J COMP EFFECT RES, V3, P167, DOI [10.2217/cer.14.2, 10.2217/CER.14.2]
[9]  
Bonney Steven, 2013, J AHIMA, V84, P62
[10]   Metadata and Annotations for Multi-scale Electrophysiological Data [J].
Bower, Mark R. ;
Stead, Matt ;
Brinkmann, Benjamin H. ;
Dufendach, Kevin ;
Worrell, Gregory A. .
2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, :2811-2814