A review of data mining using big data in health informatics

被引:65
作者
Herland M. [1 ]
Khoshgoftaar T.M. [1 ]
Wald R. [1 ]
机构
[1] Florida Atlantic University, 777 Glades Road, Boca Raton, FL
关键词
Big data; Bioinformatics; Clinical informatics; Health informatics; Neuroinformatics; Public health informatics; Social media;
D O I
10.1186/2196-1115-1-2
中图分类号
学科分类号
摘要
The amount of data produced within Health Informatics has grown to be quite vast, and analysis of this Big Data grants potentially limitless possibilities for knowledge to be gained. In addition, this information can improve the quality of healthcare offered to patients. However, there are a number of issues that arise when dealing with these vast quantities of data, especially how to analyze this data in a reliable manner. The basic goal of Health Informatics is to take in real world medical data from all levels of human existence to help advance our understanding of medicine and medical practice. This paper will present recent research using Big Data tools and approaches for the analysis of Health Informatics data gathered at multiple levels, including the molecular, tissue, patient, and population levels. In addition to gathering data at multiple levels, multiple levels of questions are addressed: human-scale biology, clinical-scale, and epidemic-scale. We will also analyze and examine possible future work for each of these areas, as well as how combining data from each level may provide the most promising approach to gain the most knowledge in Health Informatics. © 2014, Herland et al.; licensee Springer.
引用
收藏
相关论文
共 73 条
[1]  
Chen J., Qian F., Yan W., Shen B., Translational biomedical informatics in the cloud: present and future, BioMed Res Int 2013, (2013)
[2]  
Martin M., Big Cdata/social media combo poised to advance healthcare, HPC Source, pp. 33-35, (2013)
[3]  
Demchenko Y., Zhao Z., Grosso P., Wibisono A., de Laat C., Addressing Big Data challenges for Scientific Data Infrastructure, IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom 2012), pp. 614-617, (2012)
[4]  
Huan J.L., Pai V., Teredesai A.M., Yu S.E., IEEE Workshop on BigData In Bioinformatics and Health Care Informatics, (2013)
[5]  
Yuan Q., Nsoesie E.O., Lv B., Peng G., Chunara R., Brownstein J.S., Monitoring influenza epidemics in China with search query from Baidu, PLoS ONE, 8, 5, (2013)
[6]  
Haferlach T., Kohlmann A., Wieczorek L., Basso G., Kronnie G.T., Bene M.C., De Vos J., Hernandez J.M., Hofmann W.K., Mills K.I., Gilkes A., Chiaretti S., Shurtleff S.A., Kipps T.J., Rassenti L.Z., Yeoh A.E., Papenhausen P.R., Wm L., Williams P.M., Fo R., Clinical utility of microarray-based gene expression profiling in the diagnosis and subclassification of leukemia: report from the international microarray innovations in leukemia study group, J Clin Oncol, 28, 15, pp. 2529-2537, (2010)
[7]  
Salazar R., Roepman P., Capella G., Moreno V., Simon I., Dreezen C., Lopez-Doriga A., Santos C., Marijnen C., Westerga J., Bruin S., Kerr D., Kuppen P., van de Velde C., Morreau H., Van Velthuysen L., Glas A.M., Van't Veer L.J., Tollenaar R., Gene expression signature to improve prognosis prediction of stage II and III colorectal cancer, J Clin Oncol, 29, pp. 17-24, (2011)
[8]  
Annese J., The importance of combining MRI and large-scale digital histology in neuroimaging studies of brain connectivity and disease, Front Neuroinform, 6, (2012)
[9]  
Van Essen D.C., Smith S.M., Barch D.M., Behrens T.E., Yacoub E., Ugurbil K., The WU-Minn human connectome project: an overview, NeuroImage, 80, pp. 62-79, (2013)
[10]  
Yoshida H., Kawaguchi A., Tsuruya K., Radial basis function-sparse partial least squares for application to brain imaging data, Comput Math Methods Med, 2013, (2013)