Health Twitter Big Bata Management with Hadoop Framework

被引:21
作者
Cunha, Joao [1 ]
Silva, Catarina [1 ,2 ]
Antunes, Mario [1 ,3 ]
机构
[1] Polytech Inst Leiria, Sch Technol & Management, Leiria, Portugal
[2] Univ Coimbra, Ctr Informat & Syst, P-3000 Coimbra, Portugal
[3] Univ Porto, INESC TEC, Ctr Res Adv Comp Syst, P-4100 Oporto, Portugal
来源
CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS/INTERNATIONAL CONFERENCE ON PROJECT MANAGEMENT/CONFERENCE ON HEALTH AND SOCIAL CARE INFORMATION SYSTEMS AND TECHNOLOGIES, CENTERIS/PROJMAN / HCIST 2015 | 2015年 / 64卷
关键词
Big Data; Apache Hadoop; Mahout; Healthcare; Twitter;
D O I
10.1016/j.procs.2015.08.536
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media advancements and the rapid increase in volume and complexity of data generated by Internet services are becoming challenging not only technologically, but also in terms of application areas. Performance and availability of data processing are critical factors that need to be evaluated since conventional data processing mechanisms may not provide adequate support. Apache Hadoop with Mahout is a framework to storage and process data at large-scale, including different tools to distribute processing. It has been considered an effective tool currently used by both small and large businesses and corporations, like Google and Facebook, but also public and private healthcare institutions. Given its recent emergence and the increasing complexity of the associated technological issues, a variety of holistic framework solutions have been put forward for each specific application. In this work, we propose a generic functional architecture with Apache Hadoop framework and Mahout for handling, storing and analyzing big data that can be used in different scenarios. To demonstrate its value, we will show its features, advantages and applications on health Twitter data. We show that big health social data can generate important information, valuable both for common users and practitioners. Preliminary results of data analysis on Twitter health data using Apache Hadoop demonstrate the potential of the combination of these technologies. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:425 / 431
页数:7
相关论文
共 12 条
[1]  
Agarwal A., 2012, 24 INT C COMP LING D, V2, P39
[2]  
[Anonymous], 2014, International Journal of Computer Science and Information Technologies
[3]  
[Anonymous], 2009, Hadoop: The Definitive Guide
[4]  
Bian J, 2012, PROCEEDINGS OF THE 2012 INTERNATIONAL WORKSHOP ON SMART HEALTH AND WELLBEING, P25, DOI 10.1145/2389707.2389713
[5]  
Costa J, 2011, LECT NOTES COMPUT SC, V6918, P178, DOI 10.1007/978-3-642-24443-8_20
[6]  
Gopal Ananth, 2013, ENHANCED CLUSTERING
[7]  
Jain E., 2014, 9 INT C IND INF SYST
[8]  
Jones M. L. H., 2013, P 2013 INT ERGONOMIC, P1
[9]  
Prajapati V., 2013, BIG DATA ANAL R HADO
[10]  
Priyanka K., 2014, International Journal of Computer Science and Information Technologies, V5, P5865, DOI DOI 10.1109/ICSSIT46314.2019.8987882