Cloud based framework for diagnosis of diabetes mellitus using K-means clustering

被引:82
作者
Shakeel P.M. [1 ]
Baskar S. [2 ]
Dhulipala V.R.S. [3 ]
Jaber M.M. [4 ]
机构
[1] Faculty of Information and Communication Technology, Universiti Teknikal Malaysia Melaka, Durian Tunggal
[2] Department of ECE, Karpagam Academy of Higher Education, Coimbatore
[3] Department of Physics, Anna University, BIT-Campus, Tiruchirappalli
[4] Dijlah University College, Baghdad
关键词
Cloud computing; Clustering techniques; Diabetes mellitus; Dynamic data; Hadoop;
D O I
10.1007/s13755-018-0054-0
中图分类号
学科分类号
摘要
Diabetes mellitus is a serious health problem affecting the entire population all over the world for many decades. It is a group of metabolic disorder characterized by chronic disease which occurs due to high blood sugar, unhealthy foods, lack of physical activity and also hereditary. The sorts of diabetes mellitus are type1, type2 and gestational diabetes. The type1 appears during childhood and type2 diabetes develop at any age, mostly affects older than 40. The gestational diabetes occurs for pregnant women. According to the statistical report of WHO 79% of deaths occurred in people under the age of 60, due to diabetes. With a specific end goal to deal with the vast volume, speed, assortment, veracity and estimation of information a scalable environment is needed. Cloud computing is an interesting computing model suitable for accommodating huge volume of dynamic data. To overcome the data handling problems this work focused on Hadoop framework along with clustering technique. This work also predicts the occurrence of diabetes under various circumstances which is more useful for the human. This paper also compares the efficiency of two different clustering techniques suitable for the environment. The predicted result is used to diagnose which age group and gender are mostly affected by diabetes. Further some of the attributes such as hyper tension and work nature are also taken into consideration for analysis. © 2018, Springer Nature Switzerland AG.
引用
收藏
相关论文
共 18 条
[1]  
Barakat N.H., Bradley A.P., Barakat N.B.H., Intelligible support vector machines for diagnosis of diabetes mellitus, IEEE Trans Inf Technol BioMed, 14, 4, pp. 1114-1120, (2010)
[2]  
A survey on data-mining technologies for prediction and diagnosis of diabetes, International conference on intelligent computing applications, 978-1-4799-3966-4/14, (2014)
[3]  
Survey on clustering methods: towards fuzzy clustering for big data, International conference of soft computing and pattern recognition, 978-1-4799-5934-1/14, (2014)
[4]  
Han J., Kamber M., Pei J., Data mining: concepts and techniques, (2011)
[5]  
Sivanandini L.D., Raj M.M., A survey on data clustering algorithms based on fuzzy techniques, Int J Sci Res, 2, 4, pp. 246-251, (2013)
[6]  
Fahad A., Alshatri N., Tari Z., Alamri A., Khalil I., Zomaya A.Y., Foufou S., Bouras A., Survey of clustering algorithms for big data: taxonomy and empirical analysis, IEEE Trans Emerg Top Comput, 2, 3, pp. 267-279, (2014)
[7]  
Dharmarajan A., Velmurugan T., Applications of partition based clustering algorithms: a survey, International conference on computational intelligence and computing research, (2013)
[8]  
Kazi A., Kurian D.T., A survey of data clustering techniques, Int J Eng Res Technol, 3, 4, (2014)
[9]  
An efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD international conference on management of data, 73–84, June 01–04, (1998)
[10]  
Vidhya K., Shanmugalakshmi R., Cloud based framework to handle and analyze diabetes data C, Int J Innov Sci Res, 22, 2, pp. 401-407, (2016)