A Hybrid Clustering Technique to Improve Big Data Accessibility Based on Machine Learning Approaches

被引:3
作者
Ebadati, E. Omid Mahdi [1 ]
Tabrizi, Mohammad Mortazavi [2 ]
机构
[1] Kharazmi Univ, Dept Math & Comp Sci, 242 Somayeh St, Tehran, Iran
[2] Kharazmi Univ, Dept Knowledge Engn & Decis Sci, 242 Somayeh St, Tehran, Iran
来源
INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, INDIA 2016 | 2016年 / 433卷
关键词
Hybrid clustering; Data mining; k-means algorithm; Genetic algorithm; Machine learning; K-MEANS; ALGORITHM;
D O I
10.1007/978-81-322-2755-7_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data is called to a large or complex data from traditional ones, which is unstructured in many case. Accessing to a specific value in a huge data that is not sorted or organized can be time consuming and require a high processing. With growing of data, clustering can be a most important unsupervised approach that finds a structure for data. In this paper, we demonstrate two approaches to cluster data with high accuracy, and then we sort data by implementing merge sort algorithm finally, we use binary search to find a data value point in a specific range of data. This research presents a high value efficiency combo method in big data by using genetic and k-means. After clustering with k-means total sum of the Euclidean distances is 3.37233e+09 for 4 clusters, and after genetic algorithm this number reduce to 0.0300344 in the best fit. In the second and third stage we show that after this implementation, we can access to a particular data much faster and accurate than other older methods.
引用
收藏
页码:413 / 423
页数:11
相关论文
共 22 条
  • [1] K-Means clustering technique applied to availability of micro hydro power
    Adhau, S. P.
    Moharil, R. M.
    Adhau, P. G.
    [J]. SUSTAINABLE ENERGY TECHNOLOGIES AND ASSESSMENTS, 2014, 8 : 191 - 201
  • [2] Hierarchical kernel spectral clustering
    Alzate, Carlos
    Suykens, Johan A. K.
    [J]. NEURAL NETWORKS, 2012, 35 : 21 - 30
  • [3] [Anonymous], 2002, UCI MACHINE LEARNING
  • [4] Binary clustering
    Barthelemy, Jean-Pierre
    Brucker, Francois
    [J]. DISCRETE APPLIED MATHEMATICS, 2008, 156 (08) : 1237 - 1250
  • [5] Ebadati E. O. M., 2014, CIENC TEC SCI TECHNO, V29, P9
  • [6] Implementation of Two Stages k-Means Algorithm to Apply a Payment System Provider Framework in Banking Systems
    Ebadati, Omid Mahdi E.
    Babaie, Sara Sadat
    [J]. ARTIFICIAL INTELLIGENCE PERSPECTIVES AND APPLICATIONS (CSOC2015), 2015, 347 : 203 - 213
  • [7] Advancing big data for humanitarian needs
    Fadiya, Samson Oluwaseun
    Saydam, Serdar
    Zira, Vanduhe Vany
    [J]. HUMANITARIAN TECHNOLOGY: SCIENCE, SYSTEMS AND GLOBAL IMPACT 2014, (HUMTECH2014), 2014, 78 : 88 - 95
  • [8] Smartphone image clustering
    Garcia Villalba, Luis Javier
    Sandoval Orozco, Ana Lucila
    Rosales Corripio, Jocelin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (04) : 1927 - 1940
  • [9] Gupta R., 2012, Big Data Analytics, P42, DOI DOI 10.1007/978-3-642-35542-4_
  • [10] The rise of "big data" on cloud computing: Review and open research issues
    Hashem, Ibrahim Abaker Targio
    Yaqoob, Ibrar
    Anuar, Nor Badrul
    Mokhtar, Salimah
    Gani, Abdullah
    Khan, Samee Ullah
    [J]. INFORMATION SYSTEMS, 2015, 47 : 98 - 115