A Clustering Hybrid Algorithm for Smart Datasets using Machine Learning

被引:0
|
作者
Amin, Dar Masroof [1 ]
Rai, Munishwar [1 ]
机构
[1] Deemed Univ, MMICT & BM Maharishi Markandeshwar, Mullana 133203, Haryana, India
关键词
Random Forests (RF); Jaccard Similarity ([!text type='JS']JS[!/text]); triangle; smart data; root mean square error; mean absolute error; machine learning; RANDOM FOREST; THINGS IOT; INTERNET;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the field of data science, Machine Learning is treated as sub-field which primarily deals with designing of algorithms which have ability to learn from previous information and make future predictions accordingly. In traditional computational world the Machine Learning was generally performed on highly performance servers and machines. The implementation of these concepts on Big Data analytics algorithms has high potential and is still in its early stages. So far as machine learning is concerned, performance measure is an important parameter to evaluate the overall functionality of the algorithms. The data set is a different entity and the measuring of performance on a data which is unseen is also called as test set, and training set is a Data set which is training itself. The Data Mining is extensively using learning algorithms for data analysis and to formulate future predications based on archived data. The research presented provides a step forward to make smart data sets out of training data set by evaluating machine learning algorithm. The research presented a novel hybrid algorithm that attempts to incorporate the feature of similarities in Random Forest machine learning algorithm for improving the classification accuracy and efficiency of working.
引用
收藏
页码:165 / 172
页数:8
相关论文
共 50 条
  • [1] Machine learning algorithm for clustering of heart disease and chemoinformatics datasets
    Balaji, K.
    Lavanya, K.
    Mary, A. Geetha
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 143
  • [2] Clustering of mixed datasets using deep learning algorithm
    Balaji, K.
    Lavanya, K.
    Mary, A. Geetha
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 204
  • [3] A novel hybrid machine learning algorithm for detection in smart cities
    Jiang, Jiahui
    Han, Shuangxu
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 99
  • [4] Forecasting Obsolescence of Components by Using a Clustering-Based Hybrid Machine-Learning Algorithm
    Moon, Kyoung-Sook
    Lee, Hee Won
    Kim, Hee Jean
    Kim, Hongjoong
    Kang, Jeehoon
    Paik, Won Chul
    SENSORS, 2022, 22 (09)
  • [5] A Hybrid Optimal Clustering Algorithm for Datasets with Missing Attributes
    Zhang, Shengnan
    Ren, Jia
    Zhang, Liyong
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 315 - 319
  • [6] A Hybrid Machine Learning Methodology for Imbalanced Datasets
    Lipitakis, Anastasia-Dimitra
    Kotsiantis, Sotirios
    5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 252 - +
  • [7] Enhancing Machine Learning Training Performance in Smart Agriculture Datasets Using a Mobile App
    Zarymkanov, Temirlan
    Kargar, Amin
    Pinotti, Cristina M.
    O'Flynn, Brendan
    Zorbas, Dimitrios
    PROCEEDINGS OF 2023 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR AGRICULTURE AND FORESTRY, METROAGRIFOR, 2023, : 455 - 460
  • [8] Machine Learning Applications on Agricultural Datasets for Smart Farm Enhancement
    Balducci, Fabrizio
    Impedovo, Donato
    Pirlo, Giuseppe
    MACHINES, 2018, 6 (03)
  • [9] Contemplation of Machine Learning Algorithm under Distinct Datasets
    Shah, Kushagra
    Chaturvedi, Pradhyumn
    Jain, Akagra
    2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATION AND TELECOMMUNICATION (ICACAT), 2018,
  • [10] Product pricing solutions using hybrid machine learning algorithm
    Namburu, Anupama
    Selvaraj, Prabha
    Varsha, M.
    INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2024, 20 (03) : 413 - 424