A Novel Machine Learning Algorithm to Reduce Prediction Error and Accelerate Learning Curve for Very Large Datasets

Cited by: 1
Authors
Hou, Wenjun [1]
Perkowski, Marek [1]
Affiliations
[1] Portland State Univ, Dept Elect & Comp Engn, Portland, OR 97207 USA
Source
2019 IEEE 49TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL) | 2019
Keywords
Machine Learning; Prediction Accuracy; Learning Curve; Multivalued logic; Supervised Classifier
DOI
10.1109/ISMVL.2019.00025
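Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline Classification Code
0808; 0809
Abstract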
This paper presents a novel machine learning algorithm with improved accuracy and a faster learning curve for very large datasets. Previously, an algorithm using lr-partitions was designed to improve upon C4.5. However, that algorithm leaves a relatively high percentage of attribute-value combinations undefined in its final results, which increases the learning error. In this paper, a new type of clustering algorithm is proposed to generate output values for those undefined combinations, thus accelerating the learning curve and reducing the prediction error by several percentage points on various popular datasets from the UCI Machine Learning Repository.
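The abstract does not describe the clustering algorithm itself, so the following is only a minimal sketch of the general idea of assigning outputs to undefined attribute-value combinations. It substitutes a plain nearest-neighbour majority vote under Hamming distance for the paper's method; the names `hamming` and `fill_undefined` and the toy data are hypothetical and not taken from the paper.

```python
from collections import Counter


def hamming(a, b):
    """Count the attribute positions at which two combinations differ."""
    return sum(x != y for x, y in zip(a, b))


def fill_undefined(defined, query):
    """Assign an output to an unseen combination `query`.

    `defined` maps attribute-value tuples to known outputs. The output is the
    majority label among the defined combinations closest to `query`.
    This is an illustrative stand-in, NOT the paper's clustering algorithm.
    """
    best = min(hamming(combo, query) for combo in defined)
    votes = Counter(
        label for combo, label in defined.items()
        if hamming(combo, query) == best
    )
    return votes.most_common(1)[0][0]


# Hypothetical toy data: three multi-valued attributes, two output classes.
defined = {
    (0, 0, 1): "A",
    (0, 1, 1): "A",
    (2, 1, 0): "B",
}

print(fill_undefined(defined, (0, 0, 0)))  # nearest defined combination is (0, 0, 1)
print(fill_undefined(defined, (2, 0, 0)))  # nearest defined combination is (2, 1, 0)
```

With this toy data, each query has a single nearest defined combination, so the vote is unambiguous; a real implementation would need an explicit tie-breaking rule.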
Pages: 97-101
Page count: 5