Comparison of the decision tree, artificial neural network, and linear regression methods based on the number and types of independent variables and sample size

被引:72
|
作者
Kim, Yong Soo [1 ]
机构
[1] SK Telecom, CI Div, Seoul 100999, South Korea
关键词
data mining; statistical method; artificial neural network; decision tree; linear regression;
D O I
10.1016/j.eswa.2006.12.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, the performance of data mining and statistical techniques was empirically compared while varying the number of independent variables, the types of independent variables, the number of classes of the independent variables, and the sample size. Our study employed 60 simulated examples, with artificial neural networks and decision trees as the data mining techniques, and linear regression as the statistical method. In the performance study, we use the RMSE value as the metric and come up with some additional findings: (i) for continuous independent variables, a statistical technique (i.e., linear regression) was superior to data mining (i.e., decision tree and artificial neural network) regardless of the number of variables and the sample size; (ii) for continuous and categorical independent variables, linear regression was best when the number of categorical variables was one, while the artificial neural network was superior when the number of categorical variables was two or more; (iii) the artificial neural network performance improved faster than that of the other methods as the number of classes of categorical variable increased. (C) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1227 / 1234
页数:8
相关论文
共 50 条
  • [41] Development of performance-based models for green concrete using multiple linear regression and artificial neural network
    Singh, Priyanka
    Adebanjo, Abiola
    Shafiq, Nasir
    Razak, Siti Nooriza Abd
    Kumar, Vicky
    Farhan, Syed Ahmad
    Adebanjo, Ifeoluwa
    Singh, Archisha
    Dixit, Saurav
    Singh, Subhav
    Sergeevna, Meshcheryakova Tatyana
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2024, 18 (05): : 2945 - 2956
  • [42] Improving precipitation estimates for Turkey with multimodel ensemble: a comparison of nonlinear artificial neural network method with linear methods
    Mesta, Buket
    Akgun, O. Burak
    Kentel, Elcin
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (17): : 10219 - 10238
  • [43] APPLICATION OF ARTIFICIAL NEURAL NETWORK AND LOGISTIC REGRESSION METHODS TO LANDSLIDE SUSCEPTIBILITY MAPPING AND COMPARISON OF THE RESULTS FOR THE ULUS DISTRICT, BARTIN
    Eker, Arif Mert
    Dikmen, Mehmet
    Cambazoglu, Selim
    Duzgun, Sebnem H. S. B.
    Akgun, Haluk
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2012, 27 (01): : 163 - 173
  • [44] Comparison of artificial neural network and linear regression models for prediction of ring spun yarn properties. II. Prediction of yarn hairiness and unevenness
    Mustafa E. Üreyen
    Pelin Gürkan
    Fibers and Polymers, 2008, 9 : 92 - 96
  • [45] Comparison of artificial neural network and linear regression models for prediction of ring spun yarn properties. I. Prediction of yarn tensile properties
    Mustafa E. Üreyen
    Pelin Gürkan
    Fibers and Polymers, 2008, 9 : 87 - 91
  • [46] Comparison of artificial neural network and linear regression models for prediction of ring spun yarn properties.: II.: Prediction of yarn hairiness and unevenness
    Ureyen, Mustafa E.
    Gurkan, Pelin
    FIBERS AND POLYMERS, 2008, 9 (01) : 92 - 96
  • [47] Comparison of artificial neural network and linear regression models for prediction of ring spun yarn properties.: I.: Prediction of yarn tensile properties
    Ureyen, Mustafa E.
    Gurkan, Pelin
    FIBERS AND POLYMERS, 2008, 9 (01) : 87 - 91
  • [48] CUSTOMER LOYALTY EVALUATION AND PREDICTION BASED ON DECISION TREE AND ARTIFICIAL NEURAL NETWORK: CASE OF OFOGH KOOROSH STORES IN TEHRAN
    Haghighi, Amirreza Estakhrian
    Shirazi, Abdolreza
    INTERNATIONAL TRANSACTION JOURNAL OF ENGINEERING MANAGEMENT & APPLIED SCIENCES & TECHNOLOGIES, 2020, 11 (05):
  • [49] Comparison of artificial neural network and decision tree models in estimating spatial distribution of snow depth in a semi-arid region of Iran
    Gharaei-Manesh, Samaneh
    Fathzadeh, Ali
    Taghizadeh-Mehrjardi, Ruhollah
    COLD REGIONS SCIENCE AND TECHNOLOGY, 2016, 122 : 26 - 35
  • [50] Comparison of QSAR models based on combinations of genetic algorithm, stepwise multiple linear regression, and artificial neural network methods to predict Kd of some derivatives of aromatic sulfonamides as carbonic anhydrase II inhibitors
    Afshin Maleki
    Hiua Daraei
    Loghman Alaei
    Aram Faraji
    Russian Journal of Bioorganic Chemistry, 2014, 40 : 61 - 75