Comparison of the decision tree, artificial neural network, and linear regression methods based on the number and types of independent variables and sample size

Cited by: 72
Author
Kim, Yong Soo [1]
Affiliation
[1] SK Telecom, CI Div, Seoul 100999, South Korea
Keywords
data mining; statistical method; artificial neural network; decision tree; linear regression;
DOI
10.1016/j.eswa.2006.12.017
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this article, the performance of data mining and statistical techniques was compared empirically while varying the number of independent variables, the types of independent variables, the number of classes of the independent variables, and the sample size. The study employed 60 simulated examples, with artificial neural networks and decision trees as the data mining techniques and linear regression as the statistical method. Using the root mean squared error (RMSE) as the performance metric, the study yielded the following findings: (i) for continuous independent variables, the statistical technique (i.e., linear regression) was superior to the data mining techniques (i.e., decision tree and artificial neural network) regardless of the number of variables and the sample size; (ii) for a mix of continuous and categorical independent variables, linear regression was best when the number of categorical variables was one, while the artificial neural network was superior when the number of categorical variables was two or more; (iii) the performance of the artificial neural network improved faster than that of the other methods as the number of classes of the categorical variables increased. © 2006 Elsevier Ltd. All rights reserved.
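To make the RMSE-based comparison concrete, the following is a minimal sketch rather than the paper's experimental setup. It simulates one continuous independent variable with a linear response, then compares a one-variable least-squares line against a depth-1 regression tree (a "stump") on held-out data. The data-generating coefficients, noise level, sample sizes, and all function names are assumptions chosen for illustration, and the artificial neural network is omitted to keep the example dependency-free.

```python
import math
import random


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def fit_line(x, y):
    """Ordinary least squares with a single predictor; returns a predict function."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum((a - mx) ** 2 for a in x)
    intercept = my - slope * mx
    return lambda v: intercept + slope * v


def fit_stump(x, y):
    """Depth-1 regression tree: a single split chosen to minimise squared error."""
    pairs = sorted(zip(x, y))
    best = None
    for i in range(1, len(pairs)):
        left = [b for _, b in pairs[:i]]
        right = [b for _, b in pairs[i:]]
        ml, mr = sum(left) / len(left), sum(right) / len(right)
        sse = sum((b - ml) ** 2 for b in left) + sum((b - mr) ** 2 for b in right)
        if best is None or sse < best[0]:
            # Split halfway between the two neighbouring x values.
            best = (sse, (pairs[i - 1][0] + pairs[i][0]) / 2, ml, mr)
    _, threshold, ml, mr = best
    return lambda v: ml if v <= threshold else mr


# Simulated data (assumed, not from the paper): y = 2 + 3x + Gaussian noise.
rng = random.Random(0)
x = [rng.uniform(0, 10) for _ in range(300)]
y = [2 + 3 * v + rng.gauss(0, 1) for v in x]
x_train, y_train, x_test, y_test = x[:200], y[:200], x[200:], y[200:]

line = fit_line(x_train, y_train)
stump = fit_stump(x_train, y_train)

rmse_line = rmse(y_test, [line(v) for v in x_test])
rmse_stump = rmse(y_test, [stump(v) for v in x_test])
print(f"linear regression RMSE: {rmse_line:.3f}")
print(f"decision stump RMSE:    {rmse_stump:.3f}")
```

On data generated this way the linear model should show the lower RMSE, which is in the spirit of finding (i); a deeper tree or different simulated data could behave differently.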
Pages: 1227 - 1234
Page count: 8
Related papers
50 in total
  • [21] A comparison of linear regression and neural network methods for predicting excess returns on large stocks
    Desai, VS
    Bharati, R
    ANNALS OF OPERATIONS RESEARCH, 1998, 78 (0) : 127 - 163
  • [23] Comparison of NDT Data Fusion for Concrete Strength using Decision Tree and Artificial Neural Network
    Dauji, Saha
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2023, 82 (08) : 831 - 840
  • [24] MODELING AND COMPARISON OF BONDING STRENGTH OF IMPREGNATED WOOD MATERIAL BY USING DIFFERENT METHODS: ARTIFICIAL NEURAL NETWORK AND MULTIPLE LINEAR REGRESSION
    Akyuz, Ilker
    Ersen, Nadir
    Tiryaki, Sebahattin
    Bayram, Bahadir Cagri
    Akyuz, Kadri Cemil
    Peker, Huseyin
    WOOD RESEARCH, 2019, 64 (03) : 483 - 497
  • [25] Classifying dysmorphic syndromes by using artificial neural network based hierarchical decision tree
    Ozdemir, Merve Erkinay
    Telatar, Ziya
    Erogul, Osman
    Tunca, Yusuf
    AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE, 2018, 41 (02) : 451 - 461
  • [26] Estimation of dynamic properties of sandstones based on index properties using artificial neural network and multivariate linear regression methods
    Alizadeh, Sayed Mehdi
    Iraji, Amin
    Tabasi, Somayeh
    Ahmed, Alim Al Ayub
    Motahari, Mohammad Reza
    ACTA GEOPHYSICA, 2022, 70 (01) : 225 - 242
  • [29] An absolute magnitude deviation of HRV for the prediction of prediabetes with combined artificial neural network and regression tree methods
    Igbe, Tobore
    Li, Jingzhen
    Kandwal, Abhishek
    Omisore, Olatunji Mumini
    Yetunde, Efetobore
    Yuhang, Liu
    Wang, Lei
    Nie, Zedong
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (03) : 2221 - 2244