A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem

Cited by: 9
Authors
Zekic-Susac, Marijana [1]
Pfeifer, Sanja [1]
Sarlija, Natasa [1]
Affiliations
[1] Univ Josip Juraj Strossmayer Osijek, Fac Econ, Osijek, Croatia
Source
BUSINESS SYSTEMS RESEARCH JOURNAL | 2014, Vol. 5, No. 3
Keywords
machine learning; support vector machines; artificial neural networks; CART classification trees; k-nearest neighbour; large-dimensional data; cross-validation;
DOI
10.2478/bsrj-2014-0021
Chinese Library Classification number
F [Economics];
Discipline classification code
02 ;
Abstract
Background: Large-dimensional data modelling often relies on variable reduction methods in the pre-processing and post-processing stages. However, such reduction usually discards information and lowers the accuracy of the model. Objectives: The aim of this paper is to address the high-dimensional classification problem of recognizing the entrepreneurial intentions of students using machine learning methods. Methods/Approach: Four methods were tested on the same dataset: artificial neural networks, CART classification trees, support vector machines, and k-nearest neighbour, in order to compare their efficiency in terms of classification accuracy. The performance of each method was compared on ten subsamples in a 10-fold cross-validation procedure, and the sensitivity and specificity of each model were computed. Results: The artificial neural network model based on a multilayer perceptron yielded a higher classification rate than the models produced by the other methods. The pairwise t-test showed a statistically significant difference between the artificial neural network and the k-nearest neighbour model, while the differences among the other methods were not statistically significant. Conclusions: The tested machine learning methods are able to learn fast and achieve high classification accuracy. However, further improvement can be achieved by testing a few additional methodological refinements of the machine learning methods.
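The evaluation protocol named in the abstract (10-fold cross-validation, sensitivity and specificity, and a pairwise t-test on the per-fold results) can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation: the toy data, the helper names, and the use of two k-nearest-neighbour settings as stand-ins for the four compared methods are all assumptions made only for illustration.

```python
import math
import random
from statistics import mean, stdev


def make_toy_data(n_per_class=100, n_features=20, seed=42):
    """Hypothetical data: two Gaussian classes whose means differ by 0.5 per feature."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            X.append([rng.gauss(0.5 * label, 1.0) for _ in range(n_features)])
            y.append(label)
    return X, y


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle the sample indices and deal them into k nearly equal folds."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def knn_predict(train_X, train_y, x, k):
    """Majority vote among the k nearest training points (Euclidean distance)."""
    nearest = sorted(zip(train_X, train_y), key=lambda p: math.dist(p[0], x))[:k]
    votes_for_1 = sum(label for _, label in nearest)
    return 1 if 2 * votes_for_1 > k else 0


def cross_validate(X, y, predict_fn, folds):
    """Return per-fold accuracies and the pooled out-of-fold predictions."""
    fold_acc, pooled = [], [None] * len(y)
    for test_idx in folds:
        held_out = set(test_idx)
        train_X = [x for i, x in enumerate(X) if i not in held_out]
        train_y = [t for i, t in enumerate(y) if i not in held_out]
        hits = 0
        for i in test_idx:
            pooled[i] = predict_fn(train_X, train_y, X[i])
            hits += pooled[i] == y[i]
        fold_acc.append(hits / len(test_idx))
    return fold_acc, pooled


def sensitivity_specificity(y_true, y_pred):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    sens = tp / (tp + fn) if tp + fn else math.nan
    spec = tn / (tn + fp) if tn + fp else math.nan
    return sens, spec


def paired_t_statistic(a, b):
    """Paired t statistic on matched scores: mean(d) / (sd(d) / sqrt(n))."""
    d = [u - v for u, v in zip(a, b)]
    sd = stdev(d)
    return mean(d) / (sd / math.sqrt(len(d))) if sd > 0 else math.nan


if __name__ == "__main__":
    X, y = make_toy_data()
    folds = k_fold_indices(len(X), k=10)  # same folds for both models
    results = {}
    for k in (1, 7):  # stand-ins for the compared methods
        acc, pooled = cross_validate(
            X, y, lambda tx, ty, x, k=k: knn_predict(tx, ty, x, k), folds
        )
        sens, spec = sensitivity_specificity(y, pooled)
        results[k] = acc
        print(f"k={k}: accuracy={mean(acc):.3f} sens={sens:.3f} spec={spec:.3f}")
    # With 10 folds the statistic has 9 degrees of freedom.
    print("paired t:", paired_t_statistic(results[7], results[1]))
```

Both models are scored on identical folds, which is what makes the per-fold accuracies paired. Sensitivity and specificity are computed here on the pooled out-of-fold predictions rather than per fold, a design choice that avoids undefined values when a small fold happens to contain only one class.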
Pages: 82-96
Number of pages: 15