Optimization of classification algorithm based on gene expression programming

被引:0
作者
Yang L. [1 ]
Li K. [1 ]
Zhang W. [2 ]
Zheng L. [1 ]
Ke Z. [1 ]
Qi Y. [1 ]
机构
[1] College of Mathematics and Informatics, South China Agricultural University, Guangzhou
[2] Institute of Automation, Chinese Academy of Sciences, Beijing
关键词
Data classification; Data set; Gene expression programming; Optimization;
D O I
10.1007/s12652-017-0563-8
中图分类号
学科分类号
摘要
Data classification is an important task in the field of data mining, which can be used to mine the model of important data and forecast the future trend of those data. Although some breakthroughs have been made in data classification theoretically and technically, there are still some problems, such as lack accuracy of classification modeling algorithm, poor comprehensibility of classification rules and so on. Accuracy improvement and accurate achievement of classification has become hot research topics. Gene expression programming (GEP) has been considered a powerful evolutionary method for data classification. Aiming at the shortage of basic GEP classification algorithm, a novel classification algorithm based on GEP named O_GEPCA has been proposed in this paper. By using this method the initialization and mutation operator adjustment method, calibration set, evolution function and correction strategy will be improved, and the basic GEP classification algorithm will be optimized. The proposed O_GEPCA method shows significantly improvement after comparative study between our proposed O_GEPCA methods and the primitive GEP. The efficiency and capability of our proposed O_GEPCA for data classification will be tested in four well-studied benchmark test cases including card, cancer, heart, glass classification problem demonstrate. © Springer-Verlag GmbH Germany 2017.
引用
收藏
页码:1261 / 1275
页数:14
相关论文
共 23 条
  • [1] Bartczuk L., Gene expression programming in correction modelling of nonlinear dynamic objects, Information Systems Architecture and Technology: Proceedings of 36Th International Conference on Information Systems Architecture and technology–ISAT 2015–Part I, pp. 125-134, (2016)
  • [2] Dey P., Das A.K., A utilization of GEP (gene expression programming) metamodel and PSO (particle swarm optimization) tool to predict and optimize the forced convection around a cylinder, Energy, 95, pp. 447-458, (2016)
  • [3] Ferreira C., Gene expression programming: a new adaptive algorithm for solving problems, Comput Sci, 21, 2, pp. 87-129, (2001)
  • [4] Han J.W., Kamber M., Pei J., Data mining: concepts and techniques, (2012)
  • [5] Hastie T., Tibshirani R., Friedman J., The elements of statistical learning: data mining, inference and prediction, Math Intell, 27, 2, pp. 83-85, (2005)
  • [6] Hu J.J., Tang C.J., Duan L., Et al., The diversification strategy of the initial population of gene expression programming, J comput, 30, 2, pp. 305-310, (2007)
  • [7] Jin D.D., Research and Application of Gene Expression Programming Classification Algorithm Based on Particle Swarm Optimization, (2014)
  • [8] Kotsiantis S.B., Supervised machine learning: A review of classification techniques, Conference on Emerging Artificial Intelligence Applications in Computer Engineering: Real Word AI Systems with Applications in Ehealth, HCI, Information Retrieval and Pervasive Technologies, pp. 3-24, (2007)
  • [9] Lee C.H., Yang C.B., Chen H.H., Taiwan stock investment with gene expression programming, Procedia Comput Sci, 35, pp. 137-146, (2014)
  • [10] Li T.Y., Tang C.J., Wu J., Et al., Adaptive control algorithm for population diversity in gene expression programming, J Univ Electron Sci Technol, 39, 2, pp. 279-283, (2010)