A genetic algorithm based feature selection for handwritten digit recognition

被引:1
作者
Ahlawat S. [1 ]
Rishi R. [2 ]
机构
[1] Department of Computer Science and Engineering, Maharaja Surajmal Institute of Technology, New Delhi
[2] Department of Computer Science and Engineering, University Institute of Engineering and Technology, Ma-harshi Dayanand University, Rohtak
关键词
Digit recognition; Feature reduction; Feature selection; Genetic algorithm; Neural networks;
D O I
10.2174/2213275911666181120111342
中图分类号
学科分类号
摘要
Background: The data proliferation has been resulted in large-scale, high dimensional data and brings new challenges for feature selection in handwriting recognition problems. The practical challenges like the large variability and ambiguities present in the individual’s handwriting style demand an optimal feature selection algorithm that would be capable to enhance the recognition accuracy of handwriting recognition system with reduced training efforts and computational cost. Objective: This paper gives emphasis on the feature selection process and proposed a genetic algorithm based feature selection technique for handwritten digit recognition. Method: A hybrid feature set of statistical and geometrical features is developed in order to get the effective feature set consist of local and global characteristics of sample digits. The method utilizes a genetic algorithm based feature selection for selecting best distinguishable features and k-nearest neighbour for evaluating the fitness of features of handwritten digit dataset. Results: The experiments are carried out on standard The Chars74K handwritten digit dataset and reported a 66% reduction in the original feature set without sacrificing the recognition accuracy. Conclusion: The experiment results show the effectiveness of the proposed approach. © 2019 Bentham Science Publishers.
引用
收藏
页码:304 / 316
页数:12
相关论文
共 89 条
[1]  
Liu H., Yu L., Toward integrating feature selection algorithms for classification and clustering, IEEE Transactions on Knowledge and Data Engineering, 17, pp. 491-502, (2005)
[2]  
Goswami S., Das A.K., Chakrabarti A., Chakraborty B., A feature cluster taxonomy based feature selection technique, Expert Systems with Applications, 79, pp. 76-89, (2017)
[3]  
Wang L., Wang Y., Chang Q., Feature selection methods for big data bioinformatics: A survey from the search perspective, Methods, 111, pp. 21-31, (2016)
[4]  
Liu H., Zhao Z., Manipulating data and dimension reduction methods: Feature selection, Encyclopedia of Complexity and Systems Science, pp. 5348-5359, (2009)
[5]  
Liu H., Motoda H., Setiono R., Zhao Z., Feature selection: An ever evolving frontier in data mining, Feature Selection for Data Mining, JMLR Proceedings, 10, pp. 4-13, (2010)
[6]  
Dash M., Liu H., Feature selection for classification, Intelligent Data Analysis, 1, 4, pp. 131-156, (1997)
[7]  
Zhang L.X., Wang J.X., Zhao Y.N., Yang Z.H., A novel hybrid feature selection algorithm: Using ReliefF estimation for GA-Wrapper search, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No. 03EX693), Vol.1, pp. 380-384, (2003)
[8]  
Shen J., Xia J., Zhang X., Jia W., Sliding Block-Based Hybrid Feature Subset Selection in Network Traffic, IEEE Access, 5, pp. 18179-18186, (2017)
[9]  
Song Q., Ni J., Wang G., A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data, IEEE Transactions on Knowledge and Data Engineering, 25, 1, pp. 1-14, (2013)
[10]  
Chen M.Y., Kundu A., Zhou J., Off-line handwritten word recognition using a hidden Markov model type stochastic network, IEEE Trans. Pattern Anal. Mach Intell, 16, 5, pp. 481-496, (1994)