Performance Analysis of Feed Forward MLP with various Activation Functions for Handwritten Numerals Recognition

被引:6
作者
Choudhary, Amit [1 ]
Rishi, Rahul [2 ]
Ahlawat, Savita [3 ]
Dhaka, Vijaypal Singh [4 ]
机构
[1] Maharaja Surajmal Inst, Dept Comp Sci, New Delhi, India
[2] TITS, Dept Comp Sci & Engn, Bhiwani, Haryana, India
[3] Maharaja Surajmal Inst Technol, Dept Comp Sci & Engn, New Delhi, India
[4] IMS, Dept Comp Sci, Noida, UP, India
来源
2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 5 | 2010年
关键词
Numeral Recognition; MLP; Backpropagation; Conjugate Gradient Descent; Activation Functions;
D O I
10.1109/ICCAE.2010.5451890
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aim of this paper is to analyze the performance of back-propagation feed-forward algorithm using various different activation functions for the neurons of hidden and output layers. For sample creation, 250 numerals were gathered form 35 people. After binarization, these numerals were clubbed together to form training patterns for the neural network. Network was trained to learn its behavior by adjusting the connection strengths at every iteration. The conjugate gradient descent of each presented training pattern was calculated to identify the minima on the error surface for each training pattern. Experiments were performed by selecting different combinations of two activation functions out of the three activation functions 'logsig', 'tansig' and 'purelin' for the neurons of the hidden and output layers and the results revealed that the percentage recognition accuracy of the neural network was observed to be optimum when 'tansig'-'tansig' combination of activation functions was used for neurons of hidden and output layers.
引用
收藏
页码:852 / 856
页数:5
相关论文
共 6 条
[1]   An overview of character recognition focused on off-line handwriting [J].
Arica, N ;
Yarman-Vural, FT .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2001, 31 (02) :216-233
[2]  
Banashree NP, 2007, PROC WRLD ACAD SCI E, V22, P78
[3]  
Neves R.F.P., 2008, P 15 IWSSIP IN PRESS
[4]  
Ouchtati S., 1997, J COMPUTER SCI, V3, P242
[5]  
SRIHARI S, 1994, P 4 INT WORKSH FRONT, P265
[6]  
Verma BK, 1995, 1995 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS PROCEEDINGS, VOLS 1-6, P2111, DOI 10.1109/ICNN.1995.489003