Arabic (Indian) digit handwritten recognition using recurrent transfer deep architecture

被引：20

作者：

Alkhawaldeh, Rami S. ^{[1
]}

机构：

[1] Univ Jordan, Dept Comp Informat Syst, Aqaba 77110, Jordan

来源：

SOFT COMPUTING | 2021年 / 25卷 / 04期

关键词：

Arabic (Indian) handwritten recognition; Deep supervised learning; LSTM; Transfer learning;

D O I：

10.1007/s00500-020-05368-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The rapid volume of digit texts and images motivates researchers to build solid and efficient prediction models to recognize such media. The Arabic language is considered one of the difficult languages regarding the way of writing characters and digits. Recent research focuses on such language for building predictive approaches to recognize written materials. The Arabic (Indian) digit recognition task has been a challenging task and has gained more attention from researchers who build optimal predictive models from historical images that are used in many applications. However, transfer learning approaches exploit deep pre-trained models that could be re-used for similar tasks. So, in this paper, we propose an adapted deep hybrid transfer model developed using two well-known pre-trained convolutional neural networks (CNN) models. These are further adapted by adding recurrent neural networks especially long short-term memory (LSTM) architectures to detect Arabic (Indian) Handwritten Digits (AHD). The CNN model learns the relevant features of Arabic (Indian) digits, while the sequence learning process in the LSTM layers extracts long-term dependence features. The experimental results, using popular datasets, show significant performance obtained by the adapted transfer models with accuracy reached up to 98.92% as well as with precision and recall values at most cases reached to 100% with statisticalttest usingp-value (p <= 0.05) compared to baseline methods.

引用

页码：3131 / 3141

页数：11

共 35 条

[1] Arabic handwritten digit recognition [J].

Abdleazeem, Sherif ;

El-Sherif, Ezzat .

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2008, 11 (03) :127-141

[2]

AlKhateeb JH, 2014, INT CONF COMP SCI, P222, DOI 10.1109/CSIT.2014.6806004

[3] DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network [J].

Alkhawaldeh, Rami S. .

SCIENTIFIC PROGRAMMING, 2019, 2019

[4] NIML: non-intrusive machine learning-based speech quality prediction on VoIP networks [J].

Alkhawaldeh, Rami S. ;

Khawaldeh, Saed ;

Pervaiz, Usama ;

Alawida, Moatsum ;

Alkhawaldeh, Hamzah .

IET COMMUNICATIONS, 2019, 13 (16) :2609-2616

[5] A State-of-the-Art Survey on Deep Learning Theory and Architectures [J].

Alom, Md Zahangir ;

Taha, Tarek M. ;

Yakopcic, Chris ;

Westberg, Stefan ;

Sidike, Paheding ;

Nasrin, Mst Shamima ;

Hasan, Mahmudul ;

Van Essen, Brian C. ;

Awwal, Abdul A. S. ;

Asari, Vijayan K. .

ELECTRONICS, 2019, 8 (03)

[6]

[Anonymous], 2014, WORLD COMPUT SCI INF

[7] Learning Deep Architectures for AI [J].

Bengio, Yoshua .

FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127

[8]

Bergstra J, 2012, J MACH LEARN RES, V13, P281

[9]

Canziani A, 2016, ARXIV

[10]

Chollet F., 2018, DEEP LEARNING PHYTON

← 1 2 3 4 →