A mixed image segmentation method based on intelligent equipment

被引:0
作者
Qi, Xiangwei [1 ,2 ,4 ]
Ge, Ren [2 ]
Chen, Bingcai [2 ,3 ]
Altenbek, Gulila [1 ,4 ,5 ]
机构
[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Xinjiang, Peoples R China
[2] Xinjiang Normal Univ, Coll Comp Sci & Technol, Urumqi 830054, Xinjiang, Peoples R China
[3] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian 116024, Liaoning, Peoples R China
[4] Base Kazakh & Kirghiz Language Natl Language Reso, Urumqi 830046, Xinjiang, Peoples R China
[5] Xinjiang Lab Multilanguage Informat Technol, Urumqi 830046, Xinjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation method; prior domain knowledge; CNN; character recognition; deep learning;
D O I
10.3233/JCM-214924
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Taking Uyghur character recognition as an example, this paper use a method of image segmentation which combines traditional methods with CNN, and transforms and implements the Uyghur character on intelligent devices. Firstly, after analyzing the application of several general segmentation methods, this paper finds some shortcomings but also some ideas for Uighur segmentation. Then, starting from the characteristics of Uyghur language, such as structure, word formation and input habits, the author studies the idea of Uyghur adhesive language segmentation from the perspective of language characteristics, puts forward the basic algorithm of Uyghur symbol segmentation, and applies the Uyghur character adhesion segmentation based on minimum spanning tree and multi-queue primitive merging model to improve the segmentation efficiency. In addition, in order to solve the limitations of the traditional handwriting recognition framework of "preprocessing + feature extraction + classifier", this paper puts forward a new solution of Uyghur handwriting recognition technology combining prior domain knowledge with CNN, constructs a Uyghur handwriting recognition model on the basis of CNN and random elastic deformation, and also improves the recognition rate of the system by combining domain knowledge with CNN. According to the experimental results, we can conclude that the mixed method proposed in this paper can effectively break the technical bottleneck of traditional methods, thus improving the efficiency of segmentation and recognition.
引用
收藏
页码:1277 / 1291
页数:15
相关论文
共 19 条
[1]   An Enhanced Technique for Offline Arabic Handwritten Words Segmentation [J].
Abdeen, Roqyiah M. ;
Afifi, Ahmed ;
El-Sisi, Ashraf B. .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 :663-681
[2]  
[Anonymous], 2011, 8 INT MULTICONFERENC
[3]   Offline handwritten Amharic word recognition [J].
Assabie, Yaregal ;
Bigun, Josef .
PATTERN RECOGNITION LETTERS, 2011, 32 (08) :1089-1099
[4]  
Boubaker H, 2014, 2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), P71, DOI 10.1109/SOCPAR.2014.7007984
[5]   IESK-ArDB: a database for handwritten Arabic and an optimized topological segmentation approach [J].
Elzobi, Moftah ;
Al-Hamadi, Ayoub ;
Al Aghbari, Zaher ;
Dings, Laslo .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2013, 16 (03) :295-308
[6]   A dynamic programming method for segmentation of online cursive uyghur handwritten words into basic recognizable units [J].
Ibrayim, Mayire ;
Hamdulla, Askar ;
Tursun, Dilmurat .
Journal of Software, 2013, 8 (10) :2535-2540
[7]  
Ibrayim M, 2015, INT J SIGNAL PROCESS, V8, P51, DOI DOI 10.14257/IJSIP.2015.8.6.07
[8]   Towards Arabic Handwritten Word Recognition via Probabilistic Graphical Models [J].
Khemiri, Akram ;
Kacem, Afef ;
Belaid, Abdel .
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, :678-683
[9]   KHATT: An open Arabic offline handwritten text database [J].
Mahmoud, Sabri A. ;
Ahmad, Irfan ;
Al-Khati, Wasfi G. ;
Alshayeb, Mohammad ;
Parvez, Mohammad Tanvir ;
Maergner, Volker ;
Fink, Gernot A. .
PATTERN RECOGNITION, 2014, 47 (03) :1096-1112
[10]  
Maqqor A, 2014, INT CONF MULTIMED, P475, DOI 10.1109/ICMCS.2014.6911316