Chinese Character Recognition Using Non-negative Matrix Factorization

Cited by: 0
Authors
Voon, Chen Huey [1]
Shin, Ker [1]
Shean, Ng Wei [1]
Affiliations
[1] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Dept Math & Actuarial Sci, Jalan Sungai Long, Kajang 43000, Selangor, Malaysia
Source
JURNAL KEJURUTERAAN | 2024 / Vol. 36 / Issue 02
Keywords
Chinese characters recognitions; matrix factorizations; non-negative matrix factorization;
DOI
10.17576/jkukm-2024-36(2)-24
Chinese Library Classification
T [Industrial Technology];
Discipline classification code
08;
Abstract
Non-negative matrix factorization (NMF) was introduced by Paatero and Tapper in 1994 as a general way of reducing the dimension of a matrix with non-negative entries. NMF is useful in many data analysis applications, such as character recognition and text mining. This paper studies the application of NMF to Chinese character recognition. Python was used to carry out the LU factorization and the NMF of Chinese characters represented as Boolean matrices. Preliminary analysis confirmed the choice of data size [...] and of [...] and [...] for the NMF of the Boolean matrix. One hundred printed Chinese characters were selected and grouped into ten categories according to their number of strokes [...], for [...]. The Euclidean distance between the Boolean matrix of each Chinese character and the matrix obtained after LU factorization, and likewise after NMF, was calculated for further analysis. A paired t-test confirmed that NMF factorizes Chinese characters in Boolean matrix form better than LU factorization does. Finally, ten handwritten Chinese characters were selected to test whether the program can match handwritten characters to their printed counterparts. Experimental results showed that 70% of the characters were recognized by taking the least Euclidean distance. NMF is suitable for Chinese character recognition because it reduces the dimension of the image, and the error between the original Boolean matrix and its NMF reconstruction is less than 5%.
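The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' program: the 16 x 16 grid, the rank of 4, the Lee and Seung multiplicative updates, the two synthetic glyphs (simple shapes resembling 十 and 口), and every function name below are assumptions chosen for illustration, since the record does not state the actual data size, rank, or solver. The sketch factorizes each Boolean template as V ≈ W H, stores the reconstruction, and labels a noisy query by the least Euclidean (Frobenius) distance.

```python
import numpy as np

def nmf(V, rank, iters=500, seed=0, eps=1e-9):
    """Approximate a non-negative matrix V as W @ H.

    Uses Lee-Seung multiplicative updates for the Frobenius objective
    (an assumed solver; the record does not say which one was used).
    """
    V = np.asarray(V, dtype=float)
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + eps
    H = rng.random((rank, V.shape[1])) + eps
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def relative_error(V, approx):
    """Frobenius-norm error of approx relative to the norm of V."""
    return np.linalg.norm(V - approx) / np.linalg.norm(V)

def recognize(query, references):
    """Return the label whose reference matrix is nearest to query."""
    return min(references, key=lambda k: np.linalg.norm(query - references[k]))

def cross_glyph(n=16):
    """Synthetic Boolean glyph shaped like 十 (hypothetical test data)."""
    g = np.zeros((n, n))
    g[n // 2, :] = 1
    g[:, n // 2] = 1
    return g

def box_glyph(n=16):
    """Synthetic Boolean glyph shaped like 口 (hypothetical test data)."""
    g = np.zeros((n, n))
    g[[0, -1], :] = 1
    g[:, [0, -1]] = 1
    return g

# "Printed" templates, each stored as its rank-4 NMF reconstruction.
templates = {"shi": cross_glyph(), "kou": box_glyph()}
recon = {}
for label, V in templates.items():
    W, H = nmf(V, rank=4)
    recon[label] = W @ H

# A "handwritten" query: the cross with one stray pixel and one gap.
query = cross_glyph()
query[2, 3] = 1
query[8, 0] = 0

best = recognize(query, recon)
errors = {k: relative_error(templates[k], recon[k]) for k in templates}
```

Here `best` holds the label of the nearest template and `errors` holds the relative reconstruction error per glyph. Storing W (16 x 4) and H (4 x 16) takes 128 numbers instead of 256, which is the kind of dimension reduction the abstract refers to.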
Pages: 653-660
Number of pages: 8