Optical Character Recognition with Chinese and Korean Character Decomposition

被引:2
作者
Chang, Chun Chieh [1 ,2 ]
Arora, Ashish [1 ,2 ]
Perera, Leibny Paola Garcia [1 ]
Etter, David [2 ]
Povey, Daniel [1 ,2 ]
Khudanpur, Sanjeev [1 ,2 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21218 USA
来源
2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW), VOL 5 | 2019年
关键词
Optical Character Recognition; Handwriting Recognition; Character Decomposition; Chinese OCR; Korean OCR;
D O I
10.1109/ICDARW.2019.40094
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present our work on Optical Character Recognition on Chinese and Korean Characters for line level transcriptions. One challenge for recognizing Chinese and Korean is that there are thousands of characters for a system to recognize. In addition, many uncommon characters only appear a couple of times in training. We use character decomposition methods to break characters into smaller constituent graphemes. CangJie is used for Chinese character decomposition and Korean Jamo is used for Korean character decomposition. Character decomposition reduces the size of the Neural Network models and allows training examples to be shared across uncommon characters with the same graphemes. We report that a CNN-TDNN neural network model using character decomposition has significantly fewer parameters than the baseline while also improving character error rate.
引用
收藏
页码:134 / 139
页数:6
相关论文
共 30 条
[1]  
[Anonymous], INTERSPEECH 2015
[2]  
[Anonymous], 2008, Springer Handbook of Speech Processing, DOI DOI 10.1007/978-3-540-49127-9_28
[3]  
[Anonymous], 2018, Manuscript in preparation
[4]  
[Anonymous], INTERSPEECH 2018
[5]  
[Anonymous], 2011, WORKSH AUT SPEECH RE
[6]  
Arora A., 2019, ICDAR 2019 UNPUB
[7]  
Bluche Theodore, 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), P530, DOI 10.1109/ICFHR.2016.0103
[8]  
Eisele Andreas, 2010, P 7 INT C LANG RES E
[9]  
Etter D., ICDAR 2019 UNPUB
[10]  
Jin G., PH CORPUS