On Optimization of Traditional Chinese Character Recognition

被引:0
作者
Huang, Yanbo [1 ]
Mondal, Subrota Kumar [1 ]
Cheng, Yuning [1 ]
Wang, Chengwei [1 ]
机构
[1] Macau Univ Sci & Technol, Sch Comp Sci & Engn, Taipa 999078, Macao, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE SERVICES ENGINEERING, SSE 2024 | 2024年
关键词
Text Detection; Text Recognition; Traditional Chinese Character; OCR; Deep Learning; Traditional Chinese; TEXT;
D O I
10.1109/SSE62657.2024.00051
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Text recognition in textures is a pressing need for effective contextual perception in our daily life. In OCR scope, deep learning methods can meet the needs of real-time scenarios. However, characters with different textures, such as printed and handwritten characters leads OCR to deal with different scenarios. To this, in this paper, we focus on the implementation and optimization of text recognition using deep learning methods. In particular, we go with a two-stage OCR approach based on deep learning methods to detect and recognize Traditional Chinese characters including handwritten and printed while improving the accuracy. In particular, DBNet [1] is used in the text detection stage and CRNN [2] (baseline) and ABINet [3] (advanced) models in the text recognition stage. For the ABINet, we modify fusion module by using attention mechanism. For the CRNN, we utillize combined loss function. The experiment results show that The CRNN model with the combined loss function improves nearly 4.2% compared with the baseline CRNN. The ABINet model with new fusion module achieves 92.30% on the Traditional Chinese recognition dataset, improves nearly 0.8% compared to the original ABINet. Notably, we also have our own Traditional Chinese handwritten datasets for text detection and recognition. In fine, we believe that our endeavour can help grow the community in a better way.
引用
收藏
页码:293 / 302
页数:10
相关论文
共 32 条
  • [1] Character Region Awareness for Text Detection
    Baek, Youngmin
    Lee, Bado
    Han, Dongyoon
    Yun, Sangdoo
    Lee, Hwalsuk
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9357 - 9366
  • [2] Chen P.-C., 2020, Traditional chinese handwriting dataset
  • [3] Chen Y.-C., 2021, arXiv
  • [4] AON: Towards Arbitrarily-Oriented Text Recognition
    Cheng, Zhanzhan
    Xu, Yangliu
    Bai, Fan
    Niu, Yi
    Pu, Shiliang
    Zhou, Shuigeng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5571 - 5579
  • [5] Focusing Attention: Towards Accurate Text Recognition in Natural Images
    Cheng, Zhanzhan
    Bai, Fan
    Xu, Yunlu
    Zheng, Gang
    Pu, Shiliang
    Zhou, Shuigeng
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5086 - 5094
  • [6] Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041
  • [7] Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
    Fang, Shancheng
    Xie, Hongtao
    Wang, Yuxin
    Mao, Zhendong
    Zhang, Yongdong
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7094 - 7103
  • [8] fantitxt.com, 2022, About Us
  • [9] Graves A., 2006, P 23 INT C MACH LEAR, P369, DOI DOI 10.1145/1143844.1143891
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778