Calligraphy Character Detection Based on Deep Convolutional Neural Network

被引:2
作者
Peng, Xianlin [1 ]
Kang, Jian [2 ]
Wu, Yinjie [2 ]
Feng, Xiaoyi [3 ]
机构
[1] Northwest Univ, Art Sch, Xian 710127, Peoples R China
[2] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Peoples R China
[3] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 19期
基金
中国国家自然科学基金;
关键词
calligraphy detection; character boundary; HRNet; channel attention; RECOGNITION; ONLINE;
D O I
10.3390/app12199488
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Calligraphy (the special art of drawing characters with a brush specially made by the Chinese) is an integral part of Chinese culture, and detecting Chinese calligraphy characters is highly significant. At present, there are still some challenges in the detection of ancient calligraphy. In this paper, we are interested in the calligraphy character detection problem focusing on the calligraphy character boundary. We chose High-Resolution Net (HRNet) as the calligraphy character feature extraction backbone network to learn reliable high-resolution representations. Then, we used the scale prediction branch and the spatial information prediction branch to detect the calligraphy character region and categorize the calligraphy character and its boundaries. We used the channel attention mechanism and the feature fusion method to improve the detection effectiveness in this process. Finally, we pre-trained with a self-generated calligraphy database and fine-tuned with a real calligraphy database. We set up two groups of ablation studies for comparison, and the comparison results proved the superiority of our method. This paper found that the classification of characters and character boundaries has a certain auxiliary effect on single character detection.
引用
收藏
页数:15
相关论文
共 29 条
[1]  
Ceniza A. M., 2018, J IMAGE GRAPHICS, V6, P44, DOI [10.18178/joig.6.1.44-47, DOI 10.18178/JOIG.6.1.44-47]
[2]  
Chen Y.S., 2016, MACHINE LEARNING CAL
[3]  
Cheriet M, 2007, CHARACTER RECOGNITION SYSTEMS: A GUIDE FOR STUDENTS AND PRACTIONERS, P1, DOI 10.1002/9780470176535
[4]  
Guan LB, 2017, 2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), P26, DOI 10.1109/ICIVC.2017.7984452
[5]   Deep Direct Regression for Multi-Oriented Scene Text Detection [J].
He, Wenhao ;
Zhang, Xu-Yao ;
Yin, Fei ;
Liu, Cheng-Lin .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :745-753
[6]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[7]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[8]   Reading Text in the Wild with Convolutional Neural Networks [J].
Jaderberg, Max ;
Simonyan, Karen ;
Vedaldi, Andrea ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (01) :1-20
[9]   A Survey of Deep Learning-Based Object Detection [J].
Jiao, Licheng ;
Zhang, Fan ;
Liu, Fang ;
Yang, Shuyuan ;
Li, Lingling ;
Feng, Zhixi ;
Qu, Rong .
IEEE ACCESS, 2019, 7 :128837-128868
[10]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324