Handwritten Character Generation using Y-Autoencoder for Character Recognition Model Training

被引:0
作者
Kitagawa, Tomoki [1 ]
Leow, Chee Siang [1 ,2 ]
Nishizaki, Hiromitsu [1 ]
机构
[1] Univ Yamanashi, Engn & Agr Sci, Integrated Grad Sch Med, 4-3-11 Takeda, Kofu, Yamanashi 4008511, Japan
[2] Artibrains LLC, 3-8-6 Joto, Kofu, Yamanashi 4000861, Japan
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
关键词
handwritten characters generation; optical character recognition (OCR); Y-Autoencoder;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
It is well-known that the deep learning-based optical character recognition (OCR) system needs a large amount of data to train a high-performance character recognizer. However, it is costly to collect a large amount of realistic handwritten characters. This paper introduces a Y-Autoencoder (Y-AE)-based handwritten character generator to generate multiple Japanese Hiragana characters with a single image to increase the amount of data for training a handwritten character recognizer. The adaptive instance normalization (AdaIN) layer allows the generator to be trained and generate handwritten character images without paired-character image labels. The experiment showed that the Y-AE could generate Japanese character images then used to train the handwritten character recognizer, producing an F1-score improved from 0.8664 to 0.9281. We further analyzed the usefulness of the Y-AE-based generator with shape images and out-of-character (OOC) images, which have different character image styles in model training. The result showed that the generator could generate a handwritten image with a similar style to that of the input character.
引用
收藏
页码:7344 / 7351
页数:8
相关论文
共 31 条
[1]  
Aksan E., 2018, P C HUM FACT COMP SY
[2]  
[Anonymous], 2008, Advances in neural information processing systems
[3]  
Bo Chang Q. Z., 2018, P IEEE 2018 WINT C A
[4]   Randaugment: Practical automated data augmentation with a reduced search space [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Shlens, Jonathon ;
Le, Quoc, V .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :3008-3017
[5]  
Electrotechnical Laboratory, 2011, ETL CHAR DAT
[6]   ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation [J].
Fogel, Sharon ;
Averbuch-Elor, Hadar ;
Cohen, Sarel ;
Mazor, Shai ;
Litman, Roee .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4323-4332
[7]  
Gatys L.A., 2015, NEURIPS, DOI DOI 10.1167/16.12.326
[8]  
Glorot X., 2010, Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, P249, DOI DOI 10.1109/LGRS.2016.2565705
[9]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[10]  
Graves A, 2006, P INT C MACH LEARN, P369, DOI DOI 10.1145/1143844.1143891