Chinese Character Image Completion Using a Generative Latent Variable Model

被引:4
作者
Jo, In-su [1 ]
Choi, Dong-bin [1 ]
Park, Young B. [2 ]
机构
[1] Dankook Univ, Dept Comp, Yongin 16890, Gyeonggi Do, South Korea
[2] Dankook Univ, Dept Software Sci, Yongin 16890, Gyeonggi Do, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 02期
关键词
variational autoencoder; class activation map; object removal; image; INFORMATION;
D O I
10.3390/app11020624
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Chinese characters in ancient books have many corrupted characters, and there are cases in which objects are mixed in the process of extracting the characters into images. To use this incomplete image as accurate data, we use image completion technology, which removes unnecessary objects and restores corrupted images. In this paper, we propose a variational autoencoder with classification (VAE-C) model. This model is characterized by using classification areas and a class activation map (CAM). Through the classification area, the data distribution is disentangled, and then the node to be adjusted is tracked using CAM. Through the latent variable, with which the determined node value is reduced, an image from which unnecessary objects have been removed is created. The VAE-C model can be utilized not only to eliminate unnecessary objects but also to restore corrupted images. By comparing the performance of removing unnecessary objects with mask regions with convolutional neural networks (Mask R-CNN), one of the prevalent object detection technologies, and also comparing the image restoration performance with the partial convolution model (PConv) and the gated convolution model (GConv), which are image inpainting technologies, our model is proven to perform excellently in terms of removing objects and restoring corrupted areas.
引用
收藏
页码:1 / 19
页数:19
相关论文
共 41 条
[1]  
Behnke S, 2003, LECT NOTES COMPUT SC, V2766, P1
[2]  
Bernt S., 2018, ARXIV180601911
[3]  
Bishop C. M., 1994, Technical Report No. NCRG/94/004.
[4]  
Cha E.Y., 2015, J KOREA I INF COMMUN, V19, P2924, DOI [10.6109/jkiice.2015.19.12.2924, DOI 10.6109/JKIICE.2015.19.12.2924]
[5]   The devil is in the details: an evaluation of recent feature encoding methods [J].
Chatfield, Ken ;
Lempitsky, Victor ;
Vedaldi, Andrea ;
Zisserman, Andrew .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
[6]  
Chen X, 2016, ADV NEURAL INFORM PR
[7]   Image quality measures and their performance [J].
Eskicioglu, AM ;
Fisher, PS .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1995, 43 (12) :2959-2965
[8]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[9]   Monocular SLAM and Obstacle Removal for Indoor Navigation [J].
Han, Shibo ;
Ahmed, Minhaz Uddin ;
Rhee, Phill Kyu .
2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA ENGINEERING (ICMLDE 2018), 2018, :67-76
[10]  
He K., 2017, ARXIV170306870