An End-to-End Generation Model for Chinese Calligraphy Characters Based on Dense Blocks and Capsule Network

被引:0
作者
Zhang, Weiqi [1 ]
Sun, Zengguo [1 ,2 ]
Wu, Xiaojun [1 ,2 ]
机构
[1] Shaanxi Normal Univ, Sch Comp Sci, Xian 710119, Peoples R China
[2] Minist Culture & Tourism, Key Lab Intelligent Comp & Serv Technol Folk Song, Xian 710119, Peoples R China
基金
中国国家自然科学基金;
关键词
calligraphy generation; generative adversarial network; capsule network; self-attention; perceptual loss; IMAGE TRANSLATION;
D O I
10.3390/electronics13152983
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Chinese calligraphy is a significant aspect of traditional culture, as it involves the art of writing Chinese characters. Despite the development of numerous deep learning models for generating calligraphy characters, the resulting outputs often suffer from issues related to stroke accuracy and stylistic consistency. To address these problems, an end-to-end generation model for Chinese calligraphy characters based on dense blocks and a capsule network is proposed. This model aims to solve issues such as redundant and broken strokes, twisted and deformed strokes, and dissimilarity with authentic ones. The generator of the model employs self-attention mechanisms and densely connected blocks to reduce redundant and broken strokes. The discriminator, on the other hand, consists of a capsule network and a fully connected network to reduce twisted and deformed strokes. Additionally, the loss function includes perceptual loss to enhance the similarity between the generated calligraphy characters and the authentic ones. To demonstrate the validity of the proposed model, we conducted comparison and ablation experiments on the datasets of Yan Zhenqing's regular script, Deng Shiru's clerical script, and Wang Xizhi's running script. The experimental results show that, compared to the comparison model, the proposed model improves SSIM by 0.07 on average, reduces MSE by 1.95 on average, and improves PSNR by 0.92 on average, which proves the effectiveness of the proposed model.
引用
收藏
页数:21
相关论文
共 47 条
  • [1] TPE-GAN: Thumbnail Preserving Encryption Based on GAN With Key
    Chai, Xiuli
    Wang, Yinjing
    Chen, Xiuhui
    Gan, Zhihua
    Zhang, Yushu
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 972 - 976
  • [2] Chen Kai, 2023, ICIGP '23: Proceedings of the 2023 6th International Conference on Image and Graphics Processing, P104, DOI 10.1145/3582649.3582682
  • [3] DCAMCP: A deep learning model based on capsule network and attention mechanism for molecular carcinogenicity prediction
    Chen, Zhe
    Zhang, Li
    Sun, Jianqiang
    Meng, Rui
    Yin, Shuaidong
    Zhao, Qi
    [J]. JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2023, 27 (20) : 3117 - 3126
  • [4] Choi Y, 2020, PROC CVPR IEEE, P8185, DOI 10.1109/CVPR42600.2020.00821
  • [5] Gao YM, 2020, AAAI CONF ARTIF INTE, V34, P646
  • [6] DenseNet-II: an improved deep convolutional neural network for melanoma cancer detection
    Girdhar, Nancy
    Sinha, Aparna
    Gupta, Shivang
    [J]. SOFT COMPUTING, 2023, 27 (18) : 13285 - 13304
  • [7] CycleGAN With an Improved Loss Function for Cell Detection Using Partly Labeled Images
    He, Jin
    Wang, Cong
    Jiang, Dan
    Li, Zhuo
    Liu, Yangyi
    Zhang, Tao
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (09) : 2473 - 2480
  • [8] He L., 2015, Orchid Pavilion Preface and Its Cultural Significance in Calligraphy
  • [9] Hu C, 2010, Academics, V7, P164
  • [10] Huang Yuge, 2020, ECCV