Towards better long-tailed oracle character recognition with adversarial data augmentation

被引:22
作者
Li, Jing [1 ,4 ]
Wang, Qiu-Feng [1 ]
Huang, Kaizhu [2 ]
Yang, Xi [1 ]
Zhang, Rui [3 ]
Goulermas, John Y. [4 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Sci, Suzhou, Peoples R China
[4] Univ Liverpool, Dept Comp Sci, Liverpool, England
基金
中国国家自然科学基金;
关键词
Oracle character recognition; Long tail; Data imbalance; Data augmentation; Mixup strategy; Generative adversarial networks;
D O I
10.1016/j.patcog.2023.109534
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deciphering oracle bone script is of great significance to the study of ancient Chinese culture as well as archaeology. Although recent studies on oracle character recognition have made substantial progress, they still suffer from the long-tailed data situation that results in a noticeable performance drop on the tail classes. To mitigate this issue, we propose a generative adversarial framework to augment oracle characters in the problematic classes. In this framework, the generator produces synthetic data through convex combinations of all the available samples in the corresponding classes, and is further optimized through adversarial learning with the classifier and simultaneously the discriminator. Meanwhile, we in-troduce Repatch to generalize samples in the generator. Since tail classes do not have sufficient data for convex combinations, we propose the TailMix mechanism to generate suitable tail class samples from other classes. Experimental results show that our proposed algorithm obtains remarkable performance in oracle character recognition and achieves new state-of-the-art average (total) accuracy with 86.03% (89.46%), 86.54% (93.86%), 95.22% (96.17%) on the three datasets Oracle-AYNU, OBC306 and Oracle-20K, respectively.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
[41]   A Study of Data Augmentation for Handwritten Character Recognition Using Deep Learning [J].
Hayashi, Taihei ;
Gyohten, Keiji ;
Ohki, Hidehiro ;
Takami, Toshiya .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :552-557
[42]   Glyph-Based Data Augmentation for Accurate Kanji Character Recognition [J].
Ofusa, Kenichiro ;
Miyazaki, Tomo ;
Sugaya, Yoshihiro ;
Omachi, Shinichiro .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :597-602
[43]   Data Augmentation using Conditional Generative Adversarial Networks for Robust Speech Recognition [J].
Sheng, Peiyao ;
Yang, Zhuolin ;
Hu, Hu ;
Tan, Tian ;
Qian, Yanmin .
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, :121-125
[44]   DMRS: Long-tailed remote sensing recognition via semantic-aware mixing and diversity experts [J].
Wang, Yifan ;
Zhang, Fan ;
Zhao, Qihao ;
Hu, Wei ;
Ma, Fei .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2025, 141
[45]   Bangla Handwritten Character Recognition using Convolutional Neural Network with Data Augmentation [J].
Chowdhury, Rumman Rashid ;
Hossain, Mohammad Shahadat ;
Ul Islam, Raihan ;
Andersson, Karl ;
Hossain, Sazzad .
2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, :318-323
[46]   Improving Oracle Bone Characters Recognition via A CycleGAN-Based Data Augmentation Method [J].
Wang, Wei ;
Zhang, Ting ;
Zhao, Yiwen ;
Jin, Xinxin ;
Mouchere, Harold ;
Yu, Xinguo .
NEURAL INFORMATION PROCESSING, ICONIP 2022, PT VI, 2023, 1793 :88-100
[47]   Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition [J].
Eltay, Mohamed ;
Zidouri, Abdelmalek ;
Ahmad, Irfan ;
Elarian, Yousef .
PEERJ COMPUTER SCIENCE, 2022, 8
[48]   Data Augmentation for EEG-Based Emotion Recognition Using Generative Adversarial Networks [J].
Bao, Guangcheng ;
Yan, Bin ;
Tong, Li ;
Shu, Jun ;
Wang, Linyuan ;
Yang, Kai ;
Zeng, Ying .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15
[49]   Skeleton-Based Data Augmentation for Sign Language Recognition Using Adversarial Learning [J].
Nakamura, Yuriya ;
Jing, Lei .
IEEE ACCESS, 2025, 13 :15290-15300
[50]   ECMEE: Expert Constrained Multi-Expert Ensembles with Category Entropy Minimization for Long-tailed Visual Recognition [J].
Fu, Yu ;
Shang, Changjing ;
Han, Jungong ;
Shen, Qiang .
NEUROCOMPUTING, 2024, 576