Towards better long-tailed oracle character recognition with adversarial data augmentation

Cited by: 16
Authors
Li, Jing [1, 4]
Wang, Qiu-Feng [1]
Huang, Kaizhu [2]
Yang, Xi [1]
Zhang, Rui [3]
Goulermas, John Y. [4]
Affiliations
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Sci, Suzhou, Peoples R China
[4] Univ Liverpool, Dept Comp Sci, Liverpool, England
Funding
National Natural Science Foundation of China;
Keywords
Oracle character recognition; Long tail; Data imbalance; Data augmentation; Mixup strategy; Generative adversarial networks;
DOI
10.1016/j.patcog.2023.109534
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Deciphering oracle bone script is of great significance to the study of ancient Chinese culture and to archaeology. Although recent studies on oracle character recognition have made substantial progress, they still suffer from long-tailed data distributions, which cause a noticeable performance drop on the tail classes. To mitigate this issue, we propose a generative adversarial framework to augment oracle characters in the problematic classes. In this framework, the generator produces synthetic data through convex combinations of all the available samples in the corresponding classes, and is further optimized through adversarial learning with both the classifier and the discriminator. In addition, we introduce Repatch to generalize samples in the generator. Since tail classes do not have sufficient data for convex combinations, we propose the TailMix mechanism to generate suitable tail class samples from other classes. Experimental results show that our proposed algorithm achieves new state-of-the-art average (total) accuracy of 86.03% (89.46%), 86.54% (93.86%), and 95.22% (96.17%) on the three datasets Oracle-AYNU, OBC306 and Oracle-20K, respectively. (c) 2023 Elsevier Ltd. All rights reserved.
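The phrase "convex combinations of all the available samples" in a class can be illustrated with a minimal sketch. This is not the authors' generator: the function name `convex_combination`, the softmax parameterization of the weights, and the random 8x8 arrays standing in for glyph images are all assumptions made for illustration, and the adversarial training, Repatch, and TailMix components are omitted.

```python
import numpy as np

def convex_combination(samples, logits):
    """Mix all samples of one class using softmax weights.

    Softmax guarantees the weights are non-negative and sum to 1,
    which is what makes the mixture a convex combination.
    (Hypothetical helper, not taken from the paper.)
    """
    samples = np.asarray(samples, dtype=np.float64)  # shape (n, H, W)
    logits = np.asarray(logits, dtype=np.float64)    # shape (n,)
    weights = np.exp(logits - logits.max())          # numerically stable softmax
    weights /= weights.sum()
    mixed = np.tensordot(weights, samples, axes=1)   # weighted sum, shape (H, W)
    return mixed, weights

rng = np.random.default_rng(0)
class_samples = rng.random((5, 8, 8))  # five placeholder 8x8 "images" in [0, 1)
logits = rng.normal(size=5)            # stand-in for weights a generator might learn
synthetic, weights = convex_combination(class_samples, logits)
```

Because the weights form a convex combination, every pixel of `synthetic` lies between the smallest and largest value of that pixel across the class samples, so the synthetic image stays within the range spanned by the real data.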
Pages: 13
Related papers
50 in total
  • [1] BSDA in Visual Recognition: Balanced Semantic Data Augmentation for Long-Tailed Data
    Wang, Yifan
    Huang, Eaven
    Wang, Runan
    Leng, Tuo
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [2] Attentive Feature Augmentation for Long-Tailed Visual Recognition
    Wang, Weiqiu
    Zhao, Zhicheng
    Wang, Pingyu
    Su, Fei
    Meng, Hongying
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 5803 - 5816
  • [3] Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation
    Pan, Haolin
    Guo, Yong
    Yu, Mianjie
    Chen, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4215 - 4230
  • [4] Margin-aware rectified augmentation for long-tailed recognition
    Xiang, Liuyu
    Han, Jungong
    Ding, Guiguang
    PATTERN RECOGNITION, 2023, 141
  • [5] DATA AUGMENTATION FOR LONG-TAILED AND IMBALANCED POLYPHONE DISAMBIGUATION IN MANDARIN
    Zhang, Yang
    Zhang, Haitong
    Lin, Yue
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7137 - 7141
  • [6] Exploiting the Tail Data for Long-Tailed Face Recognition
    Song, Guo
    Liu, Rujie
    Wang, Mengjiao
    Meng, Zhang
    Nie, Shijie
    Lina, Septiana
    Abe, Narishige
    IEEE ACCESS, 2022, 10 : 97945 - 97953
  • [7] Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution
    Li, Jing
    Wang, Qiu-Feng
    Zhang, Rui
    Huang, Kaizhu
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 237 - 251
  • [8] Long-tailed recognition via key attribute learning
    Fu, Yu
    Han, Jungong
    Chang, Xiang
    Chen, Changrui
    Shang, Changjing
    Shen, Qiang
    NEUROCOMPUTING, 2025, 627
  • [9] Inverse Image Frequency for Long-Tailed Image Recognition
    Alexandridis, Konstantinos Panagiotis
    Luo, Shan
    Nguyen, Anh
    Deng, Jiankang
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5721 - 5736
  • [10] Exploring the auxiliary learning for long-tailed visual recognition
    Zhang, Junjie
    Liu, Lingqiao
    Wang, Peng
    Zhang, Jian
    NEUROCOMPUTING, 2021, 449 : 303 - 314