Noisy-Aware Unsupervised Domain Adaptation for Scene Text Recognition

Cited by: 0
Authors
Liu, Xiao-Qian [1]
Zhang, Peng-Fei [2]
Luo, Xin [1]
Huang, Zi [2]
Xu, Xin-Shun [1]
Affiliations
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Univ Queensland, Sch Elect Engn & Comp Sci, Brisbane, Qld 4072, Australia
Funding
National Natural Science Foundation of China
Keywords
Text recognition; domain adaptation; entropy; noisy-aware; consistency regularization; NETWORK;
DOI
10.1109/TIP.2024.3492705
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405
Abstract
Unsupervised Domain Adaptation (UDA) has shown promise in Scene Text Recognition (STR) by facilitating knowledge transfer from labeled synthetic text (source) to more challenging unlabeled real scene text (target). However, existing UDA-based STR methods rely entirely on the pseudo-labels of target samples and ignore the impact of domain gaps (inter-domain noise) and varied natural environments (intra-domain noise), which results in poor pseudo-label quality. In this paper, we propose a novel noisy-aware unsupervised domain adaptation framework tailored for STR, which aims to enhance model robustness against both inter- and intra-domain noise, thereby providing more precise pseudo-labels for target samples. Concretely, we propose reweighting target pseudo-labels by estimating the entropy of refined probability distributions, which mitigates the impact of domain gaps on pseudo-labels. Additionally, a decoupled triple-P-N consistency matching module is proposed, which leverages data augmentation to increase data diversity, enhancing model robustness in diverse natural environments. Within this module, we design a low-confidence-based character negative learning scheme, which is decoupled from high-confidence-based positive learning, thus improving sample utilization when target samples are scarce. Furthermore, we extend our framework to the more challenging Source-Free UDA (SFUDA) setting, where only a pre-trained source model is available for adaptation, with no access to source data. Experimental results on benchmark datasets demonstrate the effectiveness of our framework. Under the SFUDA setting, our method exhibits faster convergence and superior performance with less training data than previous UDA-based STR methods. Our method surpasses representative STR methods, establishing new state-of-the-art results across multiple datasets.
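The abstract does not give the formulas behind the entropy-based reweighting or the character negative learning, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes that each target sample comes with one probability distribution per predicted character, that a sample's weight is one minus the mean normalized entropy of those distributions, and that negative learning penalizes probability mass on classes below a threshold. The function names and the `neg_threshold` parameter are hypothetical.

```python
import math


def normalized_entropy(probs):
    """Shannon entropy of one distribution, scaled to [0, 1] by log(num_classes)."""
    n = len(probs)
    if n < 2:
        return 0.0
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(n)


def pseudo_label_weight(char_probs):
    """Assumed weighting: 1 minus the mean per-character normalized entropy.

    char_probs is a list of per-character class distributions for one sample.
    Confident (low-entropy) predictions get weights near 1, uncertain ones near 0.
    """
    if not char_probs:
        return 0.0
    mean_h = sum(normalized_entropy(p) for p in char_probs) / len(char_probs)
    return 1.0 - mean_h


def negative_learning_loss(probs, neg_threshold=0.05, eps=1e-12):
    """Assumed negative-learning term for one character.

    Classes whose probability falls below neg_threshold are treated as
    negative labels, and the loss is the mean of -log(1 - p) over them.
    """
    negatives = [p for p in probs if p < neg_threshold]
    if not negatives:
        return 0.0
    return -sum(math.log(max(1.0 - p, eps)) for p in negatives) / len(negatives)


if __name__ == "__main__":
    confident = [[0.97, 0.01, 0.01, 0.01], [0.94, 0.02, 0.02, 0.02]]
    uncertain = [[0.25, 0.25, 0.25, 0.25], [0.4, 0.3, 0.2, 0.1]]
    print("confident weight:", pseudo_label_weight(confident))
    print("uncertain weight:", pseudo_label_weight(uncertain))
    print("negative loss:", negative_learning_loss(confident[0]))
```

In a training loop, a weight like this would typically scale the per-sample pseudo-label loss, so that target samples with noisy predictions contribute less to the update.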
Pages: 6550 - 6563
Page count: 14
Related papers
50 in total
  • [11] Unsupervised Domain Adaptation With Class-Aware Memory Alignment
    Wang, Hui
    Zheng, Liangli
    Zhao, Hanbin
    Li, Shijian
    Li, Xi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9930 - 9942
  • [12] Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection
    Guan, Dayan
    Huang, Jiaxing
    Xiao, Aoran
    Lu, Shijian
    Cao, Yanpeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2502 - 2514
  • [13] An Unsupervised Domain Adaptation Being Aware of Domain-specific and Label Information
    Zhang, Yuhong
    Zhang, Qi
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [14] Discrepancy-Aware Collaborative Representation for Unsupervised Domain Adaptation
    Han, Chao
    Zhou, Deyun
    Xie, Yu
    Lei, Yu
    Shi, Jiao
    Gong, Maoguo
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [15] UNSUPERVISED DOMAIN ADAPTATION FOR GENDER-AWARE PLDA MIXTURE MODELS
    Li, Longxin
    Mak, Man-Wai
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5269 - 5273
  • [16] Kurcuma: a kitchen utensil recognition collection for unsupervised domain adaptation
    Rosello, Adrian
    Valero-Mas, Jose J.
    Gallego, Antonio Javier
    Saez-Perez, Javier
    Calvo-Zaragoza, Jorge
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (04) : 1557 - 1569
  • [18] Unsupervised domain adaptation for activity recognition across heterogeneous datasets
    Sanabria, Andrea Rosales
    Ye, Juan
    PERVASIVE AND MOBILE COMPUTING, 2020, 64
  • [19] An unsupervised deep domain adaptation approach for robust speech recognition
    Sun, Sining
    Zhang, Binbin
    Xie, Lei
    Zhang, Yanning
    NEUROCOMPUTING, 2017, 257 : 79 - 87
  • [20] Surface defect identification of cross scene strip based on unsupervised domain adaptation
    Liu K.
    Yang X.-S.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (03) : 477 - 485