Noisy-Aware Unsupervised Domain Adaptation for Scene Text Recognition

被引:0
作者
Liu, Xiao-Qian [1 ]
Zhang, Peng-Fei [2 ]
Luo, Xin [1 ]
Huang, Zi [2 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Univ Queensland, Sch Elect Engn & Comp Sci, Brisbane, Qld 4072, Australia
基金
中国国家自然科学基金;
关键词
Text recognition; domain adaptation; entropy; noisy-aware; consistency regularization; NETWORK;
D O I
10.1109/TIP.2024.3492705
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised Domain Adaptation (UDA) has shown promise in Scene Text Recognition (STR) by facilitating knowledge transfer from labeled synthetic text (source) to more challenging unlabeled real scene text (target). However, existing UDA-based STR methods fully rely on the pseudo-labels of target samples, which ignores the impact of domain gaps (inter-domain noise) and various natural environments (intra-domain noise), resulting in poor pseudo-label quality. In this paper, we propose a novel noisy-aware unsupervised domain adaptation framework tailored for STR, which aims to enhance model robustness against both inter- and intra-domain noise, thereby providing more precise pseudo-labels for target samples. Concretely, we propose a reweighting target pseudo-labels by estimating the entropy of refined probability distributions, which mitigates the impact of domain gaps on pseudo-labels. Additionally, a decoupled triple-P-N consistency matching module is proposed, which leverages data augmentation to increase data diversity, enhancing model robustness in diverse natural environments. Within this module, we design a low-confidence-based character negative learning, which is decoupled from high-confidence-based positive learning, thus improving sample utilization under scarce target samples. Furthermore, we extend our framework to the more challenging Source-Free UDA (SFUDA) setting, where only a pre-trained source model is available for adaptation, with no access to source data. Experimental results on benchmark datasets demonstrate the effectiveness of our framework. Under the SFUDA setting, our method exhibits faster convergence and superior performance with less training data than previous UDA-based STR methods. Our method surpasses representative STR methods, establishing new state-of-the-art results across multiple datasets.
引用
收藏
页码:6550 / 6563
页数:14
相关论文
共 50 条
  • [21] Adversarial unsupervised domain adaptation for cross scenario waveform recognition
    Wang, Qing
    Du, Panfei
    Liu, Xiaofeng
    Yang, Jingyu
    Wang, Guohua
    SIGNAL PROCESSING, 2020, 171 (171)
  • [22] Surface defect identification of cross scene strip based on unsupervised domain adaptation
    Liu K.
    Yang X.-S.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (03): : 477 - 485
  • [23] Domain Adaptation Transformer for Unsupervised Driving-Scene Segmentation in Adverse Conditions
    Liu, Wenyu
    Wang, Song
    Zhu, Jianke
    Xie, Xuansong
    Zhang, Lei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 21129 - 21141
  • [24] Unsupervised Domain Adaptation for Cross-Scene Multispectral Point Cloud Classification
    Wang, Qingwang
    Wang, Mingye
    Huang, Jiangbo
    Liu, Tianzhu
    Shen, Tao
    Gu, Yanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [25] Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph
    Peng, Yuxin
    Chi, Jingze
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4368 - 4379
  • [26] Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition
    Deng, Jun
    Zhang, Zixing
    Eyben, Florian
    Schuller, Bjoern
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1068 - 1072
  • [27] Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition
    Gong, Boqing
    Grauman, Kristen
    Sha, Fei
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 109 (1-2) : 3 - 27
  • [28] Unsupervised Domain Adaptation with Generative Adversarial Networks for Facial Emotion Recognition
    Fan, Yingruo
    Lam, Jacqueline C. K.
    Li, Victor O. K.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4460 - 4464
  • [29] Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition
    Boqing Gong
    Kristen Grauman
    Fei Sha
    International Journal of Computer Vision, 2014, 109 : 3 - 27
  • [30] An effective unsupervised domain adaptation for in-field potato disease recognition
    Gao, Xueze
    Feng, Quan
    Wang, Shuzhi
    Zhang, Jianhua
    Yang, Sen
    BIOSYSTEMS ENGINEERING, 2024, 247 : 267 - 282