ProtoUDA: Prototype-Based Unsupervised Adaptation for Cross-Domain Text Recognition

被引:0
|
作者
Liu, Xiao-Qian [1 ]
Ding, Xue-Ying [1 ]
Luo, Xin [1 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Text recognition; Prototypes; Feature extraction; Task analysis; Visualization; Decoding; Adaptation models; Unsupervised learning; prototype; text recognition; contrastive learning; domain adaptation; MODEL; DIFFUSION;
D O I
10.1109/TKDE.2023.3344761
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text recognition reads from real scene text or handwritten text, facilitating many real-world applications such as driverless cars, visual Q&A, and image-based machine translation. Although impressive results have been achieved in single-domain text recognition, it still suffers from great challenges in cross-domain due to the domain gaps among the synthetic text, the real scene text, and the handwritten text. Existing standard unsupervised domain adaptation (UDA) methods struggle to solve the text recognition task since they view a domain or a text image (containing a character sequence) as a whole, ignoring the subunits that make up the sequence. In the paper, we present a Prototyped-based Unsupervised Domain Adaptation method for text recognition (ProtoUDA), where the class prototypes are computed from the source domain, target domain, and the mixed (source-target) domain, respectively. Technically, ProtoUDA initially extracts pseudo-labeled character features under word-level supervised information. Further, based on these character features, we propose two parallel and complementary modules to perform class-level and instance-level alignment, which explicitly transfer the knowledge learned in the source domain to the target domain. Among them, class-level alignment is to close the distance between the similar source prototypes and target prototypes. The instance-level alignment is based on contrastive learning, making the character instances of the mixed domain close to the corresponding class mixed prototype while staying away from other class mixed prototypes. To our knowledge, we are the first to adopt contrastive learning in UDA-based text recognition tasks. Extensive experiments on several benchmark datasets show the superiority of our method over state-of-the-art methods.
引用
收藏
页码:9096 / 9108
页数:13
相关论文
共 50 条
  • [31] Cross-Domain Similarity in Domain Adaptation for Human Activity Recognition
    Kasim, Samra
    Sheppard, John W.
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [32] Mutual-Prototype Adaptation for Cross-Domain Polyp Segmentation
    Yang, Chen
    Guo, Xiaoqing
    Zhu, Meilu
    Ibragimov, Bulat
    Yuan, Yixuan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (10) : 3886 - 3897
  • [33] Unsupervised cross-domain speaker recognition based on distribution alignment and adversarial learning
    Chen, Zhigao
    Zhao, Qingwei
    Wang, Li
    Wang, Wenchao
    Shengxue Xuebao/Acta Acustica, 2021, 46 (05): : 767 - 774
  • [34] Hypergraph and cross-attention-based unsupervised domain adaptation framework for cross-domain myocardial infarction localization
    Yuan, Shuaiying
    He, Ziyang
    Zhao, Jianhui
    Yuan, Zhiyong
    Alhudhaif, Adi
    Alenezi, Fayadh
    INFORMATION SCIENCES, 2023, 633 : 245 - 263
  • [35] Cross-Domain Correlation Distillation for Unsupervised Domain Adaptation in Nighttime Semantic Segmentation
    Gao, Huan
    Guo, Jichang
    Wang, Guoli
    Zhang, Qian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9903 - 9913
  • [36] Unsupervised Adversarial Domain Adaptation for Cross-Domain Face Presentation Attack Detection
    Wang, Guoqing
    Han, Hu
    Shan, Shiguang
    Chen, Xilin
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 56 - 69
  • [37] Unsupervised domain adaptation by cross-domain consistency learning for CT body composition
    Ali, Shahzad
    Lee, Yu Rim
    Park, Soo Young
    Tak, Won Young
    Jung, Soon Ki
    MACHINE VISION AND APPLICATIONS, 2025, 36 (01)
  • [38] Unsupervised Domain Adaptation via Class Aggregation for Text Recognition
    Liu, Xiao-Qian
    Ding, Xue-Ying
    Luo, Xin
    Xu, Xin-Shun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5617 - 5630
  • [39] Balanced Adaptation Regularization Based Transfer Learning for Unsupervised Cross-Domain Fault Diagnosis
    Hu, Qin
    Si, Xiaosheng
    Qin, Aisong
    Lv, Yunrong
    Liu, Mei
    IEEE SENSORS JOURNAL, 2022, 22 (12) : 12139 - 12151
  • [40] Global-local prototype-based few-shot learning for cross-domain hyperspectral image classification
    Tang, Haojin
    Wu, Yuelin
    Li, Hongyi
    Tang, Dong
    Yang, Xiaofei
    Xie, Weixin
    KNOWLEDGE-BASED SYSTEMS, 2025, 314