ProtoUDA: Prototype-Based Unsupervised Adaptation for Cross-Domain Text Recognition

被引:0
|
作者
Liu, Xiao-Qian [1 ]
Ding, Xue-Ying [1 ]
Luo, Xin [1 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Text recognition; Prototypes; Feature extraction; Task analysis; Visualization; Decoding; Adaptation models; Unsupervised learning; prototype; text recognition; contrastive learning; domain adaptation; MODEL; DIFFUSION;
D O I
10.1109/TKDE.2023.3344761
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text recognition reads from real scene text or handwritten text, facilitating many real-world applications such as driverless cars, visual Q&A, and image-based machine translation. Although impressive results have been achieved in single-domain text recognition, it still suffers from great challenges in cross-domain due to the domain gaps among the synthetic text, the real scene text, and the handwritten text. Existing standard unsupervised domain adaptation (UDA) methods struggle to solve the text recognition task since they view a domain or a text image (containing a character sequence) as a whole, ignoring the subunits that make up the sequence. In the paper, we present a Prototyped-based Unsupervised Domain Adaptation method for text recognition (ProtoUDA), where the class prototypes are computed from the source domain, target domain, and the mixed (source-target) domain, respectively. Technically, ProtoUDA initially extracts pseudo-labeled character features under word-level supervised information. Further, based on these character features, we propose two parallel and complementary modules to perform class-level and instance-level alignment, which explicitly transfer the knowledge learned in the source domain to the target domain. Among them, class-level alignment is to close the distance between the similar source prototypes and target prototypes. The instance-level alignment is based on contrastive learning, making the character instances of the mixed domain close to the corresponding class mixed prototype while staying away from other class mixed prototypes. To our knowledge, we are the first to adopt contrastive learning in UDA-based text recognition tasks. Extensive experiments on several benchmark datasets show the superiority of our method over state-of-the-art methods.
引用
收藏
页码:9096 / 9108
页数:13
相关论文
共 50 条
  • [21] DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation
    Lee, Seunghun
    Cho, Sunghyun
    Im, Sunghoon
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15247 - 15256
  • [22] Cross-domain Network Traffic Classification Using Unsupervised Domain Adaptation
    Li, Dongpu
    Yuan, Qifeng
    Li, Tan
    Chen, Shuangwu
    Yang, Jian
    2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 245 - +
  • [23] Cross-Domain Attention Network for Unsupervised Domain Adaptation Crowd Counting
    Zhang, Anran
    Xu, Jun
    Luo, Xiaoyan
    Cao, Xianbin
    Zhen, Xiantong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6686 - 6699
  • [24] Learning cross-domain representations by vision transformer for unsupervised domain adaptation
    Ye, Yifan
    Fu, Shuai
    Chen, Jing
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (15): : 10847 - 10860
  • [25] Adversarial Domain Adaptation With Prototype-Based Normalized Output Conditioner
    Hu, Dapeng
    Liang, Jian
    Hou, Qibin
    Yan, Hanshu
    Chen, Yunpeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9359 - 9371
  • [26] A cross-domain fruit classification method based on lightweight attention networks and unsupervised domain adaptation
    Wang, Jin
    Zhang, Cheng
    Yan, Ting
    Yang, Jingru
    Lu, Xiaohui
    Lu, Guodong
    Huang, Bincheng
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 4227 - 4247
  • [27] A cross-domain fruit classification method based on lightweight attention networks and unsupervised domain adaptation
    Jin Wang
    Cheng Zhang
    Ting Yan
    Jingru Yang
    Xiaohui Lu
    Guodong Lu
    Bincheng Huang
    Complex & Intelligent Systems, 2023, 9 : 4227 - 4247
  • [28] Prototype-based Domain Description
    Angiulli, Fabrizio
    ECAI 2008, PROCEEDINGS, 2008, 178 : 107 - +
  • [29] Crowd Counting via Unsupervised Cross-Domain Feature Adaptation
    Ding, Guanchen
    Yang, Daiqin
    Wang, Tao
    Wang, Sihan
    Zhang, Yunfei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4665 - 4678
  • [30] Joint cross-domain classification and subspace learning for unsupervised adaptation
    Fernando, Basura
    Tommasi, Tatiana
    Tuytelaars, Tinne
    PATTERN RECOGNITION LETTERS, 2015, 65 : 60 - 66