Adaptive Teaching for Cross-Domain Crowd Counting

被引:2
|
作者
Gong, Shenjian [1 ,2 ,3 ]
Yang, Jian [1 ,2 ,3 ]
Zhang, Shanshan [1 ,2 ,3 ]
机构
[1] Nanjing Univ Sci & Technol, PCA Lab, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Key Lab Intelligent Percept & Syst High Dimens Inf, Minist Educ, Nanjing 210094, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Social, Nanjing 210094, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowd counting; domain adaptation; mean teacher;
D O I
10.1109/TMM.2023.3305815
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main challenge of Unsupervised Domain Adaptation (UDA) crowd counting is the large domain gap between a synthetic domain with annotations (source) and a real-world domain of interest without annotations (target). Previous mainstream UDA crowd counting methods either employ feature alignment or a semi-supervised learning paradigm via pseudo-labels. We for the first time combine both of their advantages and propose an Adversarial Mean Teacher (AMT) framework. On the one hand, we optimize the student model with domain adversarial learning. On the other hand, we feed perturbed target images to the teacher model to generate pseudo-labels. Furthermore, to improve the quality of the pseudo-labels, we propose an Adaptive Teaching (AT) module, consisting of pseudo-label refinement and credible pseudo-label selection. Concretely, we first generate two candidate pseudo-labels from the prediction of the teacher model and obtain a refined pseudo-label by mixing them at the pixel-level. Moreover, we introduce an auxiliary task of foreground-background classification to assist credible region selection and only activate supervision signals on those regions. Extensive experiments on four real-world crowd counting benchmarks demonstrate the effectiveness of our method namely Cross-Domain Adaptive Teacher (CDAT).
引用
收藏
页码:2943 / 2952
页数:10
相关论文
共 50 条
  • [1] Cross-Domain Attention Network for Unsupervised Domain Adaptation Crowd Counting
    Zhang, Anran
    Xu, Jun
    Luo, Xiaoyan
    Cao, Xianbin
    Zhen, Xiantong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6686 - 6699
  • [2] Crowd Counting via Unsupervised Cross-Domain Feature Adaptation
    Ding, Guanchen
    Yang, Daiqin
    Wang, Tao
    Wang, Sihan
    Zhang, Yunfei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4665 - 4678
  • [3] DATASET-LEVEL DIRECTED IMAGE TRANSLATION FOR CROSS-DOMAIN CROWD COUNTING
    Tan, Xin
    Ishikawa, Hiroshi
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 400 - 404
  • [4] Head-Aware Density Adaptation Networks for Cross-Domain Crowd Counting
    Cai Y.
    Ma Z.
    Wang T.
    Lyu C.
    Wang C.
    He G.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (10): : 1514 - 1523
  • [5] Joint perturbation consistency across image and feature levels for cross-domain adaptive crowd counting
    Xie, Chengjie
    Lu, Shuhua
    Shi, Yangyu
    Zheng, Diwen
    VISUAL COMPUTER, 2025,
  • [6] Fine-Grained Fragment Diffusion for Cross-Domain Crowd Counting
    Zhu, Huilin
    Yuan, Jingling
    Yang, Zhengwei
    Zhong, Xian
    Wang, Zheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5659 - 5668
  • [7] Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting
    Wu, Qiangqiang
    Wan, Jia
    Chan, Antoni B.
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 658 - 666
  • [8] Striking a Balance: Unsupervised Cross-Domain Crowd Counting via Knowledge Diffusion
    Xie, Haiyang
    Yang, Zhengwei
    Zhu, Huilin
    Wang, Zheng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6520 - 6529
  • [9] FOCUS ON SEMANTIC CONSISTENCY FOR CROSS-DOMAIN CROWD UNDERSTANDING
    Han, Tao
    Gao, Junyu
    Yuan, Yuan
    Wang, Qi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1848 - 1852
  • [10] Domain Adaptation in Crowd Counting
    Hossain, Mohammad Asiful
    Reddy, Mahesh Kumar Krishna
    Cannons, Kevin
    Xu, Zhan
    Wang, Yang
    2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 150 - 157