Adaptive Teaching for Cross-Domain Crowd Counting

被引：2

作者：

Gong, Shenjian ^{[1
,2
,3
]}

Yang, Jian ^{[1
,2
,3
]}

Zhang, Shanshan ^{[1
,2
,3
]}

机构：

[1] Nanjing Univ Sci & Technol, PCA Lab, Nanjing 210094, Peoples R China

[2] Nanjing Univ Sci & Technol, Key Lab Intelligent Percept & Syst High Dimens Inf, Minist Educ, Nanjing 210094, Peoples R China

[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Social, Nanjing 210094, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

Crowd counting; domain adaptation; mean teacher;

D O I：

10.1109/TMM.2023.3305815

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The main challenge of Unsupervised Domain Adaptation (UDA) crowd counting is the large domain gap between a synthetic domain with annotations (source) and a real-world domain of interest without annotations (target). Previous mainstream UDA crowd counting methods either employ feature alignment or a semi-supervised learning paradigm via pseudo-labels. We for the first time combine both of their advantages and propose an Adversarial Mean Teacher (AMT) framework. On the one hand, we optimize the student model with domain adversarial learning. On the other hand, we feed perturbed target images to the teacher model to generate pseudo-labels. Furthermore, to improve the quality of the pseudo-labels, we propose an Adaptive Teaching (AT) module, consisting of pseudo-label refinement and credible pseudo-label selection. Concretely, we first generate two candidate pseudo-labels from the prediction of the teacher model and obtain a refined pseudo-label by mixing them at the pixel-level. Moreover, we introduce an auxiliary task of foreground-background classification to assist credible region selection and only activate supervision signals on those regions. Extensive experiments on four real-world crowd counting benchmarks demonstrate the effectiveness of our method namely Cross-Domain Adaptive Teacher (CDAT).

引用

页码：2943 / 2952

页数：10

共 50 条

[1] Cross-Domain Attention Network for Unsupervised Domain Adaptation Crowd Counting
Zhang, Anran
Xu, Jun
Luo, Xiaoyan
Cao, Xianbin
Zhen, Xiantong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6686 - 6699
[2] Crowd Counting via Unsupervised Cross-Domain Feature Adaptation
Ding, Guanchen
Yang, Daiqin
Wang, Tao
Wang, Sihan
Zhang, Yunfei
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4665 - 4678
[3] DATASET-LEVEL DIRECTED IMAGE TRANSLATION FOR CROSS-DOMAIN CROWD COUNTING
Tan, Xin
Ishikawa, Hiroshi
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 400 - 404
[4] Head-Aware Density Adaptation Networks for Cross-Domain Crowd Counting
Cai Y.
Ma Z.
Wang T.
Lyu C.
Wang C.
He G.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (10): : 1514 - 1523
[5] Joint perturbation consistency across image and feature levels for cross-domain adaptive crowd counting
Xie, Chengjie
Lu, Shuhua
Shi, Yangyu
Zheng, Diwen
VISUAL COMPUTER, 2025,
[6] Fine-Grained Fragment Diffusion for Cross-Domain Crowd Counting
Zhu, Huilin
Yuan, Jingling
Yang, Zhengwei
Zhong, Xian
Wang, Zheng
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5659 - 5668
[7] Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting
Wu, Qiangqiang
Wan, Jia
Chan, Antoni B.
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 658 - 666
[8] Striking a Balance: Unsupervised Cross-Domain Crowd Counting via Knowledge Diffusion
Xie, Haiyang
Yang, Zhengwei
Zhu, Huilin
Wang, Zheng
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6520 - 6529
[9] FOCUS ON SEMANTIC CONSISTENCY FOR CROSS-DOMAIN CROWD UNDERSTANDING
Han, Tao
Gao, Junyu
Yuan, Yuan
Wang, Qi
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1848 - 1852
[10] Domain Adaptation in Crowd Counting
Hossain, Mohammad Asiful
Reddy, Mahesh Kumar Krishna
Cannons, Kevin
Xu, Zhan
Wang, Yang
2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 150 - 157

← 1 2 3 4 5 →