Adaptive Teaching for Cross-Domain Crowd Counting

被引：3

作者：

Gong, Shenjian ^{[1
,2
,3
]}

Yang, Jian ^{[1
,2
,3
]}

Zhang, Shanshan ^{[1
,2
,3
]}

机构：

[1] Nanjing Univ Sci & Technol, PCA Lab, Nanjing 210094, Peoples R China

[2] Nanjing Univ Sci & Technol, Key Lab Intelligent Percept & Syst High Dimens Inf, Minist Educ, Nanjing 210094, Peoples R China

[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Social, Nanjing 210094, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

Crowd counting; domain adaptation; mean teacher;

D O I：

10.1109/TMM.2023.3305815

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The main challenge of Unsupervised Domain Adaptation (UDA) crowd counting is the large domain gap between a synthetic domain with annotations (source) and a real-world domain of interest without annotations (target). Previous mainstream UDA crowd counting methods either employ feature alignment or a semi-supervised learning paradigm via pseudo-labels. We for the first time combine both of their advantages and propose an Adversarial Mean Teacher (AMT) framework. On the one hand, we optimize the student model with domain adversarial learning. On the other hand, we feed perturbed target images to the teacher model to generate pseudo-labels. Furthermore, to improve the quality of the pseudo-labels, we propose an Adaptive Teaching (AT) module, consisting of pseudo-label refinement and credible pseudo-label selection. Concretely, we first generate two candidate pseudo-labels from the prediction of the teacher model and obtain a refined pseudo-label by mixing them at the pixel-level. Moreover, we introduce an auxiliary task of foreground-background classification to assist credible region selection and only activate supervision signals on those regions. Extensive experiments on four real-world crowd counting benchmarks demonstrate the effectiveness of our method namely Cross-Domain Adaptive Teacher (CDAT).

引用

页码：2943 / 2952

页数：10

共 50 条

[21] Enhancing Manatee Aggregation Counting Through Augmentation and Cross-Domain Learning [J].

Zaramella, Matteo ;

Zhu, Xingquan ;

Amerini, Irene .

IEEE ACCESS, 2024, 12 :131148-131163

[22] SELF-SUPERVISED DOMAIN ADAPTATION IN CROWD COUNTING [J].

Nguyen, Pha ;

Truong, Thanh-Dat ;

Huang, Miaoqing ;

Liang, Yi ;

Le, Ngan ;

Luu, Khoa .

2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, :2786-2790

[23] Domain-Adaptive Crowd Counting via High-Quality Image Translation and Density Reconstruction [J].

Gao, Junyu ;

Han, Tao ;

Yuan, Yuan ;

Wang, Qi .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) :4803-4815

[24] Cross-Domain Attention Alignment for Domain Adaptive Person re-ID [J].

Zhang, Zhen ;

Wang, Wei ;

Kane, Guoliang .

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XII, 2025, 15042 :114-127

[25] Adaptive Domain Alignment Neural Networks for Cross-Domain EEG Emotion Recognition [J].

Hong, Xuezhu ;

Du, Changde ;

He, Huiguang .

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2025, 16 (02) :903-914

[26] Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification [J].

Zhang, Kai ;

Liu, Qi ;

Huang, Zhenya ;

Cheng, Mingyue ;

Zhang, Kun ;

Zhang, Mengdi ;

Wu, Wei ;

Chen, Enhong .

PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, :1566-1576

[27] ADAPTIVE SCENARIO DISCOVERY FOR CROWD COUNTING [J].

Wu, Xingjiao ;

Zheng, Yingbin ;

Ye, Hao ;

Hu, Wenxin ;

Yang, Jing ;

He, Liang .

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, :2382-2386

[28] Cross-scene crowd counting based on supervised adaptive network parameters [J].

Shufang Li ;

Zhengping Hu ;

Mengyao Zhao ;

Shuai Bi ;

Zhe Sun .

Signal, Image and Video Processing, 2022, 16 :2113-2120

[29] A scale adaptive network for crowd counting [J].

Zhang, Youmei ;

Zhou, Chunluan ;

Chang, Faliang ;

Kot, Alex C. .

NEUROCOMPUTING, 2019, 362 :139-146

[30] Cross-scene crowd counting based on supervised adaptive network parameters [J].

Li, Shufang ;

Hu, Zhengping ;

Zhao, Mengyao ;

Bi, Shuai ;

Sun, Zhe .

SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (08) :2113-2120

← 1 2 3 4 5 →