Adaptive Teaching for Cross-Domain Crowd Counting

被引：2

作者：

Gong, Shenjian ^{[1
,2
,3
]}

Yang, Jian ^{[1
,2
,3
]}

Zhang, Shanshan ^{[1
,2
,3
]}

机构：

[1] Nanjing Univ Sci & Technol, PCA Lab, Nanjing 210094, Peoples R China

[2] Nanjing Univ Sci & Technol, Key Lab Intelligent Percept & Syst High Dimens Inf, Minist Educ, Nanjing 210094, Peoples R China

[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Social, Nanjing 210094, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

Crowd counting; domain adaptation; mean teacher;

D O I：

10.1109/TMM.2023.3305815

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The main challenge of Unsupervised Domain Adaptation (UDA) crowd counting is the large domain gap between a synthetic domain with annotations (source) and a real-world domain of interest without annotations (target). Previous mainstream UDA crowd counting methods either employ feature alignment or a semi-supervised learning paradigm via pseudo-labels. We for the first time combine both of their advantages and propose an Adversarial Mean Teacher (AMT) framework. On the one hand, we optimize the student model with domain adversarial learning. On the other hand, we feed perturbed target images to the teacher model to generate pseudo-labels. Furthermore, to improve the quality of the pseudo-labels, we propose an Adaptive Teaching (AT) module, consisting of pseudo-label refinement and credible pseudo-label selection. Concretely, we first generate two candidate pseudo-labels from the prediction of the teacher model and obtain a refined pseudo-label by mixing them at the pixel-level. Moreover, we introduce an auxiliary task of foreground-background classification to assist credible region selection and only activate supervision signals on those regions. Extensive experiments on four real-world crowd counting benchmarks demonstrate the effectiveness of our method namely Cross-Domain Adaptive Teacher (CDAT).

引用

页码：2943 / 2952

页数：10

共 50 条

[31] Multi Layered Deep Neural Network for Feature Extraction in Cross Domain Crowd Counting [J].

Gunawardhana, Janith ;

Senevirathne, Rukmal ;

Karunarathne, Buddhika .

2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, :1051-1056

[32] Scale Adaptive Enhance Network for Crowd Counting [J].

Fan, Zirui ;

Ruan, Jun .

2022 11TH INTERNATIONAL CONFERENCE ON EDUCATIONAL AND INFORMATION TECHNOLOGY (ICEIT 2022), 2022, :220-225

[33] Adaptive Context Learning Network for Crowd Counting [J].

Liu, Zhao ;

Zeng, Guanqi ;

Feng, Zunlei ;

Zhang, Rong ;

Song, Mingli ;

Shen, Jianping .

2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, :4103-4109

[34] ADAPTIVE DEPTH NETWORK FOR CROWD COUNTING AND BEYOND [J].

Rong, Liangzi ;

Li, Chunping .

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,

[35] Crowd Counting Using Adaptive Segmentation in a Congregation [J].

Sajid, Muhamad ;

Hassan, Ali ;

Khan, Shoab A. .

2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, :745-749

[36] Dual-Level Adaptive and Discriminative Knowledge Transfer for Cross-Domain Recognition [J].

Meng, Min ;

Lan, Mengcheng ;

Yu, Jun ;

Wu, Jigang ;

Liu, Ligang .

IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :2266-2279

[37] AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection [J].

Gao, Yipeng ;

Yang, Lingxiao ;

Huang, Yunmu ;

Xie, Song ;

Li, Shiyong ;

Zheng, Wei-Shi .

COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 :673-690

[38] Cross-domain few-shot learning via adaptive transformer networks [J].

Paeedeh, Naeem ;

Pratama, Mahardhika ;

Ma'sum, Muhammad Anwar ;

Mayer, Wolfgang ;

Cao, Zehong ;

Kowlczyk, Ryszard .

KNOWLEDGE-BASED SYSTEMS, 2024, 288

[39] Transferable adaptive channel attention module for unsupervised cross-domain fault diagnosis [J].

Shi, Yaowei ;

Deng, Aidong ;

Deng, Minqiang ;

Xu, Meng ;

Liu, Yang ;

Ding, Xue ;

Li, Jing .

RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 226

[40] Towards Learning Multi-Domain Crowd Counting [J].

Yan, Zhaoyi ;

Li, Pengyu ;

Wang, Biao ;

Ren, Dongwei ;

Zuo, Wangmeng .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) :6544-6557

← 1 2 3 4 5 →