Improving Point-Based Crowd Counting and Localization Based on Auxiliary Point Guidance

被引：0

作者：

Chen, I-Hsiang ^{[1
]}

Chen, Wei-Ting ^{[1
,2
]}

Liu, Yu-Wei ^{[1
]}

Yang, Ming-Hsuan ^{[2
,3
]}

Kuo, Sy-Yen ^{[1
,4
]}

机构：

[1] Natl Taiwan Univ, Taipei, Taiwan

[2] Univ Calif Merced, Merced, CA USA

[3] Google DeepMind, New York, NY USA

[4] Chang Gung Univ, Taoyuan, Taiwan

来源：

COMPUTER VISION - ECCV 2024, PT XXIV | 2025年 / 15082卷

关键词：

Crowd Counting; Crowd Localization; Auxiliary Learning; Feature Interpolation;

D O I：

10.1007/978-3-031-72691-0_24

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Crowd counting and localization have become increasingly important in computer vision due to their wide-ranging applications. While point-based strategies have been widely used in crowd counting methods, they face a significant challenge, i.e., the lack of an effective learning strategy to guide the matching process. This deficiency leads to instability in matching point proposals to target points, adversely affecting overall performance. To address this issue, we introduce an effective approach to stabilize the proposal-target matching in point-based methods. We propose Auxiliary Point Guidance (APG) to provide clear and effective guidance for proposal selection and optimization, addressing the core issue of matching uncertainty. Additionally, we develop Implicit Feature Interpolation (IFI) to enable adaptive feature extraction in diverse crowd scenarios, further enhancing the model's robustness and accuracy. Extensive experiments demonstrate the effectiveness of our approach, showing significant improvements in crowd counting and localization performance, particularly under challenging conditions.

引用

页码：428 / 444

页数：17

共 53 条

[1] Abousamra S, 2021, AAAI CONF ARTIF INTE, V35, P872
[2] Adaptive Dilated Network with Self-Correction Supervision for Counting
Bai, Shuai
He, Zhiqun
Qiao, Yu
Hu, Hanzhe
Wu, Wei
Yan, Junjie
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4593 - 4602
[3] Basri R, 2020, PR MACH LEARN RES, V119
[4] Learning Continuous Image Representation with Local Implicit Image Function
Chen, Yinbo
Liu, Sifei
Wang, Xiaolong
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8624 - 8634
[5] Rethinking Spatial Invariance of Convolutional Networks for Object Counting
Cheng, Zhi-Qi
Dai, Qi
Li, Hong
Song, Jingkuan
Wu, Xiao
Hauptmann, Alexander G.
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19606 - 19616
[6] Redesigning Multi-Scale Neural Network for Crowd Counting
Du, Zhipeng
Shi, Miaojing
Deng, Jiankang
Zafeiriou, Stefanos
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3664 - 3678
[7] Florian L.C., 2017, CVPR
[8] Gao JY, 2021, Arxiv, DOI arXiv:2012.04164
[9] Gao JY, 2021, Arxiv, DOI arXiv:1912.03677
[10] Finding Tiny Faces
Hu, Peiyun
Ramanan, Deva
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1522 - 1530

← 1 2 3 4 5 6 →