Improving Point-Based Crowd Counting and Localization Based on Auxiliary Point Guidance

被引:0
作者
Chen, I-Hsiang [1 ]
Chen, Wei-Ting [1 ,2 ]
Liu, Yu-Wei [1 ]
Yang, Ming-Hsuan [2 ,3 ]
Kuo, Sy-Yen [1 ,4 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
[2] Univ Calif Merced, Merced, CA USA
[3] Google DeepMind, New York, NY USA
[4] Chang Gung Univ, Taoyuan, Taiwan
来源
COMPUTER VISION - ECCV 2024, PT XXIV | 2025年 / 15082卷
关键词
Crowd Counting; Crowd Localization; Auxiliary Learning; Feature Interpolation;
D O I
10.1007/978-3-031-72691-0_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting and localization have become increasingly important in computer vision due to their wide-ranging applications. While point-based strategies have been widely used in crowd counting methods, they face a significant challenge, i.e., the lack of an effective learning strategy to guide the matching process. This deficiency leads to instability in matching point proposals to target points, adversely affecting overall performance. To address this issue, we introduce an effective approach to stabilize the proposal-target matching in point-based methods. We propose Auxiliary Point Guidance (APG) to provide clear and effective guidance for proposal selection and optimization, addressing the core issue of matching uncertainty. Additionally, we develop Implicit Feature Interpolation (IFI) to enable adaptive feature extraction in diverse crowd scenarios, further enhancing the model's robustness and accuracy. Extensive experiments demonstrate the effectiveness of our approach, showing significant improvements in crowd counting and localization performance, particularly under challenging conditions.
引用
收藏
页码:428 / 444
页数:17
相关论文
共 53 条
  • [1] Abousamra S, 2021, AAAI CONF ARTIF INTE, V35, P872
  • [2] Adaptive Dilated Network with Self-Correction Supervision for Counting
    Bai, Shuai
    He, Zhiqun
    Qiao, Yu
    Hu, Hanzhe
    Wu, Wei
    Yan, Junjie
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4593 - 4602
  • [3] Basri R, 2020, PR MACH LEARN RES, V119
  • [4] Learning Continuous Image Representation with Local Implicit Image Function
    Chen, Yinbo
    Liu, Sifei
    Wang, Xiaolong
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8624 - 8634
  • [5] Rethinking Spatial Invariance of Convolutional Networks for Object Counting
    Cheng, Zhi-Qi
    Dai, Qi
    Li, Hong
    Song, Jingkuan
    Wu, Xiao
    Hauptmann, Alexander G.
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19606 - 19616
  • [6] Redesigning Multi-Scale Neural Network for Crowd Counting
    Du, Zhipeng
    Shi, Miaojing
    Deng, Jiankang
    Zafeiriou, Stefanos
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3664 - 3678
  • [7] Florian L.C., 2017, CVPR
  • [8] Gao JY, 2021, Arxiv, DOI arXiv:2012.04164
  • [9] Gao JY, 2021, Arxiv, DOI arXiv:1912.03677
  • [10] Finding Tiny Faces
    Hu, Peiyun
    Ramanan, Deva
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1522 - 1530