Improving Point-Based Crowd Counting and Localization Based on Auxiliary Point Guidance

被引:0
作者
Chen, I-Hsiang [1 ]
Chen, Wei-Ting [1 ,2 ]
Liu, Yu-Wei [1 ]
Yang, Ming-Hsuan [2 ,3 ]
Kuo, Sy-Yen [1 ,4 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
[2] Univ Calif Merced, Merced, CA USA
[3] Google DeepMind, New York, NY USA
[4] Chang Gung Univ, Taoyuan, Taiwan
来源
COMPUTER VISION - ECCV 2024, PT XXIV | 2025年 / 15082卷
关键词
Crowd Counting; Crowd Localization; Auxiliary Learning; Feature Interpolation;
D O I
10.1007/978-3-031-72691-0_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting and localization have become increasingly important in computer vision due to their wide-ranging applications. While point-based strategies have been widely used in crowd counting methods, they face a significant challenge, i.e., the lack of an effective learning strategy to guide the matching process. This deficiency leads to instability in matching point proposals to target points, adversely affecting overall performance. To address this issue, we introduce an effective approach to stabilize the proposal-target matching in point-based methods. We propose Auxiliary Point Guidance (APG) to provide clear and effective guidance for proposal selection and optimization, addressing the core issue of matching uncertainty. Additionally, we develop Implicit Feature Interpolation (IFI) to enable adaptive feature extraction in diverse crowd scenarios, further enhancing the model's robustness and accuracy. Extensive experiments demonstrate the effectiveness of our approach, showing significant improvements in crowd counting and localization performance, particularly under challenging conditions.
引用
收藏
页码:428 / 444
页数:17
相关论文
共 53 条
  • [11] Composition Loss for Counting, Density Map Estimation and Localization in Dense Crowds
    Idrees, Haroon
    Tayyab, Muhmmad
    Athrey, Kishan
    Zhang, Dong
    Al-Maadeed, Somaya
    Rajpoot, Nasir
    Shah, Mubarak
    [J]. COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 544 - 559
  • [12] Multi-Source Multi-Scale Counting in Extremely Dense Crowd Images
    Idrees, Haroon
    Saleemi, Imran
    Seibert, Cody
    Shah, Mubarak
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2547 - 2554
  • [13] Attention Scaling for Crowd Counting
    Jiang, Xiaoheng
    Zhang, Li
    Xu, Mingliang
    Zhang, Tianzhu
    Lv, Pei
    Zhou, Bing
    Yang, Xin
    Pang, Yanwei
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4705 - 4714
  • [14] King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
  • [15] Kuhn H. W., 1955, NAV RES LOG, V2, P83, DOI [DOI 10.1002/NAV.3800020109, DOI 10.1002/NAV.20053, 10.1002/nav.20053, 10.1002/nav.3800020109]
  • [16] Where Are the Blobs: Counting by Localization with Point Supervision
    Laradji, Issam H.
    Rostamzadeh, Negar
    Pinheiro, Pedro O.
    Vazquez, David
    Schmidt, Mark
    [J]. COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 560 - 576
  • [17] CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
    Li, Yuhong
    Zhang, Xiaofan
    Chen, Deming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1091 - 1100
  • [18] Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization
    Lian, Dongze
    Li, Jing
    Zheng, Jia
    Luo, Weixin
    Gao, Shenghua
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1821 - 1830
  • [19] An End-to-End Transformer Model for Crowd Localization
    Liang, Dingkang
    Xu, Wei
    Bai, Xiang
    [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 38 - 54
  • [20] Focal Inverse Distance Transform Maps for Crowd Localization
    Liang, Dingkang
    Xu, Wei
    Zhu, Yingying
    Zhou, Yu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6040 - 6052