CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization

被引:0
作者
Mingfang Deng
Huailin Zhao
Ming Gao
机构
[1] Shanghai Institute of Technology,School of Electrical and Electronic Engineering
关键词
Shunted Transformer; Weakly supervised learning; Crowd counting; Crowd localization;
D O I
暂无
中图分类号
学科分类号
摘要
Recent progress in crowd counting and localization methods mainly relies on expensive point-level annotations and convolutional neural networks with limited receptive filed, which hinders their applications in complex real-world scenes. To this end, we present CLFormer, a Transformer-based weakly supervised crowd counting and localization framework. The model extracts global information from the input image using a Transformer and then passes the extracted features to both a regression branch for crowd counting and a localization branch for localization. Initial proposals are produced by the localization branch and filtered via score maps generated from the extracted features, and their centers are used as pseudo-point-level annotations. Through staggered training of the two branches, the quality of pseudo-point-level annotations is improved, and the final localization maps are generated. Experiments on four benchmark datasets (i.e., ShanghaiTech, UCF-QNRF, JHU-CROWD++, and NWPU-Crowd) demonstrate that CLFormer obtains better counting performance than weakly supervised and fully supervised counting networks and comparable localization performance to fully supervised localization networks.
引用
收藏
页码:1053 / 1067
页数:14
相关论文
共 50 条
[31]   Multi-Level Dynamic Graph Convolutional Networks for Weakly Supervised Crowd Counting [J].
Miao, Zhuangzhuang ;
Zhang, Yong ;
Ren, Hao ;
Hu, Yongli ;
Yin, Baocai .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) :3483-3495
[32]   Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations [J].
Yang, Yifan ;
Li, Guorong ;
Wu, Zhe ;
Su, Li ;
Huang, Qingming ;
Sebe, Nicu .
COMPUTER VISION - ECCV 2020, PT VIII, 2020, 12353 :1-17
[33]   CrowdNeXt: Boosting Weakly Supervised Crowd Counting With Dual-Path Feature Aggregation and a Robust Loss Function [J].
Savner, Siddharth Singh ;
Kanhangad, Vivek .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[34]   Improving Point-Based Crowd Counting and Localization Based on Auxiliary Point Guidance [J].
Chen, I-Hsiang ;
Chen, Wei-Ting ;
Liu, Yu-Wei ;
Yang, Ming-Hsuan ;
Kuo, Sy-Yen .
COMPUTER VISION - ECCV 2024, PT XXIV, 2025, 15082 :428-444
[35]   Semi-supervised Crowd Counting Method Based on Attention Mechanism [J].
Hu, Zijian ;
Li, Yingying ;
Zou, Jie ;
Liu, Jian .
2024 17TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, ICACTE, 2024, :231-235
[36]   Semi-supervised Crowd Counting based on Patch Crowds Statistics [J].
Peng, Sifan ;
Yin, Baoqun ;
Xia, Yinfeng ;
Yang, Qianqian ;
Wang, Luyang .
2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, :749-755
[37]   D2PT: Density to Point Transformer with Knowledge Distillation for Crowd Counting and Localization [J].
Li, Fan ;
Yang, Enze ;
Li, Chao ;
Liu, Shuoyan ;
Wang, Haodong .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2025, E108D (02) :165-168
[38]   Cross-scene crowd counting based on supervised adaptive network parameters [J].
Shufang Li ;
Zhengping Hu ;
Mengyao Zhao ;
Shuai Bi ;
Zhe Sun .
Signal, Image and Video Processing, 2022, 16 :2113-2120
[39]   A Semi-supervised crowd counting method based on patch crowds statistics [J].
Peng, Sifan ;
Yin, Baoqun ;
Xia, Yinfeng ;
Yang, Qianqian ;
Wang, Luyang .
PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (04)
[40]   Semi-supervised Crowd Counting Based on Hard Pseudo-labels [J].
Li, Hanxiao ;
Song, Yonghong ;
Geng, Tong .
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,