CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization

Cited by: 0
Authors
Mingfang Deng
Huailin Zhao
Ming Gao
Affiliations
[1] Shanghai Institute of Technology, School of Electrical and Electronic Engineering
Keywords
Shunted Transformer; Weakly supervised learning; Crowd counting; Crowd localization
DOI
Not available
CLC number
Subject classification code
Abstract
Recent progress in crowd counting and localization methods mainly relies on expensive point-level annotations and on convolutional neural networks with limited receptive fields, which hinders their application in complex real-world scenes. To this end, we present CLFormer, a Transformer-based weakly supervised crowd counting and localization framework. The model extracts global information from the input image using a Transformer and then passes the extracted features to both a regression branch for crowd counting and a localization branch for crowd localization. Initial proposals are produced by the localization branch and filtered via score maps generated from the extracted features, and the centers of the remaining proposals are used as pseudo-point-level annotations. Through staggered training of the two branches, the quality of the pseudo-point-level annotations is improved, and the final localization maps are generated. Experiments on four benchmark datasets (i.e., ShanghaiTech, UCF-QNRF, JHU-CROWD++, and NWPU-Crowd) demonstrate that CLFormer obtains better counting performance than weakly supervised and fully supervised counting networks and localization performance comparable to that of fully supervised localization networks.
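The proposal-filtering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the filtering rule, so everything here is an assumption made for illustration: the box format `(x_min, y_min, x_max, y_max)`, the lookup of the score at the proposal center, the `threshold` value, and the hypothetical function names `box_center` and `pseudo_points`. It is not the authors' implementation.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x_min, y_min, x_max, y_max)
Point = Tuple[float, float]              # (x, y)


def box_center(box: Box) -> Point:
    """Return the center of an axis-aligned box."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2.0, (y_min + y_max) / 2.0)


def pseudo_points(
    proposals: Sequence[Box],
    score_map: Sequence[Sequence[float]],
    threshold: float = 0.5,  # assumed value, not taken from the paper
) -> List[Point]:
    """Keep proposals whose center falls on a score-map cell with a score of
    at least `threshold`, and return those centers as pseudo point annotations.

    Assumption: the score map is indexed as score_map[row][col] at image
    resolution, so a center (x, y) maps to the cell (int(y), int(x)).
    """
    height = len(score_map)
    width = len(score_map[0]) if height else 0
    points: List[Point] = []
    for box in proposals:
        cx, cy = box_center(box)
        row, col = int(cy), int(cx)
        inside = 0 <= row < height and 0 <= col < width
        if inside and score_map[row][col] >= threshold:
            points.append((cx, cy))
    return points


# Usage: two proposals on a 4x4 score map; only the first has a high-scoring center.
demo_map = [[0.0] * 4 for _ in range(4)]
demo_map[1][1] = 0.9
demo_map[3][3] = 0.1
print(pseudo_points([(0, 0, 2, 2), (2, 2, 4, 4)], demo_map))
```

In a weakly supervised setting of the kind the abstract describes, points obtained this way would stand in for manual point-level labels when training the localization branch; how the two branches are actually scheduled and how the score maps are produced is left to the paper.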
Pages: 1053 - 1067
Page count: 14
Related papers
50 in total
  • [1] CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization
    Deng, Mingfang
    Zhao, Huailin
    Gao, Ming
    VISUAL COMPUTER, 2024, 40 (02) : 1053 - 1067
  • [2] Weakly supervised crowd counting based on Swin Transformer
    Feng, Min
    Hao, Linlin
    Kuang, Yonggang
    2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 229 - 236
  • [3] CCTwins: A Weakly Supervised Transformer-Based Crowd Counting Method With Adaptive Scene Consistency Attention
    Dong, Li
    Zhang, Haijun
    Zhou, Dongliang
    Shi, Jianyang
    Ma, Jianghong
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 22 - 35
  • [4] Learning Crowd Scale and Distribution for Weakly Supervised Crowd Counting and Localization
    Fan, Yaowu
    Wan, Jia
    Ma, Andy J.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 713 - 727
  • [5] Transformer-Based Feature Aggregation and Stitching Network for Crowd Counting
    Wang, Kehao
    Wang, Yuhui
    Ren, Ruiqi
    Zou, Han
    Shao, Zhichao
    IEEE ACCESS, 2023, 11 : 124833 - 124844
  • [6] A Weakly Supervised Hybrid Lightweight Network for Efficient Crowd Counting
    Chen, Yongqi
    Zhao, Huailin
    Gao, Ming
    Deng, Mingfang
    ELECTRONICS, 2024, 13 (04)
  • [7] DTCC: Multi-level dilated convolution with transformer for weakly-supervised crowd counting
    Zhuangzhuang Miao
    Yong Zhang
    Yuan Peng
    Haocheng Peng
    Baocai Yin
    Computational Visual Media, 2023, 9 : 859 - 873
  • [8] WEAKLY SUPERVISED CROWD-WISE ATTENTION FOR ROBUST CROWD COUNTING
    Kong, Xiyu
    Zhao, Muming
    Zhou, Hao
    Zhang, Chongyang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2722 - 2726
  • [9] DTCC: Multi-level dilated convolution with transformer for weakly-supervised crowd counting
    Miao, Zhuangzhuang
    Zhang, Yong
    Peng, Yuan
    Peng, Haocheng
    Yin, Baocai
    COMPUTATIONAL VISUAL MEDIA, 2023, 9 (04) : 859 - 873
  • [10] TransCrowd: weakly-supervised crowd counting with transformers
    Liang, Dingkang
    Chen, Xiwu
    Xu, Wei
    Zhou, Yu
    Bai, Xiang
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (06)