CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization

被引:0
作者
Mingfang Deng
Huailin Zhao
Ming Gao
机构
[1] Shanghai Institute of Technology,School of Electrical and Electronic Engineering
关键词
Shunted Transformer; Weakly supervised learning; Crowd counting; Crowd localization;
D O I
暂无
中图分类号
学科分类号
摘要
Recent progress in crowd counting and localization methods mainly relies on expensive point-level annotations and convolutional neural networks with limited receptive filed, which hinders their applications in complex real-world scenes. To this end, we present CLFormer, a Transformer-based weakly supervised crowd counting and localization framework. The model extracts global information from the input image using a Transformer and then passes the extracted features to both a regression branch for crowd counting and a localization branch for localization. Initial proposals are produced by the localization branch and filtered via score maps generated from the extracted features, and their centers are used as pseudo-point-level annotations. Through staggered training of the two branches, the quality of pseudo-point-level annotations is improved, and the final localization maps are generated. Experiments on four benchmark datasets (i.e., ShanghaiTech, UCF-QNRF, JHU-CROWD++, and NWPU-Crowd) demonstrate that CLFormer obtains better counting performance than weakly supervised and fully supervised counting networks and comparable localization performance to fully supervised localization networks.
引用
收藏
页码:1053 / 1067
页数:14
相关论文
共 50 条
[41]   Cross-scene crowd counting based on supervised adaptive network parameters [J].
Li, Shufang ;
Hu, Zhengping ;
Zhao, Mengyao ;
Bi, Shuai ;
Sun, Zhe .
SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (08) :2113-2120
[42]   Weakly Supervised Temporal Action Localization Based on Contrastive Learning [J].
Hou Y. ;
Li Y. ;
Guo Z. .
Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2023, 56 (01) :73-80
[43]   A Transformer-Based Substitute Recommendation Model IncorporatingWeakly Supervised Customer Behavior Data [J].
Ye, Wenting ;
Yang, Hongfei ;
Zhao, Shuai ;
Fang, Haoyang ;
Shi, Xingjian ;
Neppalli, Naveen .
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, :3325-3329
[44]   Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection [J].
Li, Shengming ;
Xue, Linsong ;
Feng, Lin ;
Yao, Cuili ;
Wang, Dong .
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
[45]   Weakly Supervised Learning Framework Based on k Labeled Samples [J].
Fu Z. ;
Wang H.-J. ;
Li T.-R. ;
Teng F. ;
Zhang J. .
Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04) :981-990
[46]   WeakCounter: Acceleration-based Repetition Counting of Actions with Weakly Supervised Learning [J].
Nishino, Yuuki ;
Maekawa, Takuya ;
Hara, Takahiro .
IWSC'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2021, :144-146
[47]   DiffusionLoc: A diffusion model-based framework for crowd localization [J].
Zhang, Qi ;
Li, Yuan ;
Liu, Yiran ;
Zhou, Yanzhao ;
Jiao, Jianbin .
IMAGE AND VISION COMPUTING, 2025, 155
[48]   Transformer Based Multiple Instance Learning for Weakly Supervised Histopathology Image Segmentation [J].
Qian, Ziniu ;
Li, Kailu ;
Lai, Maode ;
Chang, Eric I-Chao ;
Wei, Bingzheng ;
Fan, Yubo ;
Xu, Yan .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 :160-170
[49]   Streamlining tuberculosis detection with foundation model-based weakly supervised transformer [J].
Bedőházi, Zsolt ;
Biricz, András ;
Foster, Nick ;
Lin, Yusen Eason ;
Csabai, István .
Computers in Biology and Medicine, 2025, 195
[50]   Weakly Supervised Deep Learning-based Intracranial Hemorrhage Localization [J].
Nemcek, Jakub ;
Vicar, Tomas ;
Jakubicek, Roman .
PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES (BIOIMAGING), VOL 2, 2021, :111-116