CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization

被引：0

作者：

Mingfang Deng

Huailin Zhao

Ming Gao

机构：

[1] Shanghai Institute of Technology,School of Electrical and Electronic Engineering

来源：

The Visual Computer | 2024年 / 40卷 / 2期

关键词：

Shunted Transformer; Weakly supervised learning; Crowd counting; Crowd localization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Recent progress in crowd counting and localization methods mainly relies on expensive point-level annotations and convolutional neural networks with limited receptive filed, which hinders their applications in complex real-world scenes. To this end, we present CLFormer, a Transformer-based weakly supervised crowd counting and localization framework. The model extracts global information from the input image using a Transformer and then passes the extracted features to both a regression branch for crowd counting and a localization branch for localization. Initial proposals are produced by the localization branch and filtered via score maps generated from the extracted features, and their centers are used as pseudo-point-level annotations. Through staggered training of the two branches, the quality of pseudo-point-level annotations is improved, and the final localization maps are generated. Experiments on four benchmark datasets (i.e., ShanghaiTech, UCF-QNRF, JHU-CROWD++, and NWPU-Crowd) demonstrate that CLFormer obtains better counting performance than weakly supervised and fully supervised counting networks and comparable localization performance to fully supervised localization networks.

引用

页码：1053 / 1067

页数：14

共 50 条

[41] Cross-scene crowd counting based on supervised adaptive network parameters [J].

Li, Shufang ;

Hu, Zhengping ;

Zhao, Mengyao ;

Bi, Shuai ;

Sun, Zhe .

SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (08) :2113-2120

[42] Weakly Supervised Temporal Action Localization Based on Contrastive Learning [J].

Hou Y. ;

Li Y. ;

Guo Z. .

Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2023, 56 (01) :73-80

[43] A Transformer-Based Substitute Recommendation Model IncorporatingWeakly Supervised Customer Behavior Data [J].

Ye, Wenting ;

Yang, Hongfei ;

Zhao, Shuai ;

Fang, Haoyang ;

Shi, Xingjian ;

Neppalli, Naveen .

PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, :3325-3329

[44] Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection [J].

Li, Shengming ;

Xue, Linsong ;

Feng, Lin ;

Yao, Cuili ;

Wang, Dong .

COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102

[45] Weakly Supervised Learning Framework Based on k Labeled Samples [J].

Fu Z. ;

Wang H.-J. ;

Li T.-R. ;

Teng F. ;

Zhang J. .

Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04) :981-990

[46] WeakCounter: Acceleration-based Repetition Counting of Actions with Weakly Supervised Learning [J].

Nishino, Yuuki ;

Maekawa, Takuya ;

Hara, Takahiro .

IWSC'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2021, :144-146

[47] DiffusionLoc: A diffusion model-based framework for crowd localization [J].

Zhang, Qi ;

Li, Yuan ;

Liu, Yiran ;

Zhou, Yanzhao ;

Jiao, Jianbin .

IMAGE AND VISION COMPUTING, 2025, 155

[48] Transformer Based Multiple Instance Learning for Weakly Supervised Histopathology Image Segmentation [J].

Qian, Ziniu ;

Li, Kailu ;

Lai, Maode ;

Chang, Eric I-Chao ;

Wei, Bingzheng ;

Fan, Yubo ;

Xu, Yan .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 :160-170

[49] Streamlining tuberculosis detection with foundation model-based weakly supervised transformer [J].

Bedőházi, Zsolt ;

Biricz, András ;

Foster, Nick ;

Lin, Yusen Eason ;

Csabai, István .

Computers in Biology and Medicine, 2025, 195

[50] Weakly Supervised Deep Learning-based Intracranial Hemorrhage Localization [J].

Nemcek, Jakub ;

Vicar, Tomas ;

Jakubicek, Roman .

PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES (BIOIMAGING), VOL 2, 2021, :111-116

← 1 2 3 4 5 →