A Crowded Object Counting System with Self-Attention Mechanism

被引：0

作者：

Lien, Cheng-Chang ^{[1
]}

Wu, Pei-Chen ^{[1
]}

机构：

[1] Chung Hua Univ, Dept Comp Sci & Informat Engn, Hsinchu City 300110, Taiwan

来源：

SENSORS | 2024年 / 24卷 / 20期

关键词：

crowded object counting; density map; self-attention mechanism; VEHICLE DETECTION;

D O I：

10.3390/s24206612

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Traditional object counting systems use object detection methods to count objects. However, when objects are small, crowded, and dense, object detection may fail, leading to inaccuracies in counting. To address this issue, we propose a crowded object counting system based on density map estimation. While most density map estimation models employ encoder-decoder or multi-branch approaches to generate feature maps at different scales for obtaining an accurate density map, improving the accuracy of crowded object counting remains a challenge. In this paper, we propose a novel model that can generate more accurate density maps, utilizing the context-aware network as the primary structure and integrating the self-attention mechanism. There are three main contributions in this paper. Firstly, the self-attention mechanism is employed to improve the accuracy of density map estimation. Secondly, the missing vehicle labels in the TRANCOS database are relabeled, ensuring that the ground truth data are more complete than the original TRANCOS database, thus enabling the proposed novel model to have higher crowded object counting accuracy. Thirdly, the parameters of the self-attention mechanism are analyzed to obtain the optimum parameter combination. The experimental results demonstrate that the accuracy of crowded object counting can reach 85.9%, 90.0%, 83.4%, and 92.6% for the TRANCOS, relabeled TRANCOS, ShanghaiTech Part A, and Part B datasets, respectively. Furthermore, the ablation study for the context-aware network with self-attention mechanism analyzes the optimum parameter combination.

引用

页数：15

共 42 条

[1] Ba JL., 2016, arXiv, DOI 10.48550/arXiv.1607.06450
[2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[3] Cordonnier JB, 2021, Arxiv, DOI [arXiv:2006.16362, DOI 10.48550/ARXIV.2006.16362]
[4] Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, DOI 10.48550/ARXIV.1810.04805]
[5] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[6] Weak and Occluded Vehicle Detection in Complex Infrared Environment Based on Improved YOLOv4
Du, Shuangjiang
Zhang, Pin
Zhang, Baofu
Xu, Honghui
[J]. IEEE ACCESS, 2021, 9 : 25671 - 25680
[7] Fan QF, 2016, IEEE INT VEH SYM, P124, DOI 10.1109/IVS.2016.7535375
[8] An Anticipative Crowd Management System Preventing Clogging in Exits During Pedestrian Evacuation Processes
Georgoudas, Ioakeim G.
Sirakoulis, Georgios Ch.
Andreadis, Ioannis Th.
[J]. IEEE SYSTEMS JOURNAL, 2011, 5 (01): : 129 - 141
[9] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[10] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587

← 1 2 3 4 5 →