PARTS BASED ATTENTION FOR HIGHLY OCCLUDED PEDESTRIAN DETECTION WITH TRANSFORMERS

Citations: 0

Authors:

Shastry, K. N. Ajay [1]

Chaudhari, Jayesh [1]

Thapar, Daksh [2]

Nigam, Aditya [2]

Arora, Chetan [1]

Affiliations:

[1] Indian Inst Technol, Delhi, India

[2] Indian Inst Technol, Mandi, Himachal Pradesh, India

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023

DOI:

10.1109/ICIP49359.2023.10222651

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Despite the significant progress made in pedestrian detection in the last decade, detecting pedestrians under heavy occlusion remains a challenging problem. In state-of-the-art (SOTA) convolutional neural network (CNN) based models, the failure is attributed to non-maximum suppression (NMS), which often erroneously deletes true positives when one pedestrian occludes another. SOTA transformer-based models have no such NMS step, yet they still fail to detect highly occluded pedestrians. In this paper, we study the reasons for such failures. We observe that such models first predict key-points and then compute attention at those specific key-points. Our analysis reveals that the key-points have no preference for semantically important body parts. Under heavy occlusion, such key-points end up attending to non-discriminative regions or background, leading to false negatives. We take inspiration from the conventional wisdom of detecting objects using their parts, and bias the attention of the proposed transformer architecture towards semantically important and highly discriminative human body parts. The intervention leads to SOTA results on the benchmark Citypersons and Caltech datasets, achieving miss rates (lower is better) of 30.75% and 32.96% respectively, against 32.6% and 38.2% for the current SOTA. Code is available at https://ajayshastry08.github.io/pa_dino
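The NMS failure mode mentioned in the abstract can be illustrated with a minimal sketch of generic greedy NMS. This is not code from the paper: the function names `iou` and `greedy_nms`, the box coordinates, the scores, and the 0.5 threshold are all assumptions chosen for illustration. Two heavily overlapping boxes stand in for one pedestrian occluding another, and the lower-scoring one is discarded even though it would be a true positive.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Keep boxes in descending score order, dropping any box whose IoU
    with an already kept box exceeds iou_threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Hypothetical detections: boxes 0 and 1 are two distinct pedestrians,
# one partly hidden behind the other; box 2 is an isolated pedestrian.
boxes = [(100, 50, 160, 230), (115, 50, 175, 230), (400, 60, 450, 220)]
scores = [0.95, 0.80, 0.90]

kept = greedy_nms(boxes, scores)
print(kept)  # box 1 is suppressed although it is a separate person
```

In this toy setup the occluded pedestrian (index 1) overlaps the front pedestrian with an IoU of 0.6, so a 0.5 threshold removes it. Raising the threshold would keep it but would also let through more duplicate detections of the same person, which is the trade-off that motivates NMS-free detectors.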

Pages: 3085-3089

Page count: 5
