PARTS BASED ATTENTION FOR HIGHLY OCCLUDED PEDESTRIAN DETECTION WITH TRANSFORMERS

被引:0
作者
Shastry, K. N. Ajay [1 ]
Chaudhari, Jayesh [1 ]
Thapar, Daksh [2 ]
Nigam, Aditya [2 ]
Arora, Chetan [1 ]
机构
[1] Indian Inst Technol, Delhi, India
[2] Indian Inst Technol, Mandi, Himachal Prades, India
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
关键词
D O I
10.1109/ICIP49359.2023.10222651
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the significant progress made in pedestrian detection in last decade, detecting pedestrians under heavy occlusion still remains a challenging problem. In state of the art (SOTA), convolutional neural network (CNN) based models, the reason is attributed to non-maximal-suppression (NMS), which often erroneously deletes true positives when one pedestrian is occluding other. SOTA transformer based models do not have such NMS step, yet fail to detect highly occluded pedestrians. In this paper, we study the reasons for such failures. We observe that such models first predict key-points, and then compute the attention at the specific key-points. Our analysis reveals that the key-points do not have any preference towards semantically important body parts. Under heavy occlusion, such key-points end up attending to non-discriminative regions or background, leading to false negatives. We take inspiration from the conventional wisdom of detecting objects using their parts, and bias the attention of proposed transformer architecture towards semantically important, and highly discriminative human body parts. The intervention leads to SOTA results on benchmark Citypersons and Caltech datasets, achieving 30.75%, and 32.96% miss-rate (lower is better) respectively, against 32.6%, and 38.2% by the current SOTA. Code is available at https://ajayshastry08.github.io/pa_dino
引用
收藏
页码:3085 / 3089
页数:5
相关论文
共 50 条
[21]   Discriminative Feature Transformation for Occluded Pedestrian Detection [J].
Zhou, Chunluan ;
Yang, Ming ;
Yuan, Junsong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9556-9565
[22]   Progressive Refinement Network for Occluded Pedestrian Detection [J].
Song, Xiaolin ;
Zhao, Kaili ;
Chu, Wen-Sheng ;
Zhang, Honggang ;
Guo, Jun .
COMPUTER VISION - ECCV 2020, PT XXIII, 2020, 12368 :32-48
[23]   Feature Calibration Network for Occluded Pedestrian Detection [J].
Zhang, Tianliang ;
Ye, Qixiang ;
Zhang, Baochang ;
Liu, Jianzhuang ;
Zhang, Xiaopeng ;
Tian, Qi .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) :4151-4163
[24]   Occluded pedestrian detection combined with semantic features [J].
Ruan, Binjie ;
Zhang, Chongyang .
IET IMAGE PROCESSING, 2021, 15 (10) :2292-2300
[25]   A Cascade Framework for Unoccluded and Occluded Pedestrian Detection [J].
Ankit, Aayush ;
Ahmad, Irfan Riaz ;
Shin, Hyunchul .
2014 IEEE STUDENTS' TECHNOLOGY SYMPOSIUM (IEEE TECHSYM), 2014, :62-67
[26]   A Study on Occluded Pedestrian Detection Based on Block-based Features and Ensemble Classifier [J].
Wu Bin ;
Qu Shiru .
2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, :4710-4715
[27]   A Pedestrian Detection Algorithm Based on Channel Attention Mechanism [J].
Li, Weidong ;
Han, Shuang ;
Liu, Yang .
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, :5954-5959
[28]   Nighttime Pedestrian Detection Based on Feature Attention and Transformation [J].
Li, Gang ;
Zhang, Shanshan ;
Yang, Jian .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :9180-9187
[29]   Infrared Pedestrian Detection Method Based on Attention Model [J].
Pattern Recognition and Image Analysis, 2021, 31 :821-829
[30]   Infrared Pedestrian Detection Method Based on Attention Model [J].
Wang, Wenxiu .
PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (04) :821-829