Two-Stage Pedestrian Detection Model Using a New Classification Head for Domain Generalization

被引:2
作者
Schulz, Daniel [1 ,2 ,3 ]
Perez, Claudio A. [1 ,2 ,3 ]
机构
[1] Univ Chile, Dept Elect Engn, Santiago 8370451, Chile
[2] Univ Chile, Adv Min Technol Ctr, Santiago 8370451, Chile
[3] Ctr Intervent Med Precis & Adv Cellular Therapy, IMPACT, Santiago 7620086, Chile
关键词
pedestrian detection; domain generalization; object detection; triplet loss; two-stage detection; RECOGNITION;
D O I
10.3390/s23239380
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Pedestrian detection based on deep learning methods have reached great success in the past few years with several possible real-world applications including autonomous driving, robotic navigation, and video surveillance. In this work, a new neural network two-stage pedestrian detector with a new custom classification head, adding the triplet loss function to the standard bounding box regression and classification losses, is presented. This aims to improve the domain generalization capabilities of existing pedestrian detectors, by explicitly maximizing inter-class distance and minimizing intra-class distance. Triplet loss is applied to the features generated by the region proposal network, aimed at clustering together pedestrian samples in the features space. We used Faster R-CNN and Cascade R-CNN with the HRNet backbone pre-trained on ImageNet, changing the standard classification head for Faster R-CNN, and changing one of the three heads for Cascade R-CNN. The best results were obtained using a progressive training pipeline, starting from a dataset that is further away from the target domain, and progressively fine-tuning on datasets closer to the target domain. We obtained state-of-the-art results, MR-2 of 9.9, 11.0, and 36.2 for the reasonable, small, and heavy subsets on the CityPersons benchmark with outstanding performance on the heavy subset, the most difficult one.
引用
收藏
页数:23
相关论文
共 50 条
[31]   A Two-Stage Neural Network Model for Automatic Detection of Skiffs from Satellite Imagery [J].
Ducaru, Tudor ;
Stoica, Stefan ;
Holban, Vlad ;
Minhas, Fayyaz .
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
[32]   Two-Stage Training of Graph Neural Networks for Graph Classification [J].
Manh Tuan Do ;
Noseong Park ;
Kijung Shin .
Neural Processing Letters, 2023, 55 :2799-2823
[33]   Two-Stage Framework for Specialty Vehicles Detection and Classification: Toward Intelligent Visual Surveillance of Airport Surface [J].
Ding, Meng ;
Zhou, Wenhui ;
Xu, Yiming ;
Xu, Yubin .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) :1912-1923
[34]   Two-stage grasp strategy combining CNN-based classification and adaptive detection on a flexible hand [J].
Chen, Xiaoyan ;
Sun, Yilin ;
Zhang, Qiuju ;
Liu, Fei .
APPLIED SOFT COMPUTING, 2020, 97
[35]   Effective pedestrian detection using deformable part model based on human model [J].
Hye Ji Choi ;
Yoon Suk Lee ;
Duk-Sun Shim ;
Chan Gun Lee ;
Kwang Nam Choi .
International Journal of Control, Automation and Systems, 2016, 14 :1618-1625
[36]   Effective Pedestrian Detection using Deformable Part Model based on Human Model [J].
Choi, Hye Ji ;
Lee, Yoon Suk ;
Shim, Duk-Sun ;
Lee, Chan Gun ;
Choi, Kwang Nam .
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2016, 14 (06) :1618-1625
[37]   A two-stage text detection approach using gradient point adjacency and deep network [J].
Khan, Tauseef ;
Mollah, Ayatullah Faruk .
INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (02) :152-165
[38]   MetaGON: A Lightweight Pedestrian Re-Identification Domain Generalization Model Adapted to Edge Devices [J].
Peng, Xiting ;
Zhao, Huaxuan ;
Zhang, Xiaoyu ;
Zhang, Chaofeng ;
Chen, Caijuan .
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 :690-699
[39]   A Two-Stage Anomaly Detection Method Based on User Preference Features and the Deep Fusion Model [J].
Zhang, Sen-Lei ;
Zhang, Bin ;
Zhou, Yi-Tao ;
Guo, Yue-Xuan ;
Tan, Jing-Lei .
APPLIED SCIENCES-BASEL, 2023, 13 (10)
[40]   Approach to Part using Deformable Part Model in Pedestrian Detection System [J].
Choi, Hye Ji ;
Shin, Nara ;
Choi, Kwang Nam .
FIRST INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2016, 0011