FINE TUNING DEEP LEARNING MODELS FOR PEDESTRIAN DETECTION

Cited by: 8

Authors:

Amisse, Caisse [1,2]

Jijon-Palma, Mario Ernesto [1]

Silva Centeno, Jorge Antonio [1]

Affiliations:

[1] Univ Fed Parana, Programa Posgrad Ciencias Geodes, Curitiba, Parana, Brazil

[2] Univ Rovuma, Dept Ciencias Nat, Nampula, Mozambique

Source:

BOLETIM DE CIENCIAS GEODESICAS | 2021 / Vol. 27 / No. 02

Keywords:

fine-tuning; pedestrian detection; training data; deep learning models; OBJECT DETECTION;

DOI:

10.1590/s1982-21702021000200013

Chinese Library Classification number:

P3 [Geophysics]; P59 [Geochemistry];

Discipline classification code:

0708 ; 070902 ;

Abstract:

Object detection in high-resolution images is a new challenge facing the remote sensing community, driven by the introduction of unmanned aerial vehicles and monitoring cameras. One area of interest is detecting and tracking people in the images. Unlike general objects, pedestrians can adopt different poses and undergo constant morphological changes while moving, so the task needs an intelligent solution. Fine-tuning has attracted great interest among researchers because it allows convolutional networks to be retrained for many interesting applications. For object classification, detection, and segmentation, fine-tuned models have shown state-of-the-art performance. In the present work, we evaluate the performance of fine-tuned models under varying amounts of training data by comparing Faster Region-based Convolutional Neural Network (Faster R-CNN) Inception v2, Single Shot MultiBox Detector (SSD) Inception v2, and SSD Mobilenet v2. To this end, the effect of varying the training data on performance metrics such as accuracy, precision, F1-score, and recall is taken into account. Testing the detectors showed that precision and recall are more sensitive to variation in the amount of training data. Across five variations of the amount of training data, we observe that proportions of 60%-80% consistently achieve highly comparable performance. In all variations of the training data, Faster R-CNN Inception v2 outperforms SSD Inception v2 and SSD Mobilenet v2 in the evaluated metrics, although the SSD models converge relatively quickly during the training phase. Overall, partitioning 80% of the total data for fine-tuning trained models produces efficient detectors even with only 700 data samples.
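The four metrics named in the abstract can be sketched from raw detection counts as follows. This is a minimal illustration using the standard definitions, not the authors' evaluation code. The function name `detection_metrics` and the example counts are hypothetical and do not come from the paper.

```python
def detection_metrics(tp, fp, fn, tn=0):
    """Compute accuracy, precision, recall and F1-score from raw counts.

    tp: true positives (pedestrians correctly detected)
    fp: false positives (detections that are not pedestrians)
    fn: false negatives (pedestrians that were missed)
    tn: true negatives (often undefined for detectors, so it defaults to 0)
    """
    # Guard every ratio against an empty denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    pr_sum = precision + recall
    f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
metrics = detection_metrics(tp=80, fp=20, fn=20)
print(metrics)
```

With these made-up counts, precision, recall, and F1-score each come to 0.8, while accuracy is lower (about 0.67) because both kinds of error enter its denominator.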


Pages: 16
