IMPROVING A FASTER R-CNN MODEL FOR VEHICLE DETECTION AND HUMAN ACTION RECOGNITION AT NIGHT VIA INFRARED THERMAL IMAGING USING TRANSFER LEARNING

Cited by: 0
Authors
Liu, Yaru [1]
Matsui, Kai [1]
Kageyama, Yoichi [1]
Shirai, Hikaru [1]
Ishizawa, Chikako [1]
Affiliation
[1] Akita Univ, Grad Sch Engn Sci, 1-1 Tegata Gakuen Machi, Akita, Akita 0108502, Japan
Source
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2024 / Vol. 20 / No. 06
Keywords
Vehicle detection; Human action recognition; Infrared image; Nighttime; Faster R-CNN;
DOI
10.24507/ijicic.20.06.1573
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
At present, the older population is the fastest-growing segment of the driving population, which has led to higher rates of traffic accidents. Data on the number of casualties in accidents involving pedestrians and motor vehicles during the day and at night indicate that the proportion of fatalities is significantly higher at night. Consequently, focusing on the traffic safety of the elderly, reducing the occurrence of nighttime traffic accidents, and promoting sustainability are crucial for Japan, which faces the challenge of becoming a "super-aging" society. Thus, we propose a system to support the safety and security of pedestrians and drivers using infrared thermal imaging data at night. In previous studies, we developed methods to detect pedestrian actions using a novel convolutional neural network (CNN)-based model, specifically VGG16. In this study, we propose improvements to an existing detection method using an improved Faster R-CNN model to detect vehicles and recognize human actions in real time at night. We acquired new video data demonstrating multiple human actions related to distant target objects captured by the infrared thermal camera. These data can be used to investigate vehicle detection and action recognition in scenes involving multiple humans using transfer learning. We experimentally evaluated the performance of our method in terms of detection accuracy, and the results indicate that our proposed method achieved a mean average precision of 0.97 in detecting actions in scenes with multiple people positioned far from the camera, exhibiting superior accuracy compared to conventional methods.
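The abstract reports its result as a mean average precision (mAP). As a rough illustration of what that metric measures, the sketch below computes per-class average precision from scored bounding boxes and averages it over classes. It is not the authors' evaluation code: the IoU threshold of 0.5, the all-point interpolation of the precision-recall curve, the class names `vehicle` and `walking`, and the toy boxes are all assumptions made for the example, since the abstract does not state the evaluation protocol.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(detections, ground_truth, iou_thresh=0.5):
    """AP for one class.

    detections:   list of (image_id, score, box)
    ground_truth: dict mapping image_id -> list of boxes
    iou_thresh:   assumed matching threshold (0.5 is a common choice)
    """
    n_gt = sum(len(boxes) for boxes in ground_truth.values())
    if n_gt == 0:
        return 0.0
    matched = {img: [False] * len(boxes) for img, boxes in ground_truth.items()}

    # Visit detections from most to least confident and mark each as TP or FP.
    hits = []
    for img, _score, box in sorted(detections, key=lambda d: -d[1]):
        best_j, best_iou = -1, 0.0
        for j, gt_box in enumerate(ground_truth.get(img, [])):
            overlap = iou(box, gt_box)
            if overlap > best_iou:
                best_j, best_iou = j, overlap
        hit = best_j >= 0 and best_iou >= iou_thresh and not matched[img][best_j]
        if hit:
            matched[img][best_j] = True  # each ground-truth box matches once
        hits.append(hit)

    # Precision and recall after each detection.
    precisions, recalls, tp = [], [], 0
    for k, hit in enumerate(hits, start=1):
        tp += hit
        precisions.append(tp / k)
        recalls.append(tp / n_gt)

    # Make precision non-increasing in recall, then integrate the curve.
    for k in range(len(precisions) - 2, -1, -1):
        precisions[k] = max(precisions[k], precisions[k + 1])
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


def mean_average_precision(dets_by_class, gts_by_class, iou_thresh=0.5):
    """Unweighted mean of per-class AP over the classes that have ground truth."""
    aps = [
        average_precision(dets_by_class.get(c, []), gts_by_class[c], iou_thresh)
        for c in sorted(gts_by_class)
    ]
    return sum(aps) / len(aps) if aps else 0.0


# Toy data (hypothetical): two classes, two images.
toy_gts = {
    "vehicle": {0: [(0, 0, 4, 4)]},
    "walking": {0: [(0, 0, 10, 10)], 1: [(0, 0, 10, 10)]},
}
toy_dets = {
    "vehicle": [(0, 0.95, (0, 0, 4, 4))],
    "walking": [
        (0, 0.9, (0, 0, 10, 10)),    # correct
        (1, 0.8, (20, 20, 30, 30)),  # false positive
        (1, 0.7, (0, 0, 10, 10)),    # correct, but ranked below the false positive
    ],
}
print(mean_average_precision(toy_dets, toy_gts))
```

In the toy data, the false positive that outranks a correct detection lowers the `walking` AP below 1, which in turn pulls the mean down. This is why a high mAP requires both accurate boxes and well-ordered confidence scores.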
Pages: 1573-1585
Page count: 13