Improving the Robustness of Pedestrian Detection in Autonomous Driving With Generative Data Augmentation

被引:6
作者
Wu, Yalun [1 ]
Xiang, Yingxiao [2 ]
Tong, Endong [1 ]
Ye, Yuqi [1 ]
Cui, Zhibo [1 ]
Tian, Yunzhe [1 ]
Zhang, Lejun [3 ]
Liu, Jiqiang [1 ]
Han, Zhen [1 ]
Niu, Wenjia [1 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Transp, Beijing 100044, Peoples R China
[2] Inst Informat Engn, Chinese Acad Sci, Beijing 100085, Peoples R China
[3] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Peoples R China
来源
IEEE NETWORK | 2024年 / 38卷 / 03期
关键词
Pedestrians; Data augmentation; Data models; Autonomous vehicles; Feature extraction; Semantics; Image capture; pedestrian detection; diffusion model; generative data augmentation; image caption;
D O I
10.1109/MNET.2024.3366232
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Pedestrian detection plays a crucial role in autonomous driving by identifying the position, size, orientation, and dynamic features of pedestrians in images or videos, assisting autonomous vehicles in making better decisions and controls. It's worth noting that the performance of pedestrian detection models largely depends on the quality and diversity of available training data. Current datasets for autonomous driving have limitations in terms of diversity, scale, and quality. In recent years, numerous studies have proposed the use of data augmentation strategies to expand the coverage of datasets, aiming to maximize the utilization of existing training data. However, these data augmentation methods often overlook the diversity of data scenarios. To overcome this challenge, in this paper, we propose a more comprehensive method for data augmentation, based on image descriptions and diffusion models. This method aims to cover a wider range of scene variations, including different weather conditions and lighting situations. We have designed a classifier to select data samples for augmentation, followed by extracting visual features based on image captions and converting them into high-level semantic information as textual descriptions for the corresponding samples. Finally, we utilize diffusion models to generate new variants. Additionally, we have designed three modification patterns to increase diversity in aspects such as weather conditions, lighting, and pedestrian poses within the data. We conducted extensive experiments on the KITTI dataset and in real-world environments, demonstrating that our proposed method significantly enhances the performance of pedestrian detection models in complex scenarios. This meticulous consideration of data augmentation will notably enhance the applicability and robustness of pedestrian detection models in actual autonomous driving scenarios.
引用
收藏
页码:63 / 69
页数:7
相关论文
共 50 条
[41]   Improving the IoT Attack Classification Mechanism with Data Augmentation for Generative Adversarial Networks [J].
Chu, Hung-Chi ;
Lin, Yu-Jhe .
APPLIED SCIENCES-BASEL, 2023, 13 (23)
[42]   Using a Diffusion Model for Pedestrian Trajectory Prediction in Semi-Open Autonomous Driving Environments [J].
Tang, Yingjuan ;
He, Hongwen ;
Wang, Yong ;
Wu, Yifan .
IEEE SENSORS JOURNAL, 2024, 24 (10) :17208-17218
[43]   Data Augmentation for Improving Explainability of Hate Speech Detection [J].
Gunjan Ansari ;
Parmeet Kaur ;
Chandni Saxena .
Arabian Journal for Science and Engineering, 2024, 49 :3609-3621
[44]   Improving Intrusion Detection Through Training Data Augmentation [J].
Otokwala, Uneneibotejit ;
Petrovski, Andrei ;
Kalutarage, Harsha .
2021 14TH INTERNATIONAL CONFERENCE ON SECURITY OF INFORMATION AND NETWORKS (SIN 2021), 2021,
[45]   Data Augmentation for Improving Explainability of Hate Speech Detection [J].
Ansari, Gunjan ;
Kaur, Parmeet ;
Saxena, Chandni .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (03) :3609-3621
[46]   An Explainable Deep Learning-Based Method for Schizophrenia Diagnosis Using Generative Data-Augmentation [J].
Saadatinia, Mehrshad ;
Salimi-Badr, Armin .
IEEE ACCESS, 2024, 12 :98379-98392
[47]   Monocular three-dimensional object detection using data augmentation and self-supervised learning in autonomous driving [J].
Thayalan, Sugirtha ;
Muthukumarasamy, Sridevi ;
Santhakumar, Khailash ;
Ravi, Kiran Bangalore ;
Liu, Hao ;
Gauthier, Thomas ;
Yogamani, Senthil .
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
[48]   Fallen person detection for autonomous driving [J].
Lee, Suhyeon ;
Lee, Sangyong ;
Seong, Hongje ;
Hyun, Junhyuk ;
Kim, Euntai .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
[49]   Using Generative Adversarial Networks for Data Augmentation in Android Malware Detection [J].
Chen, Yi-Ming ;
Yang, Chun-Hsien ;
Chen, Guo-Chung .
2021 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (DSC), 2021,
[50]   A Novel Data Augmentation Method for Improved Visual Crack Detection Using Generative Adversarial Networks [J].
Branikas, Efstathios ;
Murray, Paul ;
West, Graeme .
IEEE ACCESS, 2023, 11 :22051-22059