A Large-Scale UAV Audio Dataset and Audio-Based UAV Classification Using CNN

Cited by: 8
Authors
Wang, Yaqin [1]
Chu, Zhiwei [1]
Ku, Ilmun [2]
Smith, E. Cho [1]
Matson, Eric T. [1]
Affiliations
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Hankuk Univ Foreign Studies, Seoul, South Korea
Source
2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC | 2022
Keywords
Drone Audio Dataset; UAV Classification; Machine Learning; Convolutional Neural Network; PARAMETRIC REPRESENTATIONS;
DOI
10.1109/IRC55401.2022.00039
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
The increased popularity and accessibility of UAVs may create potential threats. Researchers have been developing UAV detection and classification systems with different methods, including audio-based approach. However, the number of publicly available UAV audio datasets is limited. To fill this gap, we selected 10 different UAVs, ranging from toy hand drones to Class I drones, and recorded a total of 5215 seconds length of audio data generated from the flying UAVs. To the best of our knowledge, the proposed dataset is the largest audio dataset for UAVs so far. We further implemented a convolutional neural network (CNN) model for 10-class UAV classification and trained the model with the collected data. The overall test accuracy of the trained model is 97.7% and the test loss is 0.085.
Pages: 186-189
Page count: 4
Related papers
50 items in total
[31]   Simulated Dataset for the Loaded vs. Unloaded UAV Classification Problem Using Deep Learning [J].
Azad, Hamid ;
Mehta, Varun ;
Bolic, Miodrag ;
Mantegh, Iraj .
2023 IEEE SENSORS APPLICATIONS SYMPOSIUM, SAS, 2023,
[32]   Large Scale Image Classification Based on CNN and Parallel SVM [J].
Sun, Zhanquan ;
Li, Feng ;
Huang, Huifen .
NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 :545-555
[33]   Large-Scale Reality Modeling of a University Campus Using Combined UAV and Terrestrial Photogrammetry for Historical Preservation and Practical Use [J].
Berrett, Bryce E. ;
Vernon, Cory A. ;
Beckstrand, Haley ;
Pollei, Madi ;
Markert, Kaleb ;
Franke, Kevin W. ;
Hedengren, John D. .
DRONES, 2021, 5 (04)
[34]   Large-Scale Product Classification via Spatial Attention Based CNN Learning and Multi-class Regression [J].
Ai, Shanshan ;
Jia, Caiyan ;
Chen, Zhineng .
MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :176-188
[35]   A Large-Scale Photonic CNN Based on Spike Coding and Temporal Integration [J].
Zhang, Junfeng ;
Ma, Bowen ;
Zhao, Yang ;
Zou, Weiwen .
IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2023, 29 (06)
[36]   Real Time Audio-Based Distress Signal Detection as Vital Signs of Myocardial Infarction Using Convolutional Neural Networks [J].
Mohan, H. M. ;
Anitha, S. .
JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (02) :106-116
[37]   ENHANCED METHOD OF AUDIO CODING USING CNN-BASED SPECTRAL RECOVERY WITH ADAPTIVE STRUCTURE [J].
Shin, Seong-Hyeon ;
Beack, Seung Kwon ;
Lim, Wootaek ;
Park, Hochong .
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, :351-355
[38]   Assessment of Dataset Scalability for Classification of Black Sigatoka in Banana Crops Using UAV-Based Multispectral Images and Deep Learning Techniques [J].
Linero-Ramos, Rafael ;
Parra-Rodriguez, Carlos ;
Espinosa-Valdez, Alexander ;
Gomez-Rojas, Jorge ;
Gongora, Mario .
DRONES, 2024, 8 (09)
[39]   A novel approach for vegetation classification using UAV-based hyperspectral imaging [J].
Ishida, Tetsuro ;
Kurihara, Junichi ;
Angelico Viray, Fra ;
Baes Namuco, Shielo ;
Paringit, Enrico C. ;
Jane Perez, Gay ;
Takahashi, Yukihiro ;
Joseph Marciano, Joel, Jr. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 144 :80-85
[40]   Precision Assessment of COVID-19 Phenotypes Using Large-Scale Clinic Visit Audio Recordings: Harnessing the Power of Patient Voice [J].
Barr, Paul J. ;
Ryan, James ;
Jacobson, Nicholas C. .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (02)