Mel-spectrogram and Deep CNN Based Representation Learning from Bio-Sonar Implementation on UAVs

被引:11
作者
Tanveer, M. Hassan [1 ]
Zhu, Hongxiao [2 ]
Ahmed, Waqar [3 ]
Thomas, Antony [4 ]
Imran, Basit Muhammad [5 ]
Salman, Muhammad [6 ]
机构
[1] Kennesaw State Univ, Robot & Mechatron Engn, Marietta, GA 30060 USA
[2] Virginia Tech, Dept Stat, Blacksburg, VA USA
[3] DITEN Univ Genova, PAVIS Ist Italiano Tecnol, Genoa, Italy
[4] Univ Genoa, DIBRIS, Genoa, Italy
[5] Virginia Tech, Mech Engn, Blacksburg, VA USA
[6] Kennesaw State Univ, Dept Mech Engn, Marietta, GA USA
来源
2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021) | 2021年
关键词
Unmanned Aerial Vehicle; Deep Convolutional Neural Networks; Mel-Spectogram; Unknown Environment; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1109/ICCCR49711.2021.9349416
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present an approach for estimating the leaf density of trees while navigating in a forest. To this end, we consider an Unmanned Aerial Vehicle (UAV) equipped with a biosonar sensor that mimics the sonar sensors of echolocating bats. Such sensors provide a light-weight and cost-effective alternative to other widely used sensors such as camera, LiDAR and are gaining popularity among the robotics research community. The obtained echo signals during UAV navigation are processed to obtain the leaf density in the main lobe of the sonar first using a mel spectogram and then a Deep Convolutional Neural Network (CNN) trained on a set of known environment. We further evaluate our approach in simulation by considering trees with different leaf density (that is, resolution). It is seen that our method achieves promising results with an accuracy of 98.7%.
引用
收藏
页码:220 / 224
页数:5
相关论文
共 32 条
[1]   Convolutional Neural Networks for Speech Recognition [J].
Abdel-Hamid, Ossama ;
Mohamed, Abdel-Rahman ;
Jiang, Hui ;
Deng, Li ;
Penn, Gerald ;
Yu, Dong .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) :1533-1545
[2]  
[Anonymous], 2020, INDIAN J SCI TECHNOL, DOI DOI 10.17485/IJST/v13i34.813
[3]  
[Anonymous], 2013, ISMIR
[4]  
Cakir E, 2015, IEEE IJCNN
[5]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[6]  
Cirean Dan, 2012, P NEURAL INFORM PROC, V25, P2
[7]   Multi-column deep neural network for traffic sign classification [J].
Ciresan, Dan ;
Meier, Ueli ;
Masci, Jonathan ;
Schmidhuber, Juergen .
NEURAL NETWORKS, 2012, 32 :333-338
[8]  
Cotton CV, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P69, DOI 10.1109/ASPAA.2011.6082331
[9]  
de Magistris G., 2019, OCEANS-IEEE, P1, DOI DOI 10.23919/oceans40490.2019.8962860
[10]  
Deng L, 2013, INT CONF ACOUST SPEE, P6669, DOI 10.1109/ICASSP.2013.6638952