Sound source localization for auditory perception of a humanoid robot using deep neural networks

被引:0
作者
G. Boztas
机构
[1] Firat University,Department of Electrical and Electronics Engineering, Faculty of Technology
来源
Neural Computing and Applications | 2023年 / 35卷
关键词
Deep learning; Time-series estimation; Humanoid robot; Sound source location; Auditory perception;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents an estimation of the sound source location using deep neural networks in order to provide auditory perception of a humanoid robot. Estimation of a moving sound source is crucial for a humanoid robot to improve functionality in some environments where the robot’s camera cannot operate. It plays an important role, especially in a recovery scenario with no visual contact. In this study, the data of the sound source around the robot were recorded by four microphones placed on the humanoid robot’s head. A wheeled robot was used to obtain the sound source with odometry. Recorded sound dataset and collected odometry dataset were used as input data and target data, respectively. The discrete wavelet transform (DWT) was applied for pre-processing of the input data. After pre-processing, the obtained matrices were applied as inputs of the proposed convolutional neural network (CNN), long short-term memory (LSTM), bidirectional long-short-term memory (biLSTM), and multilayer perceptron (MLP) networks to estimate the sound source location around the humanoid robot. As a result of all tests for the estimation models created by proposed networks, the R2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R^2$$\end{document} metrics of the biLSTM structure were obtained as approximately 0.97. This study showed experimentally that humanoid robots can sense the position of sound source in the environment with sufficient accuracy like many living creatures.
引用
收藏
页码:6801 / 6811
页数:10
相关论文
共 96 条
[1]  
Saeedvand S(2019)A comprehensive survey on humanoid robot development The Knowledge Engineering Review 34 1-18
[2]  
Jafari M(2021)A flexible multi-functional smart skin for force, touch position, proximity, and humidity sensing for humanoid robots IEEE Sens J 21 26355-26363
[3]  
Aghdasi HS(2021)A literature review of sensor heads for humanoid robots Robot Auton Syst 143 21-32
[4]  
Baltes J(2021)Emotion space modelling for social robots Eng Appl Art Intell 100 93-101
[5]  
Dai Yanning(2020)Performing predefined tasks using the human-robot interaction on speech recognition for an industrial robot Eng Appl Artif Intell 95 1-21
[6]  
Gao Shuo(2015)Classification of reverberant audio signals using clustered ad hoc distributed microphones Signal Process 107 3510-3524
[7]  
Rojas-Quintero JA(2005)Revisiting trilateration for robot localization IEEE Transact Robot 21 669-686
[8]  
Rodríguez-Liñán MC(2021)Cluster analysis and model comparison using smart meter data Sensors 21 103255-103262
[9]  
Yan Fei(2021)Stability analysis of the modified levenberg-marquardt algorithm for the artificial neural network training IEEE Transact Neural Netw Learn Syst 32 285-308
[10]  
Iliyasu Abdullah M(2021)Adapting h-infinity controller for the desired reference tracking of the sphere position in the maglev process Inform Sci 569 37-48