Method and practice of microphone array speech source localization based on sound propagation modeling

被引:0
作者
Meng, Gang [1 ]
Yang, Chao [1 ]
Guo, Hui [1 ]
Wang, Yansong [1 ]
机构
[1] Shanghai University of Engineering Science, Shanghai
关键词
Far-field sound model; Microphone array; Near-field sound model; Speech source localization; Time delay algorithm;
D O I
10.2478/amns-2024-2681
中图分类号
学科分类号
摘要
This paper realizes the speech source localization for microphone arrays based on the sound propagation model. According to the actual environment and location of the sound source, this paper divides the sound source into far-field source and near-field source and constructs the far-field sound model and near-field sound model applicable to the microphone array. The TDOA time-delayed localization algorithm is employed to locate the voice source of the microphone array by judging the sound far and near the field. In the localization test, this paper selects microphones to form an array according to the actual needs and preprocesses the sound signal data required for practice. The preprocessing data and sound source localization practice prove that the microphone array speech source localization algorithm used in this paper can effectively estimate the actual position of the sound source, and the absolute error between its estimated sound source position and the actual sound source position is only about 0.3m. © 2024 Gang Meng et al., published by Sciendo.
引用
收藏
相关论文
共 26 条
[1]  
Pech M., Vrchota J., Bednar J., Predictive maintenance and intelligent sensors in smart factory, Sensors, 21, 4, (2021)
[2]  
Hu L., Miao Y., Wu G., Hassan M.M., Humar I., iRobot-Factory: An intelligent robot factory based on cognitive manufacturing and edge computing, Future Generation Computer Systems, 90, pp. 569-577, (2019)
[3]  
Guo K., Wan X., Liu L., Gao Z., Yang M., Fault diagnosis of intelligent production line based on digital twin and improved random forest, Applied Sciences, 11, 16, (2021)
[4]  
Castellini P., Giulietti N., Falcionelli N., Dragoni A.F., Chiariotti P., A neural network based microphone array approach to grid-less noise source localization, Applied Acoustics, 177, (2021)
[5]  
Grondin F., Michaud F., Lightweight and optimized sound source localization and tracking methods for open and closed microphone array configurations, Robotics and Autonomous Systems, 113, pp. 63-80, (2019)
[6]  
Yang J., Banerjee G., Gupta V., Lam M.S., Landay J.A., Soundr: Head position and orientation prediction using a microphone array, Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, pp. 1-12, (2020)
[7]  
Dey N., Ashour A.S., Dey N., Ashour A.S., Microphone array principles, Direction of arrival estimation and localization of multi-speech sources, pp. 5-22, (2018)
[8]  
Scheibler R., Azcarreta J., Beuchat R., Ferry C., Pyramic: Full stack open microphone array architecture and dataset, 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 226-230, (2018)
[9]  
Su D., Vidal-Calleja T., Miro J.V., Towards real-time 3D sound sources maping with linear microphone arrays, 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 1662-1668, (2017)
[10]  
Padois T., Sgard F., Doutres O., Berry A., Acoustic source localization using a polyhedral microphone array and an improved generalized cross-correlation technique, Journal of Sound and Vibration, 386, pp. 82-99, (2017)