Speaker identification using Ultra-Wideband measurement of voice

Cited by: 2

Authors:

Li, Haoxuan [1]

Tang, Chong [2]

Vishwakarma, Shelly [2]

Ge, Yao [3]

Li, Wenda [1]

Affiliations:

[1] Univ Dundee, Dept Biomed Engn, Dundee, Scotland

[2] Univ Southampton, Dept Elect & Comp Sci, Southampton, England

[3] Univ Glasgow, James Watt Sch Engn, Glasgow City, Scotland

Source:

IET RADAR SONAR AND NAVIGATION | 2024 / Vol. 18 / Issue 02

Keywords:

Biometric identification; ResNet; Speaker identification; UWB radar; Voice recognition;

DOI:

10.1049/rsn2.12525

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

Voice identification is being increasingly adopted in various domains, including security infrastructures, intelligent home systems, and personalised digital assistants. Notably, it holds significant promise for transforming healthcare, especially in electronic health record detecting and in monitoring speech impairments such as aphasia. Current strategies, such as acoustic models based on deep learning, voice biometrics, and spectrogram analysis, have several drawbacks, including vulnerability to altered voices, susceptibility to ambient noise, and the need for significant computational power. In response to these issues, the authors introduce a method of voice identification using Ultra-Wideband (UWB) technology. This method capitalises on the micro-Doppler shifts associated with movements of the laryngeal prominence. The distinctive nature of these biometric traits related to speech production provides superior resistance against common pitfalls of voice identification. The proposed model leverages the high-resolution characteristics of UWB to register tiny variations in laryngeal movements produced during speech, thus forming a distinct voice profile for each speaker. In testing, the proposed system achieved close to 90% accuracy in controlled experimental settings. This result indicates that UWB-enabled voice identification could have a profound effect on medical applications, providing potential improvements in diagnosing, monitoring, and possibly treating speech disorders, and thereby shaping a future of enhanced and secured healthcare services.
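
The abstract names micro-Doppler shifts from laryngeal movement as the underlying signal. The sketch below is only an illustration of that general idea and is not the authors' pipeline: it simulates the slow-time return of a single vibrating point scatterer and computes a short-time Fourier transform of it with NumPy. The function names, sampling rate, vibration frequency, vibration amplitude, and wavelength are all assumptions chosen for the example; none of them come from the paper.

```python
import numpy as np


def simulate_vibrating_target(fs, duration, vib_freq, vib_amp, wavelength):
    """Simulate the slow-time baseband return of one vibrating point scatterer.

    Hypothetical helper: a sinusoidal displacement stands in for laryngeal
    motion. All parameter values passed in are illustrative assumptions.
    """
    t = np.arange(int(fs * duration)) / fs
    displacement = vib_amp * np.sin(2 * np.pi * vib_freq * t)
    # Two-way propagation: a displacement d changes the phase by 4*pi*d/lambda.
    phase = 4 * np.pi * displacement / wavelength
    return t, np.exp(1j * phase)


def micro_doppler_spectrogram(x, fs, win_len=256, hop=64):
    """Return (freqs, times, magnitude) of a Hann-windowed STFT of x.

    The magnitude array has shape (win_len, n_frames), with the Doppler
    axis centred on 0 Hz.
    """
    # Remove the mean to suppress the static (zero-Doppler) component.
    x = x - np.mean(x)
    window = np.hanning(win_len)
    n_frames = 1 + (len(x) - win_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + win_len] * window for i in range(n_frames)]
    )
    spec = np.fft.fftshift(np.fft.fft(frames, axis=1), axes=1)
    freqs = np.fft.fftshift(np.fft.fftfreq(win_len, d=1.0 / fs))
    times = (np.arange(n_frames) * hop + win_len / 2) / fs
    return freqs, times, np.abs(spec).T


# Illustrative values only (assumed, not taken from the paper).
fs = 1000.0          # slow-time sampling rate in Hz
t, x = simulate_vibrating_target(
    fs, duration=2.0, vib_freq=125.0, vib_amp=1e-4, wavelength=0.041
)
freqs, times, S = micro_doppler_spectrogram(x, fs)
dominant = freqs[np.argmax(S.mean(axis=1))]
print("dominant Doppler component (Hz):", abs(dominant))
```

For a vibration this small relative to the wavelength, the energy concentrates in sidebands at plus and minus the vibration frequency, which is the kind of time-frequency pattern a classifier such as a ResNet could take as an image-like input.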

Citation

Pages: 266-276

Page count: 11

20 references in total

[1] [Anonymous], NOVELDA XETHRU X4 RA
[2] [Anonymous], VOICE RECOGNITION TE
[3] Driving Activity Recognition Using UWB Radar and Deep Neural Networks
Brishtel, Iuliia
Krauss, Stephan
Chamseddine, Mahdi
Rambach, Jason Raphael
Stricker, Didier
[J]. SENSORS, 2023, 23 (02)
[4] Chen Mei, 2020, Healthc Manage Forum, V33, P10, DOI 10.1177/0840470419873123
[5] Ge Y., 2023, LARGE SCALE MULTIMOD
[6] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[7] A Novel Pathological Voice Identification Technique through Simulated Cochlear Implant Processing Systems
Islam, Rumana
Abdel-Raheem, Esam
Tarique, Mohammed
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (05):
[8] Kat LW, 1999, INT CONF ACOUST SPEE, P221, DOI 10.1109/ICASSP.1999.758102
[9] Li WK, 2016, FIELDS I COMMUN, V78, P1, DOI 10.1007/978-1-4939-6568-7_1
[10] Decomposition of Multicomponent Micro-Doppler Signals Based on HHT-AMD
Li, Wenchao
Kuang, Gangyao
Xiong, Boli
[J]. APPLIED SCIENCES-BASEL, 2018, 8 (10):
