Speaker identification using Ultra-Wideband measurement of voice

被引:2
作者
Li, Haoxuan [1 ]
Tang, Chong [2 ]
Vishwakarma, Shelly [2 ]
Ge, Yao [3 ]
Li, Wenda [1 ]
机构
[1] Univ Dundee, Dept Biomed Engn, Dundee, Scotland
[2] Univ Southampton, Dept Elect & Comp Sci, Southampton, England
[3] Univ Glasgow, James Watt Sch Engn, Glasgow City, Scotland
关键词
Biometric identification; ResNet; Speaker identification; UWB radar; Voice recognition;
D O I
10.1049/rsn2.12525
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Voice identification is being increasingly adopted in various domains, including security infrastructures, intelligent home systems, and personalised digital assistants. Notably, it harbours significant promise in transforming healthcare, especially in electronic health record detecting and speech impairment monitoring such as aphasia. Current strategies such as acoustic models based on deep learning, voice bio-metrics, and spectrogram analysis, have been identified with several drawbacks including vulnerability to altered voices, susceptibility to ambient noise, and the necessity for significant computational power. In response to these issues, the authors introduce a ground-breaking method of voice identification using Ultra-Wideband (UWB) technology. This method capitalises on the micro-Doppler shifts associated with movements of the laryngeal prominence. The distinctive nature of these bio-metric traits related to speech production provides superior resistance against common pitfalls of voice identification. The proposed model leverages the high-resolution characteristics of UWB to register tiny variations in laryngeal movements produced during speech, thus forming a distinct voice profile for each speaker. Through rigorous testing, the proposed system demonstrated significant progress in voice identification, achieving close to 90% accuracy in controlled experimental settings. This breakthrough indicates that UWB-enabled voice identification could have a profound effect on medical applications, providing potential improvements in diagnosing, monitoring, possibly treating speech disorders, and thereby shaping a future of enhanced and secured healthcare services. A ground-breaking method of voice identification using UWB technology is introduced. The proposed model leverages the high-resolution characteristics of UWB to register tiny variations in laryngeal movements produced during speech, thus forming a distinct voice profile for each speaker.image
引用
收藏
页码:266 / 276
页数:11
相关论文
共 20 条
  • [1] [Anonymous], NOVELDA XETHRU X4 RA
  • [2] [Anonymous], VOICE RECOGNITION TE
  • [3] Driving Activity Recognition Using UWB Radar and Deep Neural Networks
    Brishtel, Iuliia
    Krauss, Stephan
    Chamseddine, Mahdi
    Rambach, Jason Raphael
    Stricker, Didier
    [J]. SENSORS, 2023, 23 (02)
  • [4] Chen Mei, 2020, Healthc Manage Forum, V33, P10, DOI 10.1177/0840470419873123
  • [5] Ge Y., 2023, LARGE SCALE MULTIMOD
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] A Novel Pathological Voice Identification Technique through Simulated Cochlear Implant Processing Systems
    Islam, Rumana
    Abdel-Raheem, Esam
    Tarique, Mohammed
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [8] Kat LW, 1999, INT CONF ACOUST SPEE, P221, DOI 10.1109/ICASSP.1999.758102
  • [9] Li WK, 2016, FIELDS I COMMUN, V78, P1, DOI 10.1007/978-1-4939-6568-7_1
  • [10] Decomposition of Multicomponent Micro-Doppler Signals Based on HHT-AMD
    Li, Wenchao
    Kuang, Gangyao
    Xiong, Boli
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (10):