Silent Speech Interface Using Ultrasonic Doppler Sonar

被引:4
作者
Lee, Ki-Seung [1 ]
机构
[1] Konkuk Univ, Dept Elect Engn, Seoul 143701, South Korea
来源
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2020年 / E103D卷 / 08期
关键词
silent speech interface; ultrasonic Doppler; deep neural networks; RECOGNITION; SENSOR;
D O I
10.1587/transinf.2019EDP7211
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Some non-acoustic modalities have the ability to reveal certain speech attributes that can be used for synthesizing speech signals without acoustic signals. This study validated the use of ultrasonic Doppler frequency shifts caused by facial movements to implement a silent speech interface system. A 40kHz ultrasonic beam is incident to a speaker's mouth region. The features derived from the demodulated received signals were used to estimate the speech parameters. A nonlinear regression approach was employed in this estimation where the relationship between ultrasonic features and corresponding speech is represented by deep neural networks (DNN). In this study, we investigated the discrepancies between the ultrasonic signals of audible and silent speech to validate the possibility for totally silent communication. Since reference speech signals are not available in silently mouthed ultrasonic signals, a nearest-neighbor search and alignment method was proposed, wherein alignment was achieved by determining the optimal pair of ultrasonic and audible features in the sense of a minimum mean square error criterion. The experimental results showed that the performance of the ultrasonic Doppler-based method was superior to that of EMG-based speech estimation, and was comparable to an image-based method.
引用
收藏
页码:1875 / 1887
页数:13
相关论文
共 40 条
  • [21] Ultrasonic doppler sensor for voic activity detection
    Kalgaonkar, Kaustubh
    Hu, Rongquiang
    Raj, Bhiksha
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (10) : 754 - 757
  • [22] ONE-HANDED GESTURE RECOGNITION USING ULTRASONIC DOPPLER SONAR
    Kalgaonkar, Kaustubh
    Raj, Bhiksha
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1889 - +
  • [23] Generating Intelligible Audio Speech From Visual Speech
    Le Cornu, Thomas
    Milner, Ben
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (09) : 1447 - 1457
  • [24] EMG-based speech recognition using hidden Markov models with global control variables
    Lee, Ki-Seung
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2008, 55 (03) : 930 - 940
  • [25] Prediction of Acoustic Feature Parameters Using Myoelectric Signals
    Lee, Ki-Seung
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2010, 57 (07) : 1587 - 1595
  • [26] A NEW KIND OF NON-ACOUSTIC SPEECH ACQUISITION METHOD BASED ON MILLIMETER WAVE RADAR
    Li, S.
    Tian, Y.
    Lu, G.
    Zhang, Y.
    Xue, H.
    Wang, J.
    Jing, X.
    [J]. PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2012, 130 : 17 - 40
  • [27] Li S., 2008, Progress In Electromagnetics Research B, V9, P199, DOI 10.2528/PIERB08063001
  • [28] A 94-GHz Millimeter-Wave Sensor for Speech Signal Acquisition
    Li, Sheng
    Tian, Ying
    Lu, Guohua
    Zhang, Yang
    Lv, Hao
    Yu, Xiao
    Xue, Huijun
    Zhang, Hua
    Wang, Jianqi
    Jing, Xijing
    [J]. SENSORS, 2013, 13 (11) : 14248 - 14260
  • [29] Microwave Human Vocal Vibration Signal Detection Based on Doppler Radar Technology
    Lin, Chien-San
    Chang, Sheng-Fuh
    Chang, Chia-Chan
    Lin, Chun-Chi
    [J]. IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2010, 58 (08) : 2299 - 2306
  • [30] ON THE PHONETIC INFORMATION IN ULTRASONIC MICROPHONE SIGNALS
    Livescu, Karen
    Zhu, Bo
    Glass, James
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4621 - +