Ultrasonic doppler sensor for voic activity detection

被引:27
作者
Kalgaonkar, Kaustubh [1 ]
Hu, Rongquiang
Raj, Bhiksha
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Ditech Networks Inc, Mountain View, CA 94043 USA
[3] Mitsubishi Elect Res Lab, Cambridge, MA 02139 USA
关键词
doppler; speech denoising; voice activity detection;
D O I
10.1109/LSP.2007.896450
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter describes a robust voice activity detector using an ultrasonic Doppler sonar device. An ultrasonic beam is incident on the talker's face. Facial movements result in Doppler frequency shifts in the reflected signal that are sensed by an ultrasonic sensor. Speech-related facial movements result in identifiable patterns in the spectrum of the received signal that can be used to identify speech activity. These sensors are not affected by even high levels of ambient audio noise. Unlike most other non-acoustic sensors, the device need not be taped to a talker. A simple yet robust method of extracting the voice activity information from the ultrasonic Doppler signal is developed and presented in this letter. The algorithm is seen to be very effective and robust to noise, and it can be implemented in real time.
引用
收藏
页码:754 / 757
页数:4
相关论文
共 11 条
  • [1] [Anonymous], P 14 ANN ACM S US IN
  • [2] Burnett G. C., 1999, P 138 M AC SOC AM
  • [3] A soft voice activity detector based on a Laplacian-Gaussian model
    Gazor, S
    Zhang, W
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 498 - 505
  • [4] Haykin S., 2000, COMMUNICATION SYSTEM, V4th
  • [5] HERSHEY J, 2004, P ISCA
  • [6] HU R, 2005, P IEEE ASRU, P319
  • [7] Oppenheim AV., 1990, DISCRETE TIME SIGNAL
  • [8] ALGORITHM FOR DETERMINING ENDPOINTS OF ISOLATED UTTERANCES
    RABINER, LR
    SAMBUR, MR
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1975, 54 (02): : 297 - 315
  • [9] A MULTICHANNEL ELECTROGLOTTOGRAPH
    ROTHENBERG, M
    [J]. JOURNAL OF VOICE, 1992, 6 (01) : 36 - 43
  • [10] Scanlon M., 1998, Proceedings of IRIS Acoustic and Seismic Sensing, V2, P205