Robust Voice Activity Detection Using Selectively Energy Features

Citations: 0
Authors
Wakasugi, Junichiro [1]
Hayasaka, Noboru [2]
Iiguni, Youji [1]
Affiliations
[1] Osaka Univ, Grad Sch Engn Sci, Osaka, Japan
[2] Osaka Electrocommun Univ, Dept Informat Engn, Osaka, Japan
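Source
2014 21ST IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS) | 2014
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Subject classification code
0808 ; 0809 ;
Abstract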
In this paper, we propose a robust voice activity detection algorithm that automatically switches its calculation method according to the noise, so that it can adapt to various noise types. We use entropy as an indicator for judging whether the noise is narrow-band or wide-band. Under narrow-band noise, the spectral product is the suitable calculation method; under wide-band noise, spectral summation is the suitable one. The proposed method determines the noise type from the entropy and then applies the calculation method suited to that noise. We evaluated the proposed method against conventional methods using ROC curves and the number of correct segments. In the experiments, the proposed method detected speech segments more accurately than the other methods and showed better frame-level performance. The experimental results also show that the proposed method switches the calculation method appropriately depending on the noise.
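The switching idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the entropy, the spectral product, or the spectral summation precisely, so the sketch assumes a Shannon entropy of the normalized power spectrum (scaled to [0, 1]), a product of bin powers computed in the log domain, and a log of summed bin powers. The function names, the threshold of 0.7, and the numerical floor are all hypothetical.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2) of a real-valued frame."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_entropy(power):
    """Shannon entropy of the normalized power spectrum, scaled to [0, 1].

    Assumed definition: low values mean energy is concentrated in a few
    bins (narrow-band), high values mean it is spread out (wide-band).
    """
    total = sum(power)
    if total <= 0.0:
        return 0.0
    probs = [p / total for p in power if p > 0.0]
    h = -sum(p * math.log(p) for p in probs)
    return h / math.log(len(power))


def frame_feature(power, noise_entropy, threshold=0.7, floor=1e-12):
    """Pick the frame feature from the estimated noise entropy.

    threshold and floor are hypothetical values, not taken from the paper.
    Returns (method_name, feature_value).
    """
    if noise_entropy < threshold:
        # Assumed "spectral product": product of bin powers, evaluated as
        # the mean of their logarithms to avoid numerical underflow.
        value = sum(math.log(p + floor) for p in power) / len(power)
        return "product", value
    # Assumed "spectral summation": log of the summed bin powers.
    return "summation", math.log(sum(power) + floor)


# Usage: a pure tone stands in for narrow-band noise, a unit impulse
# (flat spectrum) stands in for wide-band noise.
n = 64
tone = [math.sin(2 * math.pi * 8 * t / n) for t in range(n)]
impulse = [1.0] + [0.0] * (n - 1)

for name, noise in (("tone", tone), ("impulse", impulse)):
    noise_power = power_spectrum(noise)
    entropy = spectral_entropy(noise_power)
    method, _ = frame_feature(noise_power, entropy)
    print(name, round(entropy, 3), method)
```

In a real detector the entropy would presumably be estimated from frames known or assumed to contain only noise, and the chosen feature would then be compared against a decision threshold for each incoming frame; both steps are outside what the abstract specifies.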
Pages: 359 - 362
Page count: 4
Related papers
50 in total
  • [21] Using Variational Bayes free energy for unsupervised voice activity detection
    Cournapeau, David
    Kawahara, Tatsuya
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4429 - 4432
  • [22] Robust voice activity detection directed by noise classification
    Saeedi, Jamal
    Ahadi, Seyed Mohammad
    Faez, Karim
    SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (03) : 561 - 572
  • [23] Adaptive regularization framework for robust voice activity detection
    Lu, Xugang
    Unoki, Masashi
    Isotani, Ryosuke
    Kawai, Hisashi
    Nakamura, Satoshi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2664 - 2667
  • [24] Robust Voice Activity Detection Algorithm for Noisy Speech
    Verteletskaya, Ekaterina
    Simak, Boris
    RTT 2009: 11TH INTERNATIONAL CONFERENCE RTT 2009 RESEARCH IN TELECOMMUNICATION TECHNOLOGY, CONFERENCE PROCEEDINGS, 2009, : 98 - 101
  • [25] A robust voice activity detection based on wavelet transform
    Aghajani, Kh.
    Manzuri, M. T.
    Karami, M.
    Tayebi, H.
    2008 SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, 2008, : 37 - +
  • [26] Robust voice activity detection based on noise eigenspace
    Ying, Dongwen
    Shi, Yu
    Lu, Xugang
    Dang, Jianwu
    Soong, Frank
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (06) : 413 - 423
  • [27] Formant-Based Robust Voice Activity Detection
    Yoo, In-Chul
    Lim, Hyeontaek
    Yook, Dongsuk
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2238 - 2245
  • [28] Robust voice activity detection directed by noise classification
    Jamal Saeedi
    Seyed Mohammad Ahadi
    Karim Faez
    Signal, Image and Video Processing, 2015, 9 : 561 - 572
  • [29] Robust voice activity detection in stereo recording with crosstalk
    Ghosh, Prasanta Kumar
    Tsiartas, Andreas
    Georgiou, Panayiotis
    Narayanan, Shrikanth S.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 3098 - 3101
  • [30] A Robust Voice Activity Detection Algorithm in Nonstationary Noise
    Lei, Jianjun
    Yang, Jiachen
    Wang, Jian
    Yang, Zhen
    2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, PROCEEDINGS, 2009, : 195 - +