ON USING SPECTRAL GRADIENT IN CONDITIONAL MAP CRITERION FOR ROBUST VOICE ACTIVITY DETECTION

被引:0
作者
Choi, Jae-Hun [1 ]
Chang, Joon-Hyuk [1 ]
机构
[1] Hanyang Univ, Sch Elect Engn, Seoul 133791, South Korea
来源
PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012) | 2012年
关键词
Voice activity detection; Spectral gradient; Conditional MAP; Likelihood ratio test;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel approach to improve a statistical model-based voice activity detection (VAD) method based on a modified conditional maximum a posteriori (MAP) criterion incorporating the spectral gradient scheme. The proposed conditional MAP incorporates not only the voice activity decision in the previous frame as in Ref. [1] but also the spectral gradient of the observed spectra between the current frame and the past frames to efficiently exploit the inter-frame correlation of voice activity. As a result, the proposed VAD leads to six separate thresholds to be adaptively determined in the likelihood ratio test (LRT) depending on both the previous VAD result and the estimated spectral gradient parameter. Experimental results demonstrate that the proposed approach yields better results compared to those of the previous conditional MAP-based method.
引用
收藏
页码:370 / 374
页数:5
相关论文
共 50 条
  • [21] Robust voice activity detection based on noise eigenspace
    Ying, Dongwen
    Shi, Yu
    Lu, Xugang
    Dang, Jianwu
    Soong, Frank
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (06) : 413 - 423
  • [22] Robust voice activity detection directed by noise classification
    Jamal Saeedi
    Seyed Mohammad Ahadi
    Karim Faez
    [J]. Signal, Image and Video Processing, 2015, 9 : 561 - 572
  • [23] Robust voice activity detection in stereo recording with crosstalk
    Ghosh, Prasanta Kumar
    Tsiartas, Andreas
    Georgiou, Panayiotis
    Narayanan, Shrikanth S.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 3098 - 3101
  • [24] A Robust Voice Activity Detection Algorithm in Nonstationary Noise
    Lei, Jianjun
    Yang, Jiachen
    Wang, Jian
    Yang, Zhen
    [J]. 2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, PROCEEDINGS, 2009, : 195 - +
  • [25] Robust speaker verification in air traffic control using improved voice activity detection
    Neffe, Michael
    Van Pham, Tuan
    Pernkopf, Franz
    Kubin, Gernot
    [J]. PROCEEDINGS OF THE FOURTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PATTERN RECOGNITION, AND APPLICATIONS, 2007, : 298 - +
  • [26] ROBUST VOICE ACTIVITY DETECTION USING EMPIRICAL MODE DECOMPOSITION AND MODULATION SPECTRUM ANALYSIS
    Kanai, Yasuaki
    Unoki, Masashi
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 400 - 404
  • [27] A robust voice activity detection based on noise eigenspace projection
    Ying, Dongwen
    Shi, Yu
    Soong, Frank
    Dang, Jianwu
    Lu, Xugang
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 76 - +
  • [28] An RNN and CRNN Based Approach to Robust Voice Activity Detection
    Wang, Guan-Bo
    Zhang, Wei-Qiang
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1347 - 1350
  • [29] On training targets for noise-robust voice activity detection
    Braun, Sebastian
    Tashev, Ivan
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 421 - 425
  • [30] Voice activity detection using Laplacian model and UMP test
    Jang, Keun Won
    Kim, Dong Kook
    Chang, Joon-Hyuk
    [J]. PROCEEDING OF THE 11TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS: COMPUTER SCIENCE AND TECHNOLOGY, VOL 4, 2007, : 480 - +