Noise robust voice activity detection using joint phase and magnitude based feature enhancement

被引:0
作者
Khomdet Phapatanaburi
Longbiao Wang
Zeyan Oo
Weifeng Li
Seiichi Nakagawa
Masahiro Iwahashi
机构
[1] Nagaoka University of Technology,Tianjin Key Laboratory of Cognitive Computing and Application
[2] School of Computer Science and Technology,Graduate School at Shenzhen
[3] Tianjin University,undefined
[4] Tsinghua University,undefined
[5] Toyohashi University of Technology,undefined
来源
Journal of Ambient Intelligence and Humanized Computing | 2017年 / 8卷
关键词
Deep neural network (DNN); Phase information; Noise-robust VAD; Feature enhancement;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, deep neural network (DNN)-based feature enhancement has been proposed for many speech applications. DNN-enhanced features have achieved higher performance than raw features. However, phase information is discarded during most conventional DNN training. In this paper, we propose a DNN-based joint phase- and magnitude -based feature (JPMF) enhancement (JPMF with DNN) and a noise-aware training (NAT)-DNN-based JPMF enhancement (JPMF with NAT-DNN) for noise-robust voice activity detection (VAD). Moreover, to improve the performance of the proposed feature enhancement, a combination of the scores of the proposed phase- and magnitude-based features is also applied. Specifically, mel-frequency cepstral coefficients (MFCCs) and the mel-frequency delta phase (MFDP) are used as magnitude and phase features. The experimental results show that the proposed feature enhancement significantly outperforms the conventional magnitude-based feature enhancement. The proposed JPMF with NAT-DNN method achieves the best relative equal error rate (EER), compared with individual magnitude- and phase-based DNN speech enhancement. Moreover, the combined score of the enhanced MFCC and MFDP using JPMF with NAT-DNN further improves the VAD performance.
引用
收藏
页码:845 / 859
页数:14
相关论文
共 50 条
  • [41] Coupled Noise Suppression and Feature Enhancement Network for Skeleton-Based Action Recognition
    Liu, Ye
    Wu, Tianyong
    Shi, Tianhao
    Wang, Miaohui
    Gao, Hao
    Liu, Jun
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025,
  • [42] Detection of Early Gastric Cancer Based on Single Shot Detector with Feature Enhancement
    Pan, Ongsheng
    Zhang, Rong
    Wang, Yalei
    Feng, Hui
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1091 - 1095
  • [43] Sonar Image Target Detection Based on Adaptive Global Feature Enhancement Network
    Wang, Zhen
    Zhang, Shanwen
    Huang, Wenzhun
    Guo, Jianxin
    Zeng, Leya
    IEEE SENSORS JOURNAL, 2022, 22 (02) : 1509 - 1530
  • [44] YOLO-Ships: Lightweight ship object detection based on feature enhancement
    Zhang, Yu
    Chen, Wenhui
    Li, Songlin
    Liu, Hailong
    Hu, Qing
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 101
  • [45] VFEDet: A Variational Information Bottleneck Based Feature Enhancement Object Detection Network
    Wu, Mingyu
    Zhu, Ming
    Tang, Ruixue
    TWELFTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2020), 2021, 11720
  • [46] Research on Negative Road Obstacle Detection Based on Multimodal Feature Enhancement and Fusion
    Huo, Guanglei
    Cao, Chuqing
    Li, Yaxin
    Lin, Wenwei
    Zhang, Chentao
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [47] Pavement crack detection method based on multi-scale feature enhancement
    Zhai J.-Z.
    Sun Z.-Y.
    Pei L.-L.
    Huyan J.
    Li W.
    Jiaotong Yunshu Gongcheng Xuebao/Journal of Traffic and Transportation Engineering, 2023, 23 (01): : 291 - 308
  • [48] A ship small target detection algorithm based on feature enhancement in SAR image
    Yan C.-M.
    Wang C.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (01): : 239 - 247
  • [49] Imbalance multiclass problem: a robust feature enhancement-based framework for liver lesion classification
    Hu, Rui
    Song, Yuqing
    Liu, Yi
    Zhu, Yan
    Feng, Nuo
    Qiu, Chengjian
    Han, Kai
    Teng, Qiaoying
    Haq, Imran Ul
    Liu, Zhe
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [50] Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition
    Cho, Ji-Won
    Park, Hyung-Min
    SIGNAL PROCESSING, 2016, 120 : 200 - 208