Noise-robust Pitch Detection Based on Super-Resolution Harmonics

被引:0
|
作者
Zhu, Dongjie [1 ]
Zhu, Weibin [1 ]
Wang, Tianrui [1 ]
Gao, Yingying [2 ]
Feng, Junlan [2 ]
Zhang, Shilei [2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] China Mobile Res Inst, Beijing, Peoples R China
关键词
SPEECH; ESTIMATOR;
D O I
10.1109/APSIPAASC58517.2023.10317312
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to improve the performance of pitch detection algorithms in a noisy scenario, we propose a noise-robust pitch detection model based on super-resolution harmonics. The model consists of denoise, integration, and super-resolution modules. The Denoisy module highlights the harmonic structure of speech in the frequency domain to reduce noise interference. The enhanced spectrum is multiplied by an integral matrix to obtain a pitch-harmonic integration spectrum. The super-resolution module adjusts the low-resolution pitch-harmonic integration spectrum and outputs fine results. In addition, dynamic programming is employed for pitch tracking to eliminate those singular spots in the preliminary results. The experimental results show that the proposed model significantly outperforms the referenced approaches, including SWIPE, CREPE, HarmoF0, and HGCN+. The results also show that the model can work well even under low SNR.
引用
收藏
页码:422 / 426
页数:5
相关论文
共 50 条
  • [21] Study of Super-resolution Direction Detection with spatially nonstationary noise
    Shi, Xiumin
    Chen, Liang
    2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1492 - +
  • [22] A Single Image Super-Resolution Algorithm Using Non-Local-Mean Self-Similarity and Noise-Robust Saliency Map
    Lee, Hui Jung
    Choi, Dong-Yoon
    Lim, Kyoung Won
    Song, Byung Cheol
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (07) : 1463 - 1474
  • [23] Super-resolution microscopy based on parallel detection
    Zhang, Zhimin
    Liu, Shaocong
    Xu, Liang
    Han, Yubing
    Kuang, Cuifang
    Liu, Yong
    Hao, Xiang
    Yang, Hongqin
    Liu, Xu
    JOURNAL OF INNOVATIVE OPTICAL HEALTH SCIENCES, 2019, 12 (06)
  • [24] Attentive ExFeat based deep generative adversarial network for noise robust face super-resolution
    Tomar, Anurag Singh
    Arya, K. V.
    Rajput, Shyam Singh
    PATTERN RECOGNITION LETTERS, 2023, 169 : 58 - 66
  • [25] Robust Dual Images Super-resolution
    Zhang, Xiaohong
    Zhang, Yun
    Qian, Guiping
    Qin, Aihong
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [26] Towards Robust Speech Super-Resolution
    Wang, Heming
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2058 - 2066
  • [27] Noise robust face super-resolution via learning of spatial attentive features
    Anurag Singh Tomar
    K. V. Arya
    Shyam Singh Rajput
    Multimedia Tools and Applications, 2023, 82 : 25449 - 25465
  • [28] Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics
    Drugman, Thomas
    Alwan, Abeer
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1984 - +
  • [29] Noise robust face super-resolution via learning of spatial attentive features
    Tomar, Anurag Singh
    Arya, K. V.
    Rajput, Shyam Singh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (16) : 25449 - 25465
  • [30] Noise robust single image super-resolution using a multiscale image pyramid
    Hu, Jing
    Wu, Xi
    Zhou, Jiliu
    SIGNAL PROCESSING, 2018, 148 : 157 - 171