Noise-robust Pitch Detection Based on Super-Resolution Harmonics

被引:0
|
作者
Zhu, Dongjie [1 ]
Zhu, Weibin [1 ]
Wang, Tianrui [1 ]
Gao, Yingying [2 ]
Feng, Junlan [2 ]
Zhang, Shilei [2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] China Mobile Res Inst, Beijing, Peoples R China
关键词
SPEECH; ESTIMATOR;
D O I
10.1109/APSIPAASC58517.2023.10317312
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to improve the performance of pitch detection algorithms in a noisy scenario, we propose a noise-robust pitch detection model based on super-resolution harmonics. The model consists of denoise, integration, and super-resolution modules. The Denoisy module highlights the harmonic structure of speech in the frequency domain to reduce noise interference. The enhanced spectrum is multiplied by an integral matrix to obtain a pitch-harmonic integration spectrum. The super-resolution module adjusts the low-resolution pitch-harmonic integration spectrum and outputs fine results. In addition, dynamic programming is employed for pitch tracking to eliminate those singular spots in the preliminary results. The experimental results show that the proposed model significantly outperforms the referenced approaches, including SWIPE, CREPE, HarmoF0, and HGCN+. The results also show that the model can work well even under low SNR.
引用
收藏
页码:422 / 426
页数:5
相关论文
共 50 条
  • [1] Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution
    Yang, Junkang
    Liu, Hongqing
    Gan, Lu
    Jing, Xiaorong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 156 (05): : 3143 - 3157
  • [2] Noise-robust video super-resolution using an adaptive spatial-temporal filter
    Hu, Jing
    Luo, Yupin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (21) : 9259 - 9278
  • [3] Noise-robust video super-resolution using an adaptive spatial-temporal filter
    Jing Hu
    Yupin Luo
    Multimedia Tools and Applications, 2015, 74 : 9259 - 9278
  • [4] Structural similarity-based Bi-representation through true noise level for noise-robust face super-resolution
    Nagar, Surendra
    Jain, Ankush
    Singh, Pramod Kumar
    Kumar, Ajay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (17) : 26255 - 26288
  • [5] Structural similarity-based Bi-representation through true noise level for noise-robust face super-resolution
    Surendra Nagar
    Ankush Jain
    Pramod Kumar Singh
    Ajay Kumar
    Multimedia Tools and Applications, 2023, 82 : 26255 - 26288
  • [6] Noise-robust Pitch Detection Algorithm Based on AMDF with Clustering Analysis Picking Peaks
    Gao, Jun
    Xu, Dan
    2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2016, : 1144 - 1148
  • [7] Robust super-resolution
    Zomet, A
    Rav-Acha, A
    Peleg, S
    2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2001, : 645 - 650
  • [8] Noise-Robust Pitch Detection using Auto-correlation Function with Enhancements
    Muhammad, Ghulam
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2010, 22 : 13 - 28
  • [9] Pitch synchronous based feature extraction for noise-robust speaker verification
    Gong Wei-Guo
    Yang Li-Ping
    Chen Di
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 295 - 298
  • [10] Noise-robust pitch detection method using wavelet transform with aliasing compensation
    Chen, SH
    Wang, JF
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2002, 149 (06): : 327 - 334