Spectral refinement with adaptive window-size selection for voicing detection and fundamental frequency estimation

被引:0
|
作者
Madhu, Nilesh [1 ]
Krini, Mohammed [2 ]
机构
[1] Univ Ghent, Imec, IDLab, Dept Elect & Informat Syst, Ghent, Belgium
[2] Aschaffenburg Univ Appl Sci, Signal Proc Lab, Aschaffenburg, Germany
来源
2020 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2020) | 2020年
关键词
spectrum computation; fundamental frequency estimation; speech enhancement; DFT; spectral refinement; SPEECH;
D O I
10.1109/ISSPIT51521.2020.9408968
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Spectral refinement (SR) offers a computationally inexpensive means of generating a refined (higher resolution) signal spectrum by linearly combining the spectra of shorter, contiguous signal segments. The benefit of this method has previously been demonstrated on the problem of fundamental frequency (F0) estimation in speech processing - specifically for the improved estimation of very low F0. One drawback of SR is, however, the poorer detection of voicing onsets due to the Heisenberg-Gabor limit on time and frequency resolution. This may also lead to degraded performance in noisy conditions. Transitioning between long- and short-time windows for the spectral analysis may offer a good trade-off in these situations. This contribution presents a method to adaptively switch between short- and long-time windows (and, correspondingly, between the short-term and the refined spectrum) for voicing detection and F0 estimation. The improvements in voicing detection and F0 estimation due to this adaptive switching is conclusively demonstrated on audio signals in clean and corrupted conditions.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] VLSI processor for reliable stereo matching based on adaptive window-size selection
    Hariyama, M
    Takeuchi, T
    Kameyama, M
    2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2001, : 1168 - 1173
  • [2] Spectral refinement and its application to fundamental frequency estimation
    Krini, Mohamed
    Schmidt, Gerhard
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 181 - 184
  • [3] Adaptive Window Size Estimation in Unsupervised Change Detection
    Gong, Xing
    Corpetti, Thomas
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2013, 6 (02) : 991 - 1003
  • [4] A study on window-size selection for threshold and bootstrap value-at-risk models
    Smith, Anri
    Huang, Chun-Kai
    JOURNAL OF RISK MODEL VALIDATION, 2019, 13 (04): : 1 - 16
  • [5] Adaptive window size gradient estimation for image edge detection
    Albán, E
    Katkovnik, V
    Egiazarian, K
    IMAGE PROCESSING: ALGORITHMS AND SYSTEMS II, 2003, 5014 : 54 - 65
  • [6] PROGRAMS FOR THE ESTIMATION OF FUNDAMENTAL-FREQUENCY, AMPLITUDE, AND VOICING OF SPEECH
    HEYMAN, R
    BIRD, RJ
    HEYMAN, RL
    HARDING, J
    BEHAVIOR RESEARCH METHODS & INSTRUMENTATION, 1981, 13 (06): : 760 - 760
  • [7] Adaptive Rate Control and Contention Window-Size Adjustment for Power-Line Communication
    Yoon, Sung-Guk
    Bahk, Saewoong
    IEEE TRANSACTIONS ON POWER DELIVERY, 2011, 26 (02) : 809 - 816
  • [8] A comparison of several recent methods of fundamental frequency and voicing decision estimation
    Mousset, E
    Ainsworth, WA
    Fonollosa, JAR
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1273 - 1276
  • [9] A RULE-BASED, ADAPTIVE WINDOW-SIZE FILTER FOR THE ENHANCEMENT OF SUBCUTANEOUS VASCULAR PATTERNS IN THERMOGRAPHIC IMAGES
    CHAN, EKY
    PEARCE, JA
    IMAGES OF THE TWENTY-FIRST CENTURY, PTS 1-6, 1989, 11 : 1746 - 1748
  • [10] Adaptive step size window matching for detection
    Mekuz, Nathan
    Derpanis, Konstantinos G.
    Tsotsos, John K.
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 259 - +