Segmentation on time-frequency domain for speech segregation

Citations: 0
Authors
Lim, Sung-Kil [1]
Lee, Hyon-Soo [1]
Affiliation
[1] Kyung Hee Univ, Dept Comp Engn, 1 Seochun Dong, Kyonggi Do, South Korea
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose an algorithm for frequency channel segmentation using a neural oscillatory network. A frequency channel segment is a local group of channels in the frequency domain that could have arisen from the same sound source. The proposed algorithm is based on the smoothed spectrum of the input sound. Valleys in the smoothed spectrum are used to determine vertical weights, and the continuity of segment boundaries is used to determine vertical weights in the oscillatory network. To evaluate the suitability of the proposed segmentation algorithm before the grouping stage is applied, we compare the synthesis results of the ideal mask with those of the proposed algorithm.
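The idea of splitting frequency channels at valleys of a smoothed spectrum can be illustrated with a minimal Python sketch. This is not the authors' oscillatory network: it only shows valley-based segmentation of a single spectral frame. The moving-average smoother, the window width, and the function names `smooth_spectrum`, `find_valleys`, and `segment_channels` are all assumptions made for illustration.

```python
import numpy as np


def smooth_spectrum(mag, width=5):
    """Moving-average smoothing across frequency channels (assumed smoother)."""
    if width % 2 == 0:
        raise ValueError("width must be odd so the output keeps the input length")
    kernel = np.ones(width) / width
    # Edge padding keeps the output the same length as the input.
    padded = np.pad(np.asarray(mag, dtype=float), width // 2, mode="edge")
    return np.convolve(padded, kernel, mode="valid")


def find_valleys(smoothed):
    """Return indices of strict local minima of the smoothed spectrum."""
    s = np.asarray(smoothed)
    is_valley = (s[1:-1] < s[:-2]) & (s[1:-1] < s[2:])
    return np.flatnonzero(is_valley) + 1


def segment_channels(mag, width=5):
    """Split channel indices into half-open segments [start, end) at valleys."""
    valleys = find_valleys(smooth_spectrum(mag, width))
    edges = np.concatenate(([0], valleys, [len(mag)]))
    return [(int(a), int(b)) for a, b in zip(edges[:-1], edges[1:])]


# Synthetic frame (hypothetical data): two spectral peaks at channels 16 and 48.
channels = np.arange(64)
frame = np.exp(-((channels - 16) / 5.0) ** 2) + np.exp(-((channels - 48) / 5.0) ** 2)
print(segment_channels(frame))
```

For the synthetic frame, the single valley between the two peaks separates the channels into two segments, one per peak. A time-frequency version would repeat this per frame and then link boundaries across frames, which is where the continuity criterion mentioned in the abstract would come in.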
Pages: 433 / +
Page count: 2