Segmentation on time-frequency domain for speech segregation

被引：0

作者：

Lim, Sung-Kil ^{[1
]}

Lee, Hyon-Soo ^{[1
]}

机构：

[1] Kyung Hee Univ, Dept Comp Engn, 1 Seochun Dong, Kyonggi Do, South Korea

来源：

2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2 | 2006年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose an algorithm for the frequency channel segmentation using a neural oscillatory network. The frequency channel segments means that local groups of channels in frequency domain that could be arisen from the same sound source. The proposed algorithm is based on the smoothed spectrum of the input sound. Valleys in the smoothed spectrum are used to determine vertical weights and the continuity of segment boundaries is used to determine vertical weights in the oscillatory network. To evaluate a suitableness of the proposed segmentation algorithm before the grouping stage is applied, we compare the synthesis results of ideal mask with that of proposed algorithm.

引用

页码：433 / +

页数：2

共 50 条

[1] Watermarking of speech signals in the time-frequency domain
Al-Khassaweneh, Mahmood
Al-Zoubi, Hussein
Aviyente, Selin
2009 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY, 2009, : 317 - +
[2] Neural speech enhancement in the time-frequency domain
Volkmer, M
2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03, 2003, : 617 - 626
[3] SPEECH ENHANCEMENT BASED ON JOINT TIME-FREQUENCY SEGMENTATION
Tantibundhit, C.
Pernkopf, F.
Kubin, G.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4673 - +
[4] Joint Time-Frequency and Time Domain Learning for Speech Enhancement
Tang, Chuanxin
Luo, Chong
Zhao, Zhiyuan
Xie, Wenxuan
Zeng, Wenjun
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3816 - 3822
[5] A Method of Sound Segmentation in Time-Frequency Domain Using Peaks and Valleys in Spectrogram for Speech Separation
Lim, Sung-Kil
Lee, Hyon-Soo
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (08): : 418 - 426
[6] A HYBRID TIME-FREQUENCY DOMAIN ARTICULATORY SPEECH SYNTHESIZER
SONDHI, MM
SCHROETER, J
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (07): : 955 - 967
[7] Integrated speech enhancement and coding in the time-frequency domain
Drygajlo, A
Carnero, B
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1183 - 1186
[8] Robust Speech Watermarking Procedure in the Time-Frequency Domain
Srdjan Stanković
Irena Orović
Nikola Žarić
EURASIP Journal on Advances in Signal Processing, 2008
[9] Robust speech watermarking procedure in the time-frequency domain
Stankovic, Srdjan
Orovic, Irena
Zaric, Nikola
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
[10] Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement
Tantibundhit, Charturong
Pernkopf, Franz
Kubin, Gernot
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1417 - 1428

← 1 2 3 4 5 →