Sequential organization of speech in computational auditory scene analysis

Cited by: 16

Authors:

Shao, Yang [1]

Wang, DeLiang [1,2]

Affiliations:

[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA

[2] Ohio State Univ, Ctr Cognit Sci, Columbus, OH 43210 USA

Source:

SPEECH COMMUNICATION | 2009 / Vol. 51 / No. 08

Keywords:

Sequential organization; Computational auditory scene analysis; Speaker quantization; Binary time-frequency mask; MODEL; SEGREGATION; TRACKING;

DOI:

10.1016/j.specom.2009.02.003

Chinese Library Classification:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

A human listener has the ability to follow a speaker's voice over time in the presence of other talkers and non-speech interference. This paper proposes a general system for sequential organization of speech based on speaker models. By training a general background model, the proposed system is shown to function well with both interfering talkers and non-speech intrusions. To deal with situations where prior information about specific speakers is not available, a speaker quantization method is employed to extract representative models from a large speaker space, and the obtained generic models are used to perform sequential grouping. Our systematic evaluations show that grouping performance using generic models is only moderately lower than the performance level achieved with known speaker models. (C) 2009 Elsevier B.V. All rights reserved.

Citation

Pages: 657-667

Page count: 11
