Auditory scene analysis via application of ICA in a time-frequency domain

被引:0
作者
Janku, L [1 ]
机构
[1] Czech Tech Univ, Fac Elect Engn, Dept Comp Sci, CR-16635 Prague, Czech Republic
[2] Tech Univ Brno, Fac Informat Technol, Inst Comp Graph & Multimedia, Brno, Czech Republic
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2004年 / 3206卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with the auditory scene analysis via application of ICA in a time-frequency domain. An extension of an original algorithm is presented. This extension consists in Bayesian estimation of a number of independent components via direct implementation of selected grouping principles and via analysis of a structure of the previous time-spans. While the original algorithm is not capable to process sound scenes with fluctuating number of independent sound sources, the presented extension can operate also on sound scenes with the fluctuating number of sound sources.
引用
收藏
页码:347 / 353
页数:7
相关论文
共 18 条
[11]   Speech recognition by machines and humans [J].
Lippmann, RP .
SPEECH COMMUNICATION, 1997, 22 (01) :1-15
[12]  
MELLINGER DK, 1995, SCENE ANAL
[13]  
MOORE BCJ, 1995, HDB PERCEPTION COGNI
[14]  
PATTERSON RD, 1996, AUDITORY FILTERS EXC
[15]  
SLANEY M, 1994, AUDITORY TOOLBOX
[16]  
SMARAGDIS P, 2001, THESIS MIT MASSACHUS
[17]  
STONE JV, 2002, SPATIOTEMPORAL INDEP
[18]   Development of a Sign Language Dialogue System for a Healing Dialogue Robot [J].
Huang, Xuan ;
Wu, Bo ;
Kameda, Hiroyuki .
2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, :867-872