MULTI-STAGE REJECTION SAMPLING (MSRS): A ROBUST SRP-PHAT PEAK DETECTION ALGORITHM FOR LOCALIZATION OF COCKTAIL-PARTY TALKERS

Cited by: 0
Authors
Khanal, Sarthak [1]
Silverman, Harvey F. [1]
Affiliation
[1] Brown Univ, LEMS, Box D, Providence, RI 02906 USA
Source
2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2015
Keywords
Microphone array; source localization; talker localization; cocktail party; SRP-PHAT; steered response power; region contraction; volume contraction; peak detection; hierarchical search
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The Steered Response Power using the Phase Transform weight (SRP-PHAT) has been shown to be robust in noisy and reverberant conditions. Volume contraction has also been applied effectively to trap the global maximum in densely hilly 3-D spaces such as the SRP. However, previous methods have suffered from the presence of peaks representing multiple talkers in close proximity, as is likely in a conversational cocktail-party setting. We present a volume contraction algorithm called Multi-Stage Rejection Sampling (MSRS) for detection of multiple peaks in the SRP-PHAT space. Our method not only circumvents sorting, a computationally expensive step in volume contraction algorithms, but also automatically divides a search volume into sub-volumes for robust detection of multiple peaks. We discuss some modifications to the standard SRP-PHAT functional and present results using all real-room data for baseline white noise, an eight-speaker teleconferencing setup, and a fully unconstrained cocktail-party situation containing about 21 persons in the room.
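The abstract does not spell out the SRP-PHAT functional or the MSRS steps, so the following is only a minimal sketch of the standard building block, not the authors' implementation. It computes a PHAT-weighted generalized cross-correlation (GCC-PHAT) for one microphone pair; SRP-PHAT in its usual form sums such correlations over all microphone pairs at the delays implied by each candidate source location. The function name `gcc_phat`, the parameter `max_lag`, and the synthetic white-noise signals are assumptions made for illustration.

```python
import numpy as np


def gcc_phat(x, y, max_lag):
    """PHAT-weighted cross-correlation of x and y (hypothetical helper).

    Returns (lags, cc) for lags in [-max_lag, max_lag]. A peak at a
    positive lag k means x is a copy of y delayed by k samples.
    """
    n = len(x) + len(y)              # zero-pad to avoid circular wrap-around
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = X * np.conj(Y)           # cross-power spectrum
    cross /= np.abs(cross) + 1e-12   # PHAT weight: keep phase, drop magnitude
    cc = np.fft.irfft(cross, n=n)
    # Reorder so the output runs from lag -max_lag to lag +max_lag.
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    lags = np.arange(-max_lag, max_lag + 1)
    return lags, cc


# Synthetic example (assumed data): white noise reaching microphone 2
# five samples later than microphone 1.
rng = np.random.default_rng(0)
source = rng.standard_normal(1024)
true_delay = 5
mic1 = source
mic2 = np.concatenate((np.zeros(true_delay), source[:-true_delay]))

lags, cc = gcc_phat(mic2, mic1, max_lag=20)
estimated_delay = int(lags[np.argmax(cc)])
print("estimated delay (samples):", estimated_delay)
```

In a full SRP-PHAT evaluation, the value of `cc` at the lag predicted by the array geometry would be read off for every microphone pair and summed for each candidate point; a peak search such as volume contraction then operates on that summed surface.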
Pages: 5