Location Estimation of Predominant Sound Source with Embedded Source Separation in Amplitude-Panned Stereo Signal

Cited by: 4
Authors
Han, Taek-Jin [1]
Kim, Ki-Jun [1]
Park, Hochong [1]
Affiliations
[1] Kwangwoon Univ, Dept Elect Engn, Seoul 139701, South Korea
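Keywords
Amplitude panning; non-negative matrix factorization; panning gain; sound source location; source separation
DOI
10.1109/LSP.2015.2424991
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology]
Subject classification code
0808; 0809
Abstract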
This letter proposes a new method of estimating the location of a predominant source in an amplitude-panned stereo signal with two sources. When the conventional method of location estimation is applied to an amplitude-panned multi-source sound, a serious estimation error occurs due to interference between sources. To solve this problem, the proposed method includes an embedded source separation based on non-negative matrix factorization, which first determines the initial estimate of source location, and then computes the basis matrix of the source using the initial location estimate. In this way, the proposed method can perform the source separation without a training stage. The comparative evaluation confirms that the proposed method provides higher performance in location estimation than the conventional method with a training stage.
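The interference problem and the separation tool named in the abstract can be illustrated with a minimal sketch. This is not the letter's algorithm: the constant-power panning law, the energy-ratio estimator, and the function names `pan`, `estimate_pan_angle`, and `nmf` are all assumptions made for illustration, and the step that derives a basis matrix from the initial location estimate is not reproduced here. The NMF routine uses the standard multiplicative updates for the Euclidean cost.

```python
import numpy as np

def pan(source, theta):
    """Pan a mono source to stereo with constant-power gains (assumed panning law)."""
    return np.cos(theta) * source, np.sin(theta) * source

def estimate_pan_angle(left, right):
    """Energy-ratio estimate of one pan angle; exact only when a single source is present."""
    return np.arctan2(np.sqrt(np.sum(right ** 2)), np.sqrt(np.sum(left ** 2)))

def nmf(V, rank, n_iter=200, seed=0, eps=1e-12):
    """Factor a non-negative matrix V ~ W @ H with multiplicative updates (Euclidean cost)."""
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + eps
    H = rng.random((rank, V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # update activations
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # update basis vectors
    return W, H

rng = np.random.default_rng(0)
s1 = rng.standard_normal(4096)         # predominant source (hypothetical test signal)
s2 = 0.5 * rng.standard_normal(4096)   # weaker interfering source
theta1, theta2 = 0.3, 1.2              # pan angles in radians (illustrative values)

l1, r1 = pan(s1, theta1)
l2, r2 = pan(s2, theta2)
angle_single = estimate_pan_angle(l1, r1)            # source 1 alone
angle_mixed = estimate_pan_angle(l1 + l2, r1 + r2)   # two-source mixture

# Toy non-negative "spectrogram" of rank 2, standing in for a magnitude spectrogram.
V = rng.random((6, 2)) @ rng.random((2, 8))
W, H = nmf(V, rank=2)
rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)

print(f"single-source estimate: {angle_single:.3f}")
print(f"mixture estimate:       {angle_mixed:.3f}")
print(f"NMF relative error:     {rel_err:.3e}")
```

With a single source the estimate matches the true pan angle, whereas the mixture estimate is pulled toward the interfering source. This is the kind of error that motivates separating the predominant source before estimating its location.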
Pages: 1685-1688
Page count: 4
Related papers
15 in total
  • [1] Avendano C., 2002, AUD ENG SOC 22 INT C
  • [2] Baek Y.-H., 2012, AUDIO ENG SOC CONVEN
  • [3] Bregman AS., 1994, AUDITORY SCENE ANAL
  • [4] Briand M., 2006, P 120 AUD ENG SOC CO
  • [5] Time-Frequency Matrix Feature Extraction and Classification of Environmental Audio Signals
    Ghoraani, Behnaz
    Krishnan, Sridhar
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): 2197-2209
  • [6] Geometric signal decompositions for spatial audio enhancement
    Goodwin, Michael M.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008: 409-412
  • [7] HELEN M, 2005, P 13 EUR SIGN PROC C
  • [8] King B, 2012, IEEE INT WORKS MACH
  • [9] Lee DD, 2001, ADV NEUR IN, V13, P556
  • [10] Pulkki V, 1997, J AUDIO ENG SOC, V45, P456