Informed Audio Source Separation Using Linearly Constrained Spatial Filters

被引：9

作者：

Gorlow, Stanislaw ^{[1
]}

Marchand, Sylvain ^{[2
]}

机构：

[1] Univ Bordeaux 1, CNRS, Comp Sci Res Lab Bordeaux LaBRI, F-33405 Talence, France

[2] Univ Western Brittany, Informat & Commun Sci & Technol Lab Lab STICC, CNRS, F-29238 Brest 3, France

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 01期

关键词：

Array signal processing; audio quality assessment; audio watermarking; informed audio source separation;

D O I：

10.1109/TASL.2012.2208629

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this work we readdress the issue of audio source separation in an informed scenario, where certain information about the sound sources is embedded into their mixture as an imperceptible watermark. In doing so, we provide a description of an improved algorithm that follows the linearly constrained minimum-variance filtering approach in the subband domain, in order to obtain perceptually better estimates of the source signals in comparison to other published approaches. Just as its predecessor, the algorithm does not impose any restrictions on the number of simultaneously active sources, neither on their spectral overlap. It rather adapts to a given signal constellation and provides the best possible estimates under given constraints in linearithmic time. The validity of the approach is demonstrated on a stereo mixture with two levels of sound complexity. It is also shown by means of both objective and subjective evaluation that the proposed algorithm outperforms a reference algorithm by at least one grade. Bearing high perceptual resemblance to the original signals at a fairly tolerable data rate of 10-20 kbps per source, the algorithm hence seems well-suited for active listening applications such as re-mixing or re-spatialization in real time.

引用

页码：1 / 11

页数：11

共 19 条

[1]

[Anonymous], 2010, DISCRETE TIME SIGNAL

[2]

[Anonymous], 2003, BS15341 ITUR

[3]

Comon P, 2010, HANDBOOK OF BLIND SOURCE SEPARATION: INDEPENDENT COMPONENT ANALYSIS AND APPLICATIONS, P1

[4] Subjective and Objective Quality Assessment of Audio Source Separation [J].

Emiya, Valentin ;

Vincent, Emmanuel ;

Harlander, Niklas ;

Hohmann, Volker .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07) :2046-2057

[5]

ER MH, 1983, IEEE T ACOUST SPEECH, V31, P1378

[6]

Fastl H., 2007, Psychoacoustics: Facts and Models, P247

[7] DERIVATION OF AUDITORY FILTER SHAPES FROM NOTCHED-NOISE DATA [J].

GLASBERG, BR ;

MOORE, BCJ .

HEARING RESEARCH, 1990, 47 (1-2) :103-138

[8]

Gorlow S, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P309, DOI 10.1109/ASPAA.2011.6082312

[9]

Haykin S., 2001, ADAPTIVE FILTER THEO

[10] A METHOD FOR THE CONSTRUCTION OF MINIMUM-REDUNDANCY CODES [J].

HUFFMAN, DA .

PROCEEDINGS OF THE INSTITUTE OF RADIO ENGINEERS, 1952, 40 (09) :1098-1101

← 1 2 →