Large-Area Microphone Array for Audio Source Separation Based on a Hybrid Architecture Exploiting Thin-Film Electronics and CMOS

Cited by: 13

Authors:

Sanz-Robinson, Josue [1]

Huang, Liechao [1]

Moy, Tiffany [1]

Rieutort-Louis, Warren [1]

Hu, Yingzhe [1]

Wagner, Sigurd [1]

Sturm, James C. [1]

Verma, Naveen [1]

Affiliations:

[1] Princeton Univ, Princeton, NJ 08544 USA

Source:

IEEE JOURNAL OF SOLID-STATE CIRCUITS | 2016 / Vol. 51 / No. 04

Funding:

U.S. National Science Foundation

Keywords:

Amorphous silicon (a-Si); critically sampled; flexible electronics; large area electronics; microphone array; source separation; thin-film; thin-film transistors (TFT); SILICON;

DOI:

10.1109/JSSC.2015.2501426

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

We present a system for reconstructing independent voice commands from two simultaneous speakers, based on an array of spatially distributed microphones. It adopts a hybrid architecture, combining large-area electronics (LAE), which enables a physically expansive array (>1 m width), and a CMOS IC, which provides superior transistors for readout and signal processing. The array enables us to: 1) select the microphones closest to the speakers to receive the highest-SNR signal; and 2) use multiple spatially diverse microphones to enhance robustness to variations due to microphones and sound propagation in a practical room. Each channel consists of a thin-film transducer formed from polyvinylidene fluoride (PVDF), a piezopolymer, and a localized amplifier composed of amorphous silicon (a-Si) thin-film transistors (TFTs). The channels are sampled sequentially by a TFT scanning circuit, to reduce the number of interfaces between the LAE and the CMOS IC. A reconstruction algorithm is proposed, which exploits the measured transfer function between each speaker and microphone to separate two simultaneous speakers. The algorithm overcomes 1) sampling-rate limitations of the scanning circuits and 2) sensitivities to microphone placement and directionality. An entire system with eight channels is demonstrated, acquiring and reconstructing two simultaneous audio signals at 2 m distance from the array, achieving a signal-to-interferer ratio (SIR) improvement of approximately 12 dB.
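The abstract does not spell out the reconstruction algorithm, so the following is only an illustrative sketch of the general idea of separating two speakers from known speaker-to-microphone transfer functions. It assumes a plain per-frequency-bin least-squares inversion, which is a swapped-in textbook technique and not the authors' method (in particular, it ignores the sampling-rate limits of the scanning circuit). The function name `separate_sources`, the array shapes, and all of the synthetic data are assumptions made for illustration.

```python
import numpy as np


def separate_sources(mic_spectra, transfer):
    """Least-squares estimate of source spectra, one frequency bin at a time.

    mic_spectra: complex array, shape (n_bins, n_mics)
    transfer:    complex array, shape (n_bins, n_mics, n_sources), the
                 speaker-to-microphone transfer functions (assumed known)
    returns:     complex array, shape (n_bins, n_sources)
    """
    n_bins, _, n_sources = transfer.shape
    estimate = np.empty((n_bins, n_sources), dtype=complex)
    for f in range(n_bins):
        # Solve transfer[f] @ s = mic_spectra[f] in the least-squares sense.
        estimate[f] = np.linalg.lstsq(transfer[f], mic_spectra[f], rcond=None)[0]
    return estimate


# Synthetic check: 8 microphones, 2 speakers, 16 frequency bins (all made up).
rng = np.random.default_rng(0)
n_bins, n_mics, n_sources = 16, 8, 2
transfer = rng.standard_normal((n_bins, n_mics, n_sources)) \
    + 1j * rng.standard_normal((n_bins, n_mics, n_sources))
sources = rng.standard_normal((n_bins, n_sources)) \
    + 1j * rng.standard_normal((n_bins, n_sources))

# Each microphone hears a transfer-function-weighted sum of both speakers.
mic_spectra = np.einsum("fmk,fk->fm", transfer, sources)

recovered = separate_sources(mic_spectra, transfer)
print(np.allclose(recovered, sources))
```

With eight microphones and two speakers the system is overdetermined in every bin, which is why a least-squares solve is used here instead of a direct matrix inverse; in this noise-free synthetic case the sources should be recovered to numerical precision.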


Pages: 979-991

Number of pages: 13

