Large-Area Microphone Array for Audio Source Separation Based on a Hybrid Architecture Exploiting Thin-Film Electronics and CMOS

被引:13
作者
Sanz-Robinson, Josue [1 ]
Huang, Liechao [1 ]
Moy, Tiffany [1 ]
Rieutort-Louis, Warren [1 ]
Hu, Yingzhe [1 ]
Wagner, Sigurd [1 ]
Sturm, James C. [1 ]
Verma, Naveen [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
Amorphous silicon (a-Si); critically sampled; flexible electronics; large area electronics; microphone array; source separation; thin-film; thin-film transistors (TFT); SILICON;
D O I
10.1109/JSSC.2015.2501426
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a system for reconstructing-independent voice commands from two simultaneous speakers, based on an array of spatially distributed microphones. It adopts a hybrid architecture, combining large-area electronics (LAE), which enables a physically expansive array (>1 m width), and a CMOS IC, which provides superior transistors for readout and signal processing. The array enables us to: 1) select microphones closest to the speakers to receive the highest SNR signal; 2) use multiple spatially diverse microphones to enhance robustness to variations due to microphones and sound propagation in a practical room. Each channel consists of a thin-film transducer formed from polyvinylidene fluoride (PVDF), a piezopolymer, and a localized amplifier composed of amorphous silicon (a-Si) thin-film transistors (TFTs). Each channel is sequentially sampled by a TFT scanning circuit, to reduce the number of interfaces between the large-area electronics (LAE) and CMOS IC. A reconstruction algorithm is proposed, which exploits the measured transfer function between each speaker and microphone, to separate two simultaneous speakers. The algorithm overcomes 1) sampling-rate limitations of the scanning circuits and 2) sensitivities to microphone placement and directionality. An entire system with eight channels is demonstrated, acquiring and reconstructing two simultaneous audio signals at 2 m distance from the array achieving a signal-to-interferer (SIR) ratio improvement of similar to 12 dB.
引用
收藏
页码:979 / 991
页数:13
相关论文
共 24 条
[21]   Performance measurement in blind audio source separation [J].
Vincent, Emmanuel ;
Gribonval, Remi ;
Févotte, Cedric .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) :1462-1469
[22]   A Micro Oxygen Sensor Based on a Nano Sol-Gel TiO2 Thin Film [J].
Wang, Hairong ;
Chen, Lei ;
Wang, Jiaxin ;
Sun, Quantao ;
Zhao, Yulong .
SENSORS, 2014, 14 (09) :16423-16433
[23]  
Weinstein E, 2004, MITLCSTM642
[24]   Flexible substrate micro-crystalline silicon and gated amorphous silicon strain sensors [J].
Zhou, LS ;
Jung, SY ;
Brandon, E ;
Jackson, TN .
IEEE TRANSACTIONS ON ELECTRON DEVICES, 2006, 53 (02) :380-385