Large-Area Microphone Array for Audio Source Separation Based on a Hybrid Architecture Exploiting Thin-Film Electronics and CMOS

被引:13
作者
Sanz-Robinson, Josue [1 ]
Huang, Liechao [1 ]
Moy, Tiffany [1 ]
Rieutort-Louis, Warren [1 ]
Hu, Yingzhe [1 ]
Wagner, Sigurd [1 ]
Sturm, James C. [1 ]
Verma, Naveen [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
Amorphous silicon (a-Si); critically sampled; flexible electronics; large area electronics; microphone array; source separation; thin-film; thin-film transistors (TFT); SILICON;
D O I
10.1109/JSSC.2015.2501426
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a system for reconstructing-independent voice commands from two simultaneous speakers, based on an array of spatially distributed microphones. It adopts a hybrid architecture, combining large-area electronics (LAE), which enables a physically expansive array (>1 m width), and a CMOS IC, which provides superior transistors for readout and signal processing. The array enables us to: 1) select microphones closest to the speakers to receive the highest SNR signal; 2) use multiple spatially diverse microphones to enhance robustness to variations due to microphones and sound propagation in a practical room. Each channel consists of a thin-film transducer formed from polyvinylidene fluoride (PVDF), a piezopolymer, and a localized amplifier composed of amorphous silicon (a-Si) thin-film transistors (TFTs). Each channel is sequentially sampled by a TFT scanning circuit, to reduce the number of interfaces between the large-area electronics (LAE) and CMOS IC. A reconstruction algorithm is proposed, which exploits the measured transfer function between each speaker and microphone, to separate two simultaneous speakers. The algorithm overcomes 1) sampling-rate limitations of the scanning circuits and 2) sensitivities to microphone placement and directionality. An entire system with eight channels is demonstrated, acquiring and reconstructing two simultaneous audio signals at 2 m distance from the array achieving a signal-to-interferer (SIR) ratio improvement of similar to 12 dB.
引用
收藏
页码:979 / 991
页数:13
相关论文
共 24 条
[1]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[2]   The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech [J].
Araki, S ;
Mukai, R ;
Makino, S ;
Nishikawa, T ;
Saruwatari, H .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02) :109-116
[3]   Conformable amplified lead zirconate titanate sensors with enhanced piezoelectric response for cutaneous pressure monitoring [J].
Dagdeviren, Canan ;
Su, Yewang ;
Joe, Pauline ;
Yona, Raissa ;
Liu, Yuhao ;
Kim, Yun-Soung ;
Huang, YongAn ;
Damadoran, Anoop R. ;
Xia, Jing ;
Martin, Lane W. ;
Huang, Yonggang ;
Rogers, John A. .
NATURE COMMUNICATIONS, 2014, 5
[4]  
Freeman R. L., 2005, FUNDAMENTALS TELECOM, P90
[5]   Amorphous silicon thin-film transistors on compliant polyimide foil substrates [J].
Gleskova, H ;
Wagner, S .
IEEE ELECTRON DEVICE LETTERS, 1999, 20 (09) :473-475
[6]   Flexible ferroelectret field-effect transistor for large-area sensor skins and microphones [J].
Graz, Ingrid ;
Kaltenbrunner, Martin ;
Keplinger, Christoph ;
Schwodiauer, Reinhard ;
Bauer, Siegfried ;
Lacour, Stephanie P. ;
Wagner, Sigurd .
APPLIED PHYSICS LETTERS, 2006, 89 (07)
[7]  
Huang L., 2015, P S VLSI CIRC VLSIC, P198
[8]  
Kabal P., 2002, 10 MCGILL U DEP EL C
[9]  
Kokkinakis K., 2010, ADV MODERN BLIND SIG, P13
[10]   Thin Film Transistor Technology-Past, Present, and Future [J].
Kuo, Yue .
ELECTROCHEMICAL SOCIETY INTERFACE, 2013, 22 (01) :55-61