Hardware design for blind source separation using fast time-frequency mask technique

被引：2

作者：

Tsai, Tsung-Han ^{[1
]}

Liu, Pei-Yun ^{[1
]}

Chiou, Yu-He ^{[1
]}

机构：

[1] Natl Cent Univ, Dept Elect Engn, Taoyuan, Taiwan

来源：

INTEGRATION-THE VLSI JOURNAL | 2022年 / 82卷

关键词：

Blind separation; Time-frequency mask; Convolutive BSS; Reduction of DOA variance; VLSI Design; VLSI IMPLEMENTATION; ARCHITECTURE; MIXTURES;

D O I：

10.1016/j.vlsi.2021.07.001

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose a fast time-frequency mask technique that relies on the sparseness of source signals for blind source separation (BSS) to separate a mixture of two input sounds in a single signal automatically. Due to the sparseness of source signals, the signal can be distinguished when it is transformed into the time-frequency domain. Most previous methods did not mention the effect of different angles on accuracy. To overcome such problems, we first define two features which are normalized level-ratio and phase-difference. Next, we use our method to decrease the variance of Direction of Arrival (DOA). This can reduce the variance of features so that it can reduce the iterations of k-means. Finally, a mask is generated according to the clustered features. Our method does not require any prior information or parameter estimation. The motivation of the proposed design is to incorporate the BSS system with some smart voice appliances. In the application scenario, all the non-human voices may appear and regard as interference. We use Signal to Distortion Ratio (SDR) and Signal to Interference Ratio (SIR) to make some comparison. Based on the proposed system, then we present a hardware design. We use the TSMC 90-nm CMOS process. As a cost-effective result, it consumes about 120 K gates and executes with a frequency of 10 MHz. The power consumption is only 2.92 mW with low power design considerations.

引用

页码：67 / 77

页数：11

共 50 条

[31] Blind Separation of Radar Signals Based on Detection of Time Frequency Single Source Point
Cheng, Xude
Liu, Fuli
Xue, Xuedong
Xu, Bing
Zheng, Yuan
RECENT DEVELOPMENTS IN INTELLIGENT SYSTEMS AND INTERACTIVE APPLICATIONS (IISA2016), 2017, 541 : 411 - 417
[32] A Step Toward Real-Time Time-Frequency Analyses with Varying Time-Frequency Resolutions: Hardware Implementation of an Adaptive S-transform
Radovic, Nevena
Ivanovic, Veselin N.
Djurovic, Igor
Simeunovic, Marko
Sejdic, Ervin
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (02) : 853 - 874
[33] Distant speech separation using predicted time-frequency masks from spatial features
Pertila, Pasi
Nikunen, Joonas
SPEECH COMMUNICATION, 2015, 68 : 97 - 106
[34] Eliminating the Permutation Ambiguity of Convolutive Blind Source Separation by Using Coupled Frequency Bins
Xie, Kan
Zhou, Guoxu
Yang, Junjie
He, Zhaoshui
Xie, Shengli
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 589 - 599
[35] A PARTITIONED FREQUENCY DOMAIN ALGORITHM FOR CONVOLUTIVE BLIND SOURCE SEPARATION
Scarpiniti, Michele
Picaro, Andrea
Parisi, Raffaele
Uncini, Aurelio
2009 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2009, : 411 - 416
[36] Speech Enhancement in Low SNR Environments by Designing a Time-Frequency Binary Mask
Cheng, Shuai
Zhang, Haijian
Hua, Guang
2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
[37] ON TIME-FREQUENCY MASK ESTIMATION FOR MVDR BEAMFORMING WITH APPLICATION IN ROBUST SPEECH RECOGNITION
Xiao, Xiong
Zhao, Shengkui
Jones, Douglas L.
Chng, Eng Siong
Li, Haizhou
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 3246 - 3250
[38] Independent Vector Extraction for Fast Joint Blind Source Separation and Dereverberation
Ikeshita, Rintaro
Nakatani, Tomohiro
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 972 - 976
[39] Blind source separation based on time-domain optimization of a frequency-domain independence criterion
Mei, Tiemin
Xi, Jiangtao
Yin, Fuliang
Mertins, Alfred
Chicharo, Joe F.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2075 - 2085
[40] A permutation algorithm based on dynamic time warping in speech frequency-domain blind source separation
Lv, Zhao
Zhang, Bei-bei
Wu, Xiao-pei
Zhang, Chao
Zhou, Bang-yan
SPEECH COMMUNICATION, 2017, 92 : 132 - 141

← 1 2 3 4 5 →