Hardware design for blind source separation using fast time-frequency mask technique

被引:2
|
作者
Tsai, Tsung-Han [1 ]
Liu, Pei-Yun [1 ]
Chiou, Yu-He [1 ]
机构
[1] Natl Cent Univ, Dept Elect Engn, Taoyuan, Taiwan
关键词
Blind separation; Time-frequency mask; Convolutive BSS; Reduction of DOA variance; VLSI Design; VLSI IMPLEMENTATION; ARCHITECTURE; MIXTURES;
D O I
10.1016/j.vlsi.2021.07.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a fast time-frequency mask technique that relies on the sparseness of source signals for blind source separation (BSS) to separate a mixture of two input sounds in a single signal automatically. Due to the sparseness of source signals, the signal can be distinguished when it is transformed into the time-frequency domain. Most previous methods did not mention the effect of different angles on accuracy. To overcome such problems, we first define two features which are normalized level-ratio and phase-difference. Next, we use our method to decrease the variance of Direction of Arrival (DOA). This can reduce the variance of features so that it can reduce the iterations of k-means. Finally, a mask is generated according to the clustered features. Our method does not require any prior information or parameter estimation. The motivation of the proposed design is to incorporate the BSS system with some smart voice appliances. In the application scenario, all the non-human voices may appear and regard as interference. We use Signal to Distortion Ratio (SDR) and Signal to Interference Ratio (SIR) to make some comparison. Based on the proposed system, then we present a hardware design. We use the TSMC 90-nm CMOS process. As a cost-effective result, it consumes about 120 K gates and executes with a frequency of 10 MHz. The power consumption is only 2.92 mW with low power design considerations.
引用
收藏
页码:67 / 77
页数:11
相关论文
共 50 条
  • [31] Blind Separation of Radar Signals Based on Detection of Time Frequency Single Source Point
    Cheng, Xude
    Liu, Fuli
    Xue, Xuedong
    Xu, Bing
    Zheng, Yuan
    RECENT DEVELOPMENTS IN INTELLIGENT SYSTEMS AND INTERACTIVE APPLICATIONS (IISA2016), 2017, 541 : 411 - 417
  • [32] A Step Toward Real-Time Time-Frequency Analyses with Varying Time-Frequency Resolutions: Hardware Implementation of an Adaptive S-transform
    Radovic, Nevena
    Ivanovic, Veselin N.
    Djurovic, Igor
    Simeunovic, Marko
    Sejdic, Ervin
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (02) : 853 - 874
  • [33] Distant speech separation using predicted time-frequency masks from spatial features
    Pertila, Pasi
    Nikunen, Joonas
    SPEECH COMMUNICATION, 2015, 68 : 97 - 106
  • [34] Eliminating the Permutation Ambiguity of Convolutive Blind Source Separation by Using Coupled Frequency Bins
    Xie, Kan
    Zhou, Guoxu
    Yang, Junjie
    He, Zhaoshui
    Xie, Shengli
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 589 - 599
  • [35] A PARTITIONED FREQUENCY DOMAIN ALGORITHM FOR CONVOLUTIVE BLIND SOURCE SEPARATION
    Scarpiniti, Michele
    Picaro, Andrea
    Parisi, Raffaele
    Uncini, Aurelio
    2009 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2009, : 411 - 416
  • [36] Speech Enhancement in Low SNR Environments by Designing a Time-Frequency Binary Mask
    Cheng, Shuai
    Zhang, Haijian
    Hua, Guang
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [37] ON TIME-FREQUENCY MASK ESTIMATION FOR MVDR BEAMFORMING WITH APPLICATION IN ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Zhao, Shengkui
    Jones, Douglas L.
    Chng, Eng Siong
    Li, Haizhou
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 3246 - 3250
  • [38] Independent Vector Extraction for Fast Joint Blind Source Separation and Dereverberation
    Ikeshita, Rintaro
    Nakatani, Tomohiro
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 972 - 976
  • [39] Blind source separation based on time-domain optimization of a frequency-domain independence criterion
    Mei, Tiemin
    Xi, Jiangtao
    Yin, Fuliang
    Mertins, Alfred
    Chicharo, Joe F.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2075 - 2085
  • [40] A permutation algorithm based on dynamic time warping in speech frequency-domain blind source separation
    Lv, Zhao
    Zhang, Bei-bei
    Wu, Xiao-pei
    Zhang, Chao
    Zhou, Bang-yan
    SPEECH COMMUNICATION, 2017, 92 : 132 - 141