LEARNING-BASED PERSONAL SPEECH ENHANCEMENT FOR TELECONFERENCING BY EXPLOITING SPATIAL-SPECTRAL FEATURES

被引:6
|
作者
Hsu, Yicheng [1 ]
Lee, Yonghan [1 ]
Bai, Mingsian R. [1 ,2 ]
机构
[1] Natl Tsing Hua Univ, Dept Power Mech Engn, Hsinchu, Taiwan
[2] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
关键词
spatial coherence analysis; target speech enhancement; speaker embedding; convolutional recurrent neural network; SPEAKER EXTRACTION; SEPARATION;
D O I
10.1109/ICASSP43922.2022.9746859
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Teleconferencing is becoming essential during the COVID-19 pandemic. However, in real-world applications, speech quality can deteriorate due to, for example, background interference, noise, or reverberation. To solve this problem, target speech extraction from the mixture signals can be performed with the aid of the user's vocal features. Various features are accounted for in this study's proposed system, including speaker embeddings derived from user enrollment and a novel long-short-term spatial coherence (LSTSC) feature pertaining to the target speaker activity. As a learning-based approach, a target speech sifting network was employed to extract the target signal. The network trained with LSTSC in the proposed approach is robust to microphone array geometries and the number of microphones. Furthermore, the proposed enhancement system was compared with a baseline system with speaker embeddings and interchannel phase difference. The results demonstrated the superior performance of the proposed system over the baseline in enhancement performance and robustness.
引用
收藏
页码:8787 / 8791
页数:5
相关论文
共 50 条
  • [21] Integration of spatial-spectral information for resolution enhancement in hyperspectral images
    Gu, Yanfeng
    Zhang, Ye
    Zhang, Junping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (05): : 1347 - 1358
  • [22] SPATIAL-SPECTRAL DATA FUSION FOR RESOLUTION ENHANCEMENT OF HYPERSPECTRAL IMAGERY
    Mianji, Fereidoun A.
    Zhang, Ye
    Gu, Yanfeng
    Babakhani, Asad
    2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 2313 - +
  • [23] Hyperspectral image classification based on multi-branch spatial-spectral feature enhancement
    Li, Tie
    Li, Wenxu
    Wang, Junguo
    Gao, Qiaoyu
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (06) : 844 - 855
  • [24] Deep Spatial-Spectral Subspace Clustering for Hyperspectral Images Based on Contrastive Learning
    Hu, Xiang
    Li, Teng
    Zhou, Tong
    Peng, Yuanxi
    REMOTE SENSING, 2021, 13 (21)
  • [25] SPATIAL-SPECTRAL CONTRASTIVE LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Guan, Peiyan
    Lam, Edmund Y.
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1372 - 1375
  • [26] Fourier transform infrared spectroscopy microscopic imaging classification based on spatial-spectral features
    Liu, Lian
    Yang, Xiukun
    Zhong, Mingliang
    Liu, Yao
    Jing, Xiaojun
    Yang, Qin
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2018, 29 (04)
  • [27] Exploiting surface, content and relevance features for learning-based extractive summarization
    Wu, Mingli
    Li, Wenjie
    Wei, Furu
    Lu, Qin
    Wong, Kam-Fai
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 234 - +
  • [28] Graph-based spatial-spectral feature learning for hyperspectral image classification
    Ahmad, Muhammad
    Khan, Adil Mehmood
    Hussain, Rasheed
    IET IMAGE PROCESSING, 2017, 11 (12) : 1310 - 1316
  • [29] Frequency-chirped readout of spatial-spectral absorption features
    Chang, TJ
    Mohan, RK
    Tian, MZ
    Harris, TL
    Babbitt, WR
    Merkel, KD
    PHYSICAL REVIEW A, 2004, 70 (06): : 063803 - 1
  • [30] Automated Hyperspectral Image Classification Using Spatial-Spectral Features
    Dhok, Shivani
    Bhurane, Ankit
    Kothari, Ashwin
    2019 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2019, : 184 - 189