SPATIAL-TEMPORAL GRAPH CONVOLUTION NETWORK FOR MULTICHANNEL SPEECH ENHANCEMENT

Cited by: 4
Authors
Hao, Minghui [1]
Yu, Jingjing [1]
Zhang, Luyao [1]
Affiliations
[1] Beijing Jiaotong Univ, Elect & Informat Engn, Beijing, Peoples R China
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Keywords
Graph convolution network; spatial dependency extraction; spatial-temporal convolution module; SII-weighted loss function; speech enhancement;
DOI
10.1109/ICASSP43922.2022.9746054
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
Spatial dependency related to distributed microphone positions is essential for the multichannel speech enhancement task. Capturing it remains challenging because accurate array positions are often unavailable and the spatial-temporal relations among multichannel noisy signals are complex. This paper proposes a spatial-temporal graph convolutional network composed of cascaded spatial-temporal (ST) modules with channel fusion. Without any prior information about the array or the acoustic scene, a graph convolution block with a learnable adjacency matrix is designed to capture the spatial dependency between pairs of channels. It is then combined with a time-frequency convolution block to form the ST module, which fuses the multi-dimensional correlation features for target speech estimation. Furthermore, a novel weighted loss function based on the speech intelligibility index (SII) is proposed to give more weight during network training to the frequency bands most important for human understanding. The framework is reported to achieve over 11% improvement in PESQ and intelligibility over prior state-of-the-art approaches in multi-scene speech enhancement experiments.
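The idea of a graph convolution over microphone channels with a learnable adjacency matrix can be illustrated with a minimal sketch. This is not the authors' architecture: the function names (`softmax_rows`, `graph_conv`), the row-softmax normalization of the adjacency parameters, and the single shared projection matrix are all assumptions made for illustration, and in a real model `adj_logits` and `weight` would be trained by gradient descent.

```python
import math


def softmax_rows(matrix):
    """Row-wise softmax: each channel's edge weights become positive and sum to 1."""
    out = []
    for row in matrix:
        peak = max(row)
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def graph_conv(features, adj_logits, weight):
    """One graph-convolution step across channels (illustrative only).

    features:   C x F nested list, one F-dim feature vector per channel
    adj_logits: C x C nested list of unconstrained parameters (hypothetical;
                these play the role of the learnable adjacency matrix)
    weight:     F x G nested list, a projection shared by all channels
    Returns a C x G nested list.
    """
    adj = softmax_rows(adj_logits)
    n_ch, n_in, n_out = len(features), len(weight), len(weight[0])
    # Aggregate: every channel mixes the features of all channels by edge weight.
    mixed = [
        [sum(adj[i][j] * features[j][f] for j in range(n_ch)) for f in range(n_in)]
        for i in range(n_ch)
    ]
    # Project: apply the shared linear map to each channel's aggregated vector.
    return [
        [sum(mixed[i][f] * weight[f][g] for f in range(n_in)) for g in range(n_out)]
        for i in range(n_ch)
    ]
```

With all-zero `adj_logits` the softmax yields uniform edge weights, so every channel receives the average of all channel features; training would move the logits away from zero so that strongly related channel pairs get larger weights, without needing the array geometry as input.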
Pages: 6512-6516
Number of pages: 5
Related papers
50 in total
  • [1] Multichannel spatial-temporal graph convolution network based on spectrum decomposition for traffic prediction
    Lei, Tianyang
    Yang, Kewei
    Li, Jichao
    Chen, Gang
    Jiang, Jiuyao
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [2] Dual Dynamic Spatial-Temporal Graph Convolution Network for Traffic Prediction
    Sun, Yanfeng
    Jiang, Xiangheng
    Hu, Yongli
    Duan, Fuqing
    Guo, Kan
    Wang, Boyue
    Gao, Junbin
    Yin, Baocai
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 23680 - 23693
  • [3] Multi-dimensional spatial-temporal graph convolution for urban sensors imputation and enhancement
    Huang, Longji
    Huang, Jianbin
    Li, He
    Cui, Jiangtao
    KNOWLEDGE-BASED SYSTEMS, 2023, 278
  • [4] Spatial-Temporal Aggregation Graph Convolution Network for Efficient Mobile Cellular Traffic Prediction
    Zhao, Nan
    Wu, Aonan
    Pei, Yiyang
    Liang, Ying-Chang
    Niyato, Dusit
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (03) : 587 - 591
  • [5] Spatial-Temporal Attention Graph Convolution Network on Edge Cloud for Traffic Flow Prediction
    Lai, Qifeng
    Tian, Jinyu
    Wang, Wei
    Hu, Xiping
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) : 4565 - 4576
  • [6] Adaptive Spatial-Temporal Convolution Network for Traffic Forecasting
    Li, Zhao
    Zhang, Yong
    Zhang, Zhao
    Wang, Xing
    Zhu, Lin
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, 2022, 13369 : 287 - 299
  • [7] STIGCN: spatial-temporal interaction-aware graph convolution network for pedestrian trajectory prediction
    Chen, Wangxing
    Sang, Haifeng
    Wang, Jinyu
    Zhao, Zishan
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (08) : 10695 - 10719
  • [8] Spatial-Temporal Traffic Flow Prediction With Fusion Graph Convolution Network and Enhanced Gated Recurrent Units
    Cai, Chuang
    Qu, Zhijian
    Ma, Liqun
    Yu, Lianfei
    Liu, Wenbo
    Ren, Chongguang
    IEEE ACCESS, 2024, 12 : 56477 - 56491
  • [9] Graph Convolution Based Spatial-Temporal Attention LSTM Model for Flood Forecasting
    Feng, Jun
    Sha, Haichao
    Ding, Yukai
    Yan, Le
    Yu, Zhangheng
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] Multi-Stream and Enhanced Spatial-Temporal Graph Convolution Network for Skeleton-Based Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Xu, Yonggang
    Cui, Ran
    Hua, Gang
    IEEE ACCESS, 2020, 8 : 97757 - 97770