Speaker Separation Using Visual Speech Features and Single-channel Audio

被引:0
|
作者
Khan, Faheem [1 ]
Milner, Ben [1 ]
机构
[1] Univ East Anglia, Sch Comp Sci, Norwich, Norfolk, England
关键词
Speaker separation; Wiener filter; visual features; audio-visual correlation; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work proposes a method of single-channel speaker separation that uses visual speech information to extract a target speaker's speech from a mixture of speakers. The method requires a single audio input and visual features extracted from the mouth region of each speaker in the mixture. The visual information from speakers is used to create a visually-derived Wiener filter. The Wiener filter gains are then non-linearly adjusted by a perceptual gain transform to improve the quality and intelligibility of the target speech. Experimental results are presented that estimate the quality and intelligibility of the extracted target speaker and a comparison is made of different perceptual gain transforms. These show that significant gains are achieved by the application of the perceptual gain function.
引用
收藏
页码:3263 / 3267
页数:5
相关论文
共 50 条
  • [21] SINGLE-CHANNEL SPEECH SEPARATION BY USING A SPARSE DECOMPOSITION WITH PERIODIC STRUCTURE
    Nakashizuka, Makoto
    Okumura, Hiroyuki
    Iiguni, Youji
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2008), 2008, : 339 - 342
  • [22] Single-channel speech separation using sequential discriminative dictionary learning
    Ye, Zhongfu, 1600, Elsevier B.V., Netherlands (106):
  • [23] New Results on Single-Channel Speech Separation Using Sinusoidal Modeling
    Mowlaee, Pejman
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1265 - 1277
  • [24] Single-Channel Speech Separation Using Phase-Based Methods
    Lee, Yun-Kyung
    Lee, In Sung
    Kwon, Oh-Wook
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2453 - 2459
  • [25] SINGLE-CHANNEL SPEECH SEPARATION AND RECOGNITION USING LOOPY BELIEF PROPAGATION
    Rennie, Steven J.
    Hershey, John R.
    Olsen, Peder A.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3845 - 3848
  • [26] Single-channel speech separation using sequential discriminative dictionary learning
    Xu, Yangfei
    Bao, Guangzhao
    Xu, Xu
    Ye, Zhongfu
    SIGNAL PROCESSING, 2015, 106 : 134 - 140
  • [27] TASNET: TIME-DOMAIN AUDIO SEPARATION NETWORK FOR REAL-TIME, SINGLE-CHANNEL SPEECH SEPARATION
    Luo, Yi
    Mesgarani, Nima
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 696 - 700
  • [28] Speaker Counting and Separation From Single-Channel Noisy Mixtures
    Chetupalli, Srikanth Raj
    Habets, Emanuel A. P.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1681 - 1692
  • [29] Single-channel speech separation using combined EMD and speech-specific information
    Prasanna Kumar M.K.
    Kumaraswamy R.
    International Journal of Speech Technology, 2017, 20 (4) : 1037 - 1047
  • [30] Speaker Verification Based on Single Channel Speech Separation
    Jin, Rong
    Ablimit, Mijit
    Hamdulla, Askar
    IEEE ACCESS, 2023, 11 : 112631 - 112638