Visual Lip Activity Detection and Speaker Detection Using Mouth Region Intensities

被引:29
作者
Siatras, Spyridon [1 ]
Nikolaidis, Nikos [1 ]
Krinidis, Michail [1 ]
Pitas, Ioannis [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
关键词
Speaker detection; visual speech detection; SPEECH; FEATURES;
D O I
10.1109/TCSVT.2008.2009262
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we introduce a novel approach for lip activity detection and speaker detection, using solely visual information. The main idea in this work is to apply signal detection algorithms to a simple and easily extracted feature from the mouth region. We argue that the increased average value and standard deviation of the number of pixels with low intensities that the mouth region of a speaking person demonstrates can be used as visual cues for detecting visual speech. We then proceed in deriving a statistical algorithm that utilizes this fact for the efficient characterization of visual speech and silence In video sequences. Furthermore, we employ the lip activity detection method in order to determine the active speaker(s) in a multi-person environment.
引用
收藏
页码:133 / 137
页数:5
相关论文
共 50 条
  • [21] The DKU Speech Activity Detection and Speaker Identification Systems for Fearless Steps Challenge Phase-02
    Lin, Qingjian
    Li, Tingle
    Li, Ming
    INTERSPEECH 2020, 2020, : 2607 - 2611
  • [22] VISUAL SALIENCY ANALYSIS FOR COMMON REGION OF INTEREST DETECTION IN MULTIPLE REMOTE SENSING IMAGES
    Zhang, Libao
    Sun, Qiaoyue
    Sun, Yang
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2316 - 2320
  • [23] Enhanced speaker diarization with detection of backchannels using eye-gaze information in poster conversations
    Inoue, Koji
    Wakabayashi, Yukoh
    Yoshimoto, Hiromasa
    Takanashi, Katsuya
    Kawahara, Tatsuya
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3086 - 3090
  • [24] Automatic Detection of Breath Using Voice Activity Detection and SVM Classifier with Application on News Reports
    Arafath, Mohamed Ismail Yasar K.
    Routray, Aurobinda
    INTERSPEECH 2019, 2019, : 609 - 613
  • [25] Vowel detection using a perceptually-enhanced spectrum matching conditioned to phonetic context and speaker identity
    Kashani, Hamidreza Baradaran
    Sayadiyan, Abolghasem
    Sheikhzadeh, Hamid
    SPEECH COMMUNICATION, 2017, 91 : 28 - 48
  • [26] Rotation-covariant visual concept detection using steerable Riesz wavelets and bags of visual words
    Depeursinge, Adrien
    Foncubierta, Antonio
    Mueller, Henning
    Van de Ville, Dimitri
    WAVELETS AND SPARSITY XV, 2013, 8858
  • [27] Voice Activity Detection Using Discriminative Restricted Boltzmann Machines
    Borin, Rogerio G.
    Silva, Magno T. M.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 523 - 527
  • [28] Unsupervised birdcall activity detection using source and system features
    Thakur, Anshul
    Rajan, Padmanabhan
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [29] Visual Odometry With Loop Closing Detection Using Iterative Closest SURF Point
    Chaib, Khaoula
    Hamerlain, Mustapha
    Achour, Nouara
    Nemra, Abdelkrim
    2017 6TH INTERNATIONAL CONFERENCE ON SYSTEMS AND CONTROL (ICSC' 17), 2017, : 503 - 508
  • [30] Robust Voice Activity Detection Using Gammatone Filtering and Entropy
    Ong, W. Q.
    Tan, A. W. C.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND SCIENCES (ICORAS 2016), 2016,