Estimation of speaking speed for faster face detection in video-footage

被引:1
|
作者
Ikeda, O [1 ]
机构
[1] Takushoku Univ, Fac Engn, Hachioji, Tokyo 1930985, Japan
来源
2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2 | 2005年
关键词
D O I
10.1109/ICME.2005.1521455
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We previously reported a face detection system based on color segmentation using HSV. It was shown that the color is more effective than other colors not only in accurate segmentation but also in effective extraction of facial features. The first is crucial for detection and the latter for recognition. When it comes to video footages of news program, sound often accompanies the video and persons express themselves by moving facial parts while speaking. In this paper we improve the face detection in speed using both sound and video in a combined way. First, the rate of syllables spoken is estimated from the sound. Next, for a beginning short video clip of each new scene, a differential image is formed with the frame distance corresponding to the rate to find mouth and eyes. This enables us to reduce the number of sampling points for segmentation to a great degree and to enhance the reliability of the detection. Also music is discriminated from speaking by the estimation. These contribute to much faster detection of face.
引用
收藏
页码:442 / 445
页数:4
相关论文
共 50 条
  • [31] Preliminary yield estimation of the 2020 Beirut explosion using video footage from social media
    S. E. Rigby
    T. J. Lodge
    S. Alotaibi
    A. D. Barr
    S. D. Clarke
    G. S. Langdon
    A. Tyas
    Shock Waves, 2020, 30 : 671 - 675
  • [32] Face detection and tracking in a video by propagating detection probabilities
    Verma, RC
    Schmid, C
    Mikolajczyk, K
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (10) : 1215 - 1228
  • [33] Preliminary yield estimation of the 2020 Beirut explosion using video footage from social media
    Rigby, S. E.
    Lodge, T. J.
    Alotaibi, S.
    Barr, A. D.
    Clarke, S. D.
    Langdon, G. S.
    Tyas, A.
    SHOCK WAVES, 2020, 30 (06) : 671 - 675
  • [34] Real-time multi-view face detection and pose estimation in video stream
    Wang, Yan
    Liu, Yanghua
    Tao, Linmi
    Xu, Guangyou
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 354 - +
  • [35] Enhancing face detection in video sequences by video segmentation preprocessing
    Liu, Huibin
    Fan, Zuoxun
    Chen, Qiang
    Zhang, Xiaomei
    APPLIED INTELLIGENCE, 2023, 53 (03) : 2897 - 2907
  • [36] Enhancing face detection in video sequences by video segmentation preprocessing
    Huibin Liu
    Zuoxun Fan
    Qiang Chen
    Xiaomei Zhang
    Applied Intelligence, 2023, 53 : 2897 - 2907
  • [37] PIFS Scheme for HEad Pose Estimation Aimed at Faster Face Recognition
    Bisogni, Carmen
    Nappi, Michele
    Pero, Chiara
    Ricciardi, Stefano
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2022, 4 (02): : 173 - 184
  • [38] Pose estimation and frontal face detection for face recognition
    Lim, ET
    Wang, J
    Xie, W
    Venkarteswarlu, R
    Visual Information Processing XIV, 2005, 5817 : 97 - 105
  • [39] Video-conferencing speaking tests: do they measure the same construct as face-to-face tests?
    Nakatsuhara, Fumiyo
    Inoue, Chihiro
    Berry, Vivien
    Galaczi, Evelina
    ASSESSMENT IN EDUCATION-PRINCIPLES POLICY & PRACTICE, 2021, 28 (04) : 369 - 388
  • [40] A Fast Method of Face Detection in Video Images
    Zhang, Lijing
    Liang, Yingli
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 490 - 494