Estimation of speaking speed for faster face detection in video-footage

被引：1

作者：

Ikeda, O ^{[1
]}

机构：

[1] Takushoku Univ, Fac Engn, Hachioji, Tokyo 1930985, Japan

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2 | 2005年

关键词：

D O I：

10.1109/ICME.2005.1521455

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We previously reported a face detection system based on color segmentation using HSV. It was shown that the color is more effective than other colors not only in accurate segmentation but also in effective extraction of facial features. The first is crucial for detection and the latter for recognition. When it comes to video footages of news program, sound often accompanies the video and persons express themselves by moving facial parts while speaking. In this paper we improve the face detection in speed using both sound and video in a combined way. First, the rate of syllables spoken is estimated from the sound. Next, for a beginning short video clip of each new scene, a differential image is formed with the frame distance corresponding to the rate to find mouth and eyes. This enables us to reduce the number of sampling points for segmentation to a great degree and to enhance the reliability of the detection. Also music is discriminated from speaking by the estimation. These contribute to much faster detection of face.

引用

页码：442 / 445

页数：4

共 50 条

[31] Preliminary yield estimation of the 2020 Beirut explosion using video footage from social media
S. E. Rigby
T. J. Lodge
S. Alotaibi
A. D. Barr
S. D. Clarke
G. S. Langdon
A. Tyas
Shock Waves, 2020, 30 : 671 - 675
[32] Face detection and tracking in a video by propagating detection probabilities
Verma, RC
Schmid, C
Mikolajczyk, K
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (10) : 1215 - 1228
[33] Preliminary yield estimation of the 2020 Beirut explosion using video footage from social media
Rigby, S. E.
Lodge, T. J.
Alotaibi, S.
Barr, A. D.
Clarke, S. D.
Langdon, G. S.
Tyas, A.
SHOCK WAVES, 2020, 30 (06) : 671 - 675
[34] Real-time multi-view face detection and pose estimation in video stream
Wang, Yan
Liu, Yanghua
Tao, Linmi
Xu, Guangyou
18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 354 - +
[35] Enhancing face detection in video sequences by video segmentation preprocessing
Liu, Huibin
Fan, Zuoxun
Chen, Qiang
Zhang, Xiaomei
APPLIED INTELLIGENCE, 2023, 53 (03) : 2897 - 2907
[36] Enhancing face detection in video sequences by video segmentation preprocessing
Huibin Liu
Zuoxun Fan
Qiang Chen
Xiaomei Zhang
Applied Intelligence, 2023, 53 : 2897 - 2907
[37] PIFS Scheme for HEad Pose Estimation Aimed at Faster Face Recognition
Bisogni, Carmen
Nappi, Michele
Pero, Chiara
Ricciardi, Stefano
IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2022, 4 (02): : 173 - 184
[38] Pose estimation and frontal face detection for face recognition
Lim, ET
Wang, J
Xie, W
Venkarteswarlu, R
Visual Information Processing XIV, 2005, 5817 : 97 - 105
[39] Video-conferencing speaking tests: do they measure the same construct as face-to-face tests?
Nakatsuhara, Fumiyo
Inoue, Chihiro
Berry, Vivien
Galaczi, Evelina
ASSESSMENT IN EDUCATION-PRINCIPLES POLICY & PRACTICE, 2021, 28 (04) : 369 - 388
[40] A Fast Method of Face Detection in Video Images
Zhang, Lijing
Liang, Yingli
2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 490 - 494

← 1 2 3 4 5 →