Lip-Geometry Feature-based Visual Digit Recognition

被引：0

作者：

Debnath, Saswati ^{[1
]}

Senbagavalli, M. ^{[1
]}

Rajagopal, R. ^{[1
]}

机构：

[1] Alliance Univ, Bangalore 562106, Karnataka, India

来源：

SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 5, SMARTCOM 2024 | 2024年 / 949卷

关键词：

Lip-geometry; PZMI; RBF-NN; ANN; ZERNIKE; MFCC;

D O I：

10.1007/978-981-97-1313-4_34

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

One of the most prevalent and natural forms of expression is speech. The level of speech understanding can be increased by the human using visual clues like lip and tongue movements. Visual speech recognition is the act of understanding speech by seeing the speaker's motion of the lips. To recognize visual speech, shape or geometry-based characteristics extract the speaker's lip movement. This paper proposes a lip geometry-based visual feature for visual digit recognition. The lip geometry of a speaker is calculated using the Pseudo-Zernike Moment Invariant (PZMI). Artificial Neural Network (ANN) and Radial Basis Function Neural Network (RBF-NN) are used here to recognize the speech for visual modality. The aim of the proposed work is to extract translation, rotation, and scale-invariant visual features. Moments invariant features are used in classification and recognition work as pattern-sensitive features. These features are proving discriminating properties for similar images which is very important for recognizing different visual speech. Because the accuracy of these futures has a significant impact on the classifiers used. The proposed system achieves 80% and 78.3% recognition accuracy using RBF-NN and ANN, respectively.

引用

页码：397 / 406

页数：10

共 43 条

[1] A LIP GEOMETRY APPROACH FOR FEATURE-FUSION BASED AUDIO-VISUAL SPEECH RECOGNITION
Ibrahim, M. Z.
Mulvaney, D. J.
2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 644 - 647
[2] Feature-based Noise Robust Speech Recognition on an Indonesian Language Automatic Speech Recognition System
Satriawan, Cil Hardianto
Lestari, Dessi Puji
2014 International Conference on Electrical Engineering and Computer Science (ICEECS), 2014, : 42 - 46
[3] Biometric Face Identification: Utilizing Soft Computing Methods for Feature-Based Recognition
Singh, Mahesh K.
Kumar, Sanjeev
Nandan, Durgesh
TRAITEMENT DU SIGNAL, 2024, 41 (05) : 2721 - 2728
[4] Spatio-temporal Weber Gradient Directional feature for visual and audio-visual phrase recognition systems
Salam Nandakishor
Debadatta Pati
International Journal of Information Technology, 2025, 17 (3) : 1359 - 1369
[5] Speech Emotion Recognition based on Multiple Feature Fusion
Jiang, Changjiang
Mao, Rong
Liu, Geng
Wang, Mingyi
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 907 - 912
[6] New Feature Vector based on GFCC for Language Recognition
Chandrasekaram, B.
JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (02) : 481 - 486
[7] Speaker recognition using PCA-based feature transformation
Ahmed, Ahmed Isam
Chiverton, John P.
Ndzi, David L.
Becerra, Victor M.
SPEECH COMMUNICATION, 2019, 110 : 33 - 46
[8] Classification and Recognition of Underwater Target Based on MFCC Feature Extraction
Tong, Yuze
Zhang, Xin
Ge, Yizhou
2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
[9] A dynamic feature extraction based on wavelet transforms for speaker recognition
Me Chunrong
Zhang Jianhuan
Long Fei
ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL I, 2007, : 595 - 598
[10] Performance Evaluation of Feature Selection Methods for ANN Based Iris Recognition
Meetei, Thiyam Churjit
Begum, Shahin Ara
2013 1ST INTERNATIONAL CONFERENCE ON EMERGING TRENDS AND APPLICATIONS IN COMPUTER SCIENCE (ICETACS), 2013, : 208 - 213

← 1 2 3 4 5 →