Lip-Geometry Feature-based Visual Digit Recognition

被引:0
作者
Debnath, Saswati [1 ]
Senbagavalli, M. [1 ]
Rajagopal, R. [1 ]
机构
[1] Alliance Univ, Bangalore 562106, Karnataka, India
来源
SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 5, SMARTCOM 2024 | 2024年 / 949卷
关键词
Lip-geometry; PZMI; RBF-NN; ANN; ZERNIKE; MFCC;
D O I
10.1007/978-981-97-1313-4_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most prevalent and natural forms of expression is speech. The level of speech understanding can be increased by the human using visual clues like lip and tongue movements. Visual speech recognition is the act of understanding speech by seeing the speaker's motion of the lips. To recognize visual speech, shape or geometry-based characteristics extract the speaker's lip movement. This paper proposes a lip geometry-based visual feature for visual digit recognition. The lip geometry of a speaker is calculated using the Pseudo-Zernike Moment Invariant (PZMI). Artificial Neural Network (ANN) and Radial Basis Function Neural Network (RBF-NN) are used here to recognize the speech for visual modality. The aim of the proposed work is to extract translation, rotation, and scale-invariant visual features. Moments invariant features are used in classification and recognition work as pattern-sensitive features. These features are proving discriminating properties for similar images which is very important for recognizing different visual speech. Because the accuracy of these futures has a significant impact on the classifiers used. The proposed system achieves 80% and 78.3% recognition accuracy using RBF-NN and ANN, respectively.
引用
收藏
页码:397 / 406
页数:10
相关论文
共 43 条
  • [1] A LIP GEOMETRY APPROACH FOR FEATURE-FUSION BASED AUDIO-VISUAL SPEECH RECOGNITION
    Ibrahim, M. Z.
    Mulvaney, D. J.
    2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 644 - 647
  • [2] Feature-based Noise Robust Speech Recognition on an Indonesian Language Automatic Speech Recognition System
    Satriawan, Cil Hardianto
    Lestari, Dessi Puji
    2014 International Conference on Electrical Engineering and Computer Science (ICEECS), 2014, : 42 - 46
  • [3] Biometric Face Identification: Utilizing Soft Computing Methods for Feature-Based Recognition
    Singh, Mahesh K.
    Kumar, Sanjeev
    Nandan, Durgesh
    TRAITEMENT DU SIGNAL, 2024, 41 (05) : 2721 - 2728
  • [4] Spatio-temporal Weber Gradient Directional feature for visual and audio-visual phrase recognition systems
    Salam Nandakishor
    Debadatta Pati
    International Journal of Information Technology, 2025, 17 (3) : 1359 - 1369
  • [5] Speech Emotion Recognition based on Multiple Feature Fusion
    Jiang, Changjiang
    Mao, Rong
    Liu, Geng
    Wang, Mingyi
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 907 - 912
  • [6] New Feature Vector based on GFCC for Language Recognition
    Chandrasekaram, B.
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (02) : 481 - 486
  • [7] Speaker recognition using PCA-based feature transformation
    Ahmed, Ahmed Isam
    Chiverton, John P.
    Ndzi, David L.
    Becerra, Victor M.
    SPEECH COMMUNICATION, 2019, 110 : 33 - 46
  • [8] Classification and Recognition of Underwater Target Based on MFCC Feature Extraction
    Tong, Yuze
    Zhang, Xin
    Ge, Yizhou
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [9] A dynamic feature extraction based on wavelet transforms for speaker recognition
    Me Chunrong
    Zhang Jianhuan
    Long Fei
    ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL I, 2007, : 595 - 598
  • [10] Performance Evaluation of Feature Selection Methods for ANN Based Iris Recognition
    Meetei, Thiyam Churjit
    Begum, Shahin Ara
    2013 1ST INTERNATIONAL CONFERENCE ON EMERGING TRENDS AND APPLICATIONS IN COMPUTER SCIENCE (ICETACS), 2013, : 208 - 213