Silent Speech Interface Using Lip-Reading Methods

被引:0
|
作者
Jothibalaji, Raghupathy [1 ]
Adithya, S. Siva [1 ]
Saravanan, N. V. [1 ]
Dhanalakshmi, M. [1 ]
机构
[1] Sri Sivasubramaniya Nadar Coll Engn, Dept Biomed Engn, Chennai 603110, India
来源
BIOMEDICAL ENGINEERING SCIENCE AND TECHNOLOGY, ICBEST 2023 | 2024年 / 2003卷
关键词
Silent Speech Interface; Total Laryngectomy; Image processing; EXTRACTION;
D O I
10.1007/978-3-031-54547-4_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A silent speech interface (SSI) maps articulatory movement data to forecast the speech of mute people. SSIs have vast potential for improving oral communication in peoplewho have had laryngectomy surgery or thosewith severe voice abnormalities. In this paper, wepresent a lip-reading recognition algorithm to recognize Englishwords from the lipswhen speaking. The lip contour and English words were distinguished using a variety of lip metric metrics, including width, height, contour points, area, and ratio (width/height). In this test, fifteen talkers mouthed (silently articulated) sentences to finish a phrase-reading assignment in an experimental setting. Among the fifteen participants, 89% and 81% of the mouthed phrases were correctly recognized for two different classifier cases respectively. The automated ROI detection, elimination of lip shape effects, and operation without prior data training are strengths of the proposed method. The experimental outcomes showed the viability and potential of lip-reading-based silent speech interfaces for future clinical applications.
引用
收藏
页码:9 / 23
页数:15
相关论文
共 41 条
  • [1] Improving the Performance of Automatic Lip-Reading Using Image Conversion Techniques
    Lee, Ki-Seung
    ELECTRONICS, 2024, 13 (06)
  • [2] An adaptive approach for lip-reading using image and depth data
    Rekik, Ahmed
    Ben-Hamadou, Achraf
    Mahdi, Walid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (14) : 8609 - 8636
  • [3] Cantonese sentence dataset for lip-reading
    Xiao, Yewei
    Liu, Xuanming
    Teng, Lianwei
    Zhu, Aosu
    Tian, Picheng
    Huang, Jian
    IET IMAGE PROCESSING, 2024, 18 (10) : 2645 - 2664
  • [4] Study of lip-reading detecting and locating technique
    Wang, Lirong
    Li, Jie
    Zhao, Yanyan
    ELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY V, PTS 1 AND 2, 2008, 6833
  • [5] Silent Speech Interface Using Ultrasonic Doppler Sonar
    Lee, Ki-Seung
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (08): : 1875 - 1887
  • [6] Ultrasonic Doppler Based Silent Speech Interface Using Perceptual Distance
    Lee, Ki-Seung
    APPLIED SCIENCES-BASEL, 2022, 12 (02):
  • [7] Design of a Silent Speech Interface using Facial Gesture Recognition and Electromyography
    Nair, Aishwarya
    Shashikumar, Niranjana
    Vidhya, S.
    Kirthika, S. K.
    16TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2017, 61 : 117 - 122
  • [8] Geometrical-based lip-reading using template probabilistic multi-dimension dynamic time warping
    Ibrahim, M. Z.
    Mulvaney, D. J.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 30 : 219 - 233
  • [9] A Review on Methods and Classifiers in Lip Reading
    Aran, Leticia Ria
    Wong, Farrah
    Yi, Lim Pei
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS (I2CACIS), 2017, : 196 - 201
  • [10] AlterEgo: A Personalized Wearable Silent Speech Interface
    Kapur, Arnav
    Kapur, Shreyas
    Maes, Pattie
    IUI 2018: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2018, : 43 - 53