Silent Speech Interface Using Lip-Reading Methods

被引:0
|
作者
Jothibalaji, Raghupathy [1 ]
Adithya, S. Siva [1 ]
Saravanan, N. V. [1 ]
Dhanalakshmi, M. [1 ]
机构
[1] Sri Sivasubramaniya Nadar Coll Engn, Dept Biomed Engn, Chennai 603110, India
来源
BIOMEDICAL ENGINEERING SCIENCE AND TECHNOLOGY, ICBEST 2023 | 2024年 / 2003卷
关键词
Silent Speech Interface; Total Laryngectomy; Image processing; EXTRACTION;
D O I
10.1007/978-3-031-54547-4_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A silent speech interface (SSI) maps articulatory movement data to forecast the speech of mute people. SSIs have vast potential for improving oral communication in peoplewho have had laryngectomy surgery or thosewith severe voice abnormalities. In this paper, wepresent a lip-reading recognition algorithm to recognize Englishwords from the lipswhen speaking. The lip contour and English words were distinguished using a variety of lip metric metrics, including width, height, contour points, area, and ratio (width/height). In this test, fifteen talkers mouthed (silently articulated) sentences to finish a phrase-reading assignment in an experimental setting. Among the fifteen participants, 89% and 81% of the mouthed phrases were correctly recognized for two different classifier cases respectively. The automated ROI detection, elimination of lip shape effects, and operation without prior data training are strengths of the proposed method. The experimental outcomes showed the viability and potential of lip-reading-based silent speech interfaces for future clinical applications.
引用
收藏
页码:9 / 23
页数:15
相关论文
共 41 条
  • [31] Integrating User-Centred Design in the Development of a Silent Speech Interface Based on Permanent Magnetic Articulography
    Cheah, Lam A.
    Gilbert, James M.
    Gonzalez, Jose A.
    Bai, Jie
    Ell, Stephen R.
    Fagan, Michael J.
    Moore, Roger K.
    Green, Phil D.
    Rychenko, Sergey I.
    BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, BIOSTEC 2015, 2015, 574 : 324 - 337
  • [32] Visuo-Phonetic Decoding using Multi-Stream and Context-Dependent Models for an Ultrasound-based Silent Speech Interface
    Hueber, Thomas
    Benaroya, Elie-Laurent
    Chollet, Gerard
    Denby, Bruce
    Dreyfus, Gerard
    Stone, Maureen
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 628 - +
  • [33] Lip reading using wavelet-based features and Random Forests classification
    Terissi, Lucas D.
    Parodi, Marianela
    Gomez, Juan C.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 791 - 796
  • [34] Lip reading system using novel Japanese visemes classification and hierarchical weighted discrimination
    Okita, Shinsuke
    Mitsukura, Yasue
    Hamada, Nozomu
    2013 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS), 2013, : 531 - 536
  • [35] Statistical conversion of silent articulation into audible speech using full-covariance HMM
    Hueber, Thomas
    Bailly, Gerard
    COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 274 - 293
  • [36] Video analysis using spatiotemporal descriptor and kernel extreme learning machine for lip reading
    Lu, Longbin
    Zhang, Xinman
    Xu, Xuebin
    Shang, Dongpeng
    JOURNAL OF ELECTRONIC IMAGING, 2015, 24 (05)
  • [37] Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks
    Toth, Laszlo
    Shandiz, Amin Honarmandi
    Gosztolya, Gabor
    Csapo, Tamas Gabor
    INTERSPEECH 2023, 2023, : 1169 - 1173
  • [38] SilentSpeller: Towards mobile, hands-free, silent speech text entry using electropalatography
    Kimura, Naoki
    Gemicioglu, Tan
    Womack, Jonathan
    Li, Richard
    Zhao, Yuhui
    Bedri, Abdelkareem
    Su, Zixiong
    Olwal, Alex
    Rekimoto, Jun
    Starner, Thad
    PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,
  • [39] LipLearner: Customizing Silent Speech Commands from Voice Input using One-shot Lipreading
    Su, Zixiong
    Fang, Shitao
    Rekimoto, Jun
    ADJUNCT PROCEEDINGS OF THE 35TH ACM SYMPOSIUM ON USER INTERFACE SOFTWARE & TECHNOLOGY, UIST 2022, 2022,
  • [40] Multiclass Data Segmentation Using Diffuse Interface Methods on Graphs
    Garcia-Cardona, Cristina
    Merkurjev, Ekaterina
    Bertozzi, Andrea L.
    Flenner, Arjuna
    Percus, Allon G.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) : 1600 - 1613