Silent Speech Interface Using Lip-Reading Methods

Cited by: 0

Authors:

Jothibalaji, Raghupathy [1]

Adithya, S. Siva [1]

Saravanan, N. V. [1]

Dhanalakshmi, M. [1]

Affiliation:

[1] Sri Sivasubramaniya Nadar Coll Engn, Dept Biomed Engn, Chennai 603110, India

Source:

BIOMEDICAL ENGINEERING SCIENCE AND TECHNOLOGY, ICBEST 2023 | 2024 / Vol. 2003

Keywords:

Silent Speech Interface; Total Laryngectomy; Image processing; EXTRACTION

DOI:

10.1007/978-3-031-54547-4_2

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

A silent speech interface (SSI) maps articulatory movement data to the speech that a person who cannot vocalize intends to produce. SSIs have considerable potential for improving oral communication in people who have had laryngectomy surgery or those with severe voice abnormalities. In this paper, we present a lip-reading recognition algorithm that recognizes English words from lip movements during speech. The lip contour and English words were distinguished using a variety of lip metrics, including width, height, contour points, area, and ratio (width/height). In the experiment, fifteen talkers mouthed (silently articulated) sentences to complete a phrase-reading task in an experimental setting. Across the fifteen participants, 89% and 81% of the mouthed phrases were correctly recognized for two different classifier cases, respectively. The strengths of the proposed method are automated ROI detection, elimination of lip shape effects, and operation without prior data training. The experimental outcomes showed the viability and potential of lip-reading-based silent speech interfaces for future clinical applications.
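The lip metrics named in the abstract (width, height, area, and width/height ratio) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `lip_metrics`, the input format (an ordered list of (x, y) contour points in pixels), and the choice of a bounding-box width and height with a shoelace-formula area are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]


def lip_metrics(contour: List[Point]) -> Dict[str, float]:
    """Compute simple geometric features from an ordered, closed lip contour.

    Hypothetical helper: the record does not say how the paper computes
    these quantities, so bounding-box extents and polygon area are assumed.
    """
    if len(contour) < 3:
        raise ValueError("need at least 3 contour points")

    xs = [x for x, _ in contour]
    ys = [y for _, y in contour]

    # Width and height taken as the extents of the contour's bounding box.
    width = max(xs) - min(xs)
    height = max(ys) - min(ys)

    # Polygon area via the shoelace formula over consecutive point pairs,
    # wrapping from the last point back to the first.
    twice_area = 0.0
    for (x0, y0), (x1, y1) in zip(contour, contour[1:] + contour[:1]):
        twice_area += x0 * y1 - x1 * y0
    area = abs(twice_area) / 2.0

    # Guard against a fully closed mouth (zero height).
    ratio = width / height if height > 0 else float("inf")

    return {
        "width": width,
        "height": height,
        "area": area,
        "ratio": ratio,
        "n_points": float(len(contour)),
    }


if __name__ == "__main__":
    # A diamond-shaped toy contour standing in for a detected lip outline.
    toy_contour = [(0.0, 1.0), (3.0, 0.0), (6.0, 1.0), (3.0, 2.0)]
    print(lip_metrics(toy_contour))
```

A per-frame feature vector of this kind could then be passed to a classifier; which classifiers the paper uses for its two reported cases is not stated in this record.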


Pages: 9-23

Page count: 15

