Myo-To-Speech - Evolving Fuzzy-Neural Network Prediction of Speech Utterances from Myoelectric Signals

被引:2
|
作者
Malcangi, Mario [1 ]
Felisati, Giovanni [2 ]
Saibene, Alberto [2 ]
Alfonsi, Enrico [3 ]
Fresia, Mauro [3 ]
Maffioletti, Roberto [1 ]
Quan, Hao [1 ]
机构
[1] Univ Milan, Comp Sci Dept, Milan, Italy
[2] Univ Milan, Dept Heath Sci, Milan, Italy
[3] IRCCS Mondino Fdn, Pavia, Italy
来源
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2018 | 2018年 / 893卷
关键词
EFuNN; Evolving Fuzzy Neural Network; Voice dysarthria; Voice rehabilitation; Myoelectric signal; SPASMODIC DYSPHONIA; ACTIVATION;
D O I
10.1007/978-3-319-98204-5_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Voice rehabilitation is needed after several diseases, when a subject's vocal ability is compromised by surgical interference or removal of phonation organs (e.g. the larynx), by neural degeneration or by neurological injury to the motor component of the motor-speech system in the phonation area of the brain (e.g. dysarthria in Parkinson disease). A novel approach to voice rehabilitation consists of predicting the phonetic control sequence of the voice-production apparatus (larynx, tongue, etc.) by drawing inferences on the basis of myoelectric (EMG) signals captured by a set of contact electrodes, applied to the neck area of a subject with important phonatory alteration (e.g. laryngectomised) and intact neural control. The inference paradigm is based on an EFuNN (Evolving Fuzzy Neural Network) that has been trained to use the sampled EMG signal to predict the phoneme that corresponds to the motor control of the sublingual muscle movements monitored at phonation time. A phoneme-to-speech synthesizer generates audio output corresponding to the utterance the subject has tried to enunciate.
引用
收藏
页码:158 / 168
页数:11
相关论文
共 3 条