Decoding and synthesizing tonal language speech from brain activity

被引:13
|
作者
Liu, Yan [1 ,2 ,3 ,4 ]
Zhao, Zehao [1 ,2 ,3 ,4 ]
Xu, Minpeng [1 ,2 ,4 ,5 ,6 ]
Yu, Haiqing [1 ,2 ,4 ,5 ]
Zhu, Yanming [1 ,2 ,3 ,4 ]
Zhang, Jie [1 ,2 ,3 ,4 ]
Bu, Linghao [1 ,2 ,3 ,4 ,7 ]
Zhang, Xiaoluo [1 ,2 ,3 ,4 ]
Lu, Junfeng [1 ,2 ,3 ,4 ,8 ]
Li, Yuanning [1 ,2 ,4 ,9 ]
Ming, Dong [1 ,2 ,4 ,5 ,6 ]
Wu, Jinsong [1 ,2 ,3 ,4 ]
机构
[1] Fudan Univ, Huashan Hosp, Shanghai Med Coll, Dept Neurosurg, Shanghai 200040, Peoples R China
[2] Natl Ctr Neurol Disorders, Shanghai 200052, Peoples R China
[3] Shanghai Key Lab Brain Funct Restorat & Neural Reg, Shanghai 200040, Peoples R China
[4] Fudan Univ, Neurosurg Inst, Shanghai 200052, Peoples R China
[5] Tianjin Univ, Coll Precis Instruments & Optoelect Engn, Dept Biomed Engn, Tianjin 300041, Peoples R China
[6] Tianjin Univ, Acad Med Engn & Translat Med, Tianjin 300041, Peoples R China
[7] Zhejiang Univ, Affiliated Hosp 1, Coll Med, Dept Neurosurg, Hangzhou 310000, Peoples R China
[8] Fudan Univ, MOE Frontiers Ctr Brain Sci, Huashan Hosp, Shanghai 200040, Peoples R China
[9] ShanghaiTech Univ, Sch Biomed Engn, Shanghai 201210, Peoples R China
来源
SCIENCE ADVANCES | 2023年 / 9卷 / 23期
关键词
HUMAN SENSORIMOTOR CORTEX; SPOKEN;
D O I
10.1126/sciadv.adh0478
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent studies have shown that the feasibility of speech brain-computer interfaces (BCIs) as a clinically valid treatment in helping nontonal language patients with communication disorders restore their speech ability. However, tonal language speech BCI is challenging because additional precise control of laryngeal movements to produce lexical tones is required. Thus, the model should emphasize the features from the tonal-related cortex. Here, we designed a modularized multistream neural network that directly synthesizes tonal language speech from intracranial recordings. The network decoded lexical tones and base syllables independently via parallel streams of neural network modules inspired by neuroscience findings. The speech was synthesized by combining tonal syllable labels with nondiscriminant speech neural activity. Compared to commonly used baseline models, our proposed models achieved higher performance with modest training data and computational costs. These findings raise a potential strategy for approaching tonal language speech restoration.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Synthesizing Speech From Brain Activity
    Abbasi, Jennifer
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2019, 321 (22): : 2155 - 2155
  • [2] Synthesizing Speech by Decoding Intracortical Neural Activity from Dorsal Motor Cortex
    Wairagkar, Maitreyee
    Hochberg, Leigh R.
    Brandman, David M.
    Stavisky, Sergey D.
    2023 11TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING, NER, 2023,
  • [3] DECODING THE GENETICS OF SPEECH AND LANGUAGE
    Fisher, Simon
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, : 17 - 17
  • [4] Decoding the genetics of speech and language
    Graham, Sarah A.
    Fisher, Simon E.
    CURRENT OPINION IN NEUROBIOLOGY, 2013, 23 (01) : 43 - 51
  • [5] Artificial intelligence based multimodal language decoding from brain activity: A review
    Zhao, Yuhao
    Chen, Yu
    Cheng, Kaiwen
    Huang, Wei
    BRAIN RESEARCH BULLETIN, 2023, 201
  • [6] MEGFormer: Enhancing Speech Decoding from Brain Activity Through Extended Semantic Representations
    Boyko, Maria
    Druzhinina, Polina
    Kormakov, Georgii
    Beliaeva, Aleksandra
    Sharaev, Maxim
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT II, 2024, 15002 : 281 - 290
  • [7] Decoding pain from brain activity
    Chen, Zhe Sage
    JOURNAL OF NEURAL ENGINEERING, 2021, 18 (05)
  • [8] A dual-channel language decoding from brain activity with progressive transfer training
    Huang, Wei
    Yan, Hongmei
    Cheng, Kaiwen
    Wang, Yuting
    Wang, Chong
    Li, Jiyi
    Li, Chen
    Li, Chaorong
    Zuo, Zhentao
    Chen, Huafu
    HUMAN BRAIN MAPPING, 2021, 42 (15) : 5089 - 5100
  • [9] Sine-wave speech recognition in a tonal language
    Feng, Yan-Mei
    Xu, Li
    Zhou, Ning
    Yang, Guang
    Yin, Shan-Kai
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (02): : EL133 - EL138
  • [10] Assessment of tracheoesophageal speech in a tonal language - A prospective study
    Wong, SHW
    Cheung, CCH
    Yuen, APW
    Ho, WK
    Ignace, W
    ARCHIVES OF OTOLARYNGOLOGY-HEAD & NECK SURGERY, 1997, 123 (01) : 88 - 92