ElectrodeNet-A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants

被引：3

作者：

Huang, Enoch Hsin-Ho ^{[1
,2
]}

Chao, Rong ^{[2
,3
]}

Tsao, Yu ^{[2
,4
]}

Wu, Chao-Min ^{[1
]}

机构：

[1] Natl Cent Univ, Dept Elect Engn, Taoyuan 320317, Taiwan

[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115201, Taiwan

[3] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 701401, Taiwan

[4] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 320314, Taiwan

来源：

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS | 2024年 / 16卷 / 01期

关键词：

Channel selection (CS); cochlear implant (CI); deep learning; sound coding strategy; vocoder simulation; HEARING HEALTH-CARE; SPEECH-INTELLIGIBILITY; NEURAL-NETWORKS; PERCEPTION; RECOGNITION; MUSIC; NOISE; COMBINATION; PREDICTION; IMPROVE;

D O I：

10.1109/TCDS.2023.3275587

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

ElectrodeNet, a deep-learning-based sound coding strategy for the cochlear implant (CI), is proposed to emulate the advanced combination encoder (ACE) strategy by replacing the conventional envelope detection using various artificial neural networks. The extended ElectrodeNet-CS strategy further incorporates the channel selection (CS). Network models of deep neural network (DNN), convolutional neural network (CNN), and long short-term memory (LSTM) were trained using the fast Fourier transformed bins and channel envelopes obtained from the processing of clean speech by the ACE strategy. Objective speech understanding using short-time objective intelligibility (STOI) and normalized covariance metric (NCM) was estimated for ElectrodeNet using CI simulations. Sentence recognition tests for vocoded Mandarin speech were conducted with normal-hearing listeners. DNN, CNN, and LSTM-based ElectrodeNets exhibited strong correlations to ACE in objective and subjective scores using mean squared error (MSE), linear correlation coefficient (LCC), and Spearman's rank correlation coefficient (SRCC). The ElectrodeNet-CS strategy was capable of producing N-of-M compatible electrode patterns using a modified DNN network to embed maxima selection, and to perform in similar or even slightly higher average in STOI and sentence recognition compared to ACE. The methods and findings demonstrated the feasibility and potential of using deep learning in the CI coding strategy.

引用

页码：346 / 357

页数：12

共 89 条

[11] The multi-channel cochlear implant: Multi-disciplinary development of electrical stimulation of the cochlea and the resulting clinical benefit
Clark, Graeme M.
[J]. HEARING RESEARCH, 2015, 322 : 4 - 13
[12] Machine Learning and Cochlear Implantation-A Structured Review of Opportunities and Challenges
Crowson, Matthew G.
Lin, Vincent
Chen, Joseph M.
Chan, Timothy C. Y.
[J]. OTOLOGY & NEUROTOLOGY, 2020, 41 (01) : E36 - E45
[13] Using channel-specific statistical models to detect reverberation in cochlear implant stimuli
Desmond, Jill M.
Collins, Leslie M.
Throckmorton, Chandra S.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (02) : 1112 - 1120
[14] A comparison of the speech understanding provided by acoustic models of fixed-channel and channel-picking signal processors for cochlear implants
Dorman, MF
Loizou, PC
Spahr, AJ
Maloff, E
[J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2002, 45 (04): : 783 - 788
[15] Objective Quality and Intelligibility Prediction for Users of Assistive Listening Devices
Falk, Tiago H.
Parsa, Vijay
Santos, Joao F.
Arehart, Kathryn
Hazrati, Oldooz
Huber, Rainer
Kates, James M.
Scollie, Susan
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 114 - 124
[16] Speech Recognition in Cochlear Implant Recipients: Comparison of Standard HiRes and HiRes 120 Sound Processing
Firszt, Jill B.
Holden, Laura K.
Reeder, Ruth M.
Skinner, Margaret W.
[J]. OTOLOGY & NEUROTOLOGY, 2009, 30 (02) : 146 - 152
[17] One-pass single-channel noisy speech recognition using a combination of noisy and enhanced features
Fujimoto, Masakiyo
Kawai, Hisashi
[J]. INTERSPEECH 2019, 2019, : 486 - 490
[18] Deep learning models to remix music for cochlear implant users
Gajecki, Tom
Nogueira, Waldo
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (06) : 3602 - 3615
[19] Modeling Electrode Place Discrimination in Cochlear Implant Stimulation
Gao, Xiao
Grayden, David B.
McDonnell, Mark D.
[J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2017, 64 (09) : 2219 - 2229
[20] Garofolo J.S., 1993, Technical report NISTIR 4930 25

← 1 2 3 4 5 6 7 8 9 →