Application of spectral subtraction method on enhancement of electrolarynx speech

被引：14

作者：

Liu, Hanjun ^{[1
]}

Zhao, Qin ^{[1
]}

Wan, Mingxi ^{[1
]}

Wang, Supin ^{[1
]}

机构：

[1] Xian Jiaotong Univ, Key Lab Biomed Informat Engn, Minist Educ, Dept Biomed Engn,Sch Life Sci & Technol, Xian 710049, Peoples R China

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2006年 / 120卷 / 01期

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1121/1.2203592

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Although electrolarynx (EL) serves as an important method of phonation for the laryngectomees, the resulting speech is of poor intelligibility due to the presence of a steady background noise caused by the instrument, even worse in the case of additive noise. This paper investigates the problem of EL speech enhancement by taking into account the frequency-domain masking properties of the human auditory system. One approach is incorporating an auditory masking threshold (AMT) for parametric adaptation in a subtractive-type enhancement process. The other is the supplementary AMT (SAMT) algorithm, which applies a cross-correlation spectral subtraction (CCSS) approach as a post-processing scheme to enhancing EL speech dealt with the AMT method. The performance of these two algorithms was evaluated as compared to the power spectral subtraction (PSS) algorithm. The best performance of EL speech enhancement was associated with the SAMT algorithm, followed by the AMT algorithm and the PSS algorithm. Acoustic and perceptual analyses indicated that the AMT and SAMT algorithms achieved the better performances of noise reduction and the enhanced EL speech was more pleasant to human listeners as compared to the PSS algorithm. (c) 2006 Acoustical Society of America.

引用

页码：398 / 406

页数：9

共 35 条

[1] Evaluation of an auditory masked threshold noise suppression algorithm in normal-hearing and hearing-impaired listeners
Arehart, KH
Hansen, JHL
Gallant, S
Kalstein, L
[J]. SPEECH COMMUNICATION, 2003, 40 (04) : 575 - 592
[2] AN EXPERIMENTAL TRANSISTORIZED ARTIFICIAL LARYNX
BARNEY, HL
HAWORTH, FE
DUNN, HK
[J]. BELL SYSTEM TECHNICAL JOURNAL, 1959, 38 (06): : 1337 - 1356
[3] Berouti M., 1979, ICASSP 79. 1979 IEEE International Conference on Acoustics, Speech and Signal Processing, P208
[4] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
BOLL, SF
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
[5] BRANDENBURG K, 1994, J AUDIO ENG SOC, V42, P780
[6] Noise estimation by minima controlled recursive averaging for robust speech enhancement
Cohen, I
Berdugo, B
[J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
[7] Cole D, 1997, TENCON IEEE REGION, P491, DOI 10.1109/TENCON.1997.648252
[8] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
EPHRAIM, Y
MALAH, D
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
[9] Enhancement of electrolaryngeal speech by adaptive filtering
Espy-Wilson, CY
Chari, VR
MacAuslan, JM
Huang, CB
Walsh, MJ
[J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 1998, 41 (06): : 1253 - 1264
[10] EspyWilson CY, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P764, DOI 10.1109/ICSLP.1996.607475

← 1 2 3 4 →