IMAGE AND AUDIO-SPEECH DENOISING BASED ON HIGHER-ORDER STATISTICAL MODELING OF WAVELET COEFFICIENTS AND LOCAL VARIANCE ESTIMATION

被引:14
作者
Kittisuwan, Pichid [1 ]
Chanwimaluan, Thitiporn
Marukatat, Sanparith
Asdornwised, Widhyakorn [1 ]
机构
[1] Chulalongkorn Univ, Fac Engn, Dept Elect Engn, Bangkok 10330, Thailand
关键词
Pearson Type VII random vectors; image denoising; wavelet transforms; BIVARIATE SHRINKAGE; RANDOM VECTORS; TRANSFORM;
D O I
10.1142/S0219691310003808
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
At first, this paper is concerned with wavelet-based image denoising using Bayesian technique. In conventional denoising process, the parameters of probability density function (PDF) are usually calculated from the first few moments, mean and variance. In the first part of our work, a new image denoising algorithm based on Pearson Type VII random vectors is proposed. This PDF is used because it allows higher-order moments to be incorporated into the noiseless wavelet coefficients' probabilistic model. One of the cruxes of the Bayesian image denoising algorithms is to estimate the variance of the clean image. Here, maximum a posterior (MAP) approach is employed for not only noiseless wavelet-coefficient estimation but also local observed variance acquisition. For the local observed variance estimation, the selection of noisy wavelet-coefficient model, either a Laplacian or a Gaussian distribution, is based upon the corrupted noise power where Gamma distribution is used as a prior for the variance. Evidently, our selection of prior is motivated by analytical and computational tractability. In our experiments, our proposed method gives promising denoising results with moderate complexity. Eventually, our image denoising method can be simply extended to audio/speech processing by forming matrix representation whose rows are formed by time segments of digital speech waveforms. This way, the use of our image denoising methods can be exploited to improve the performance of various audio/speech tasks, e.g., denoised enhancement of voice activity detection to capture voiced speech, significantly needed for speech coding and voice conversion applications. Moreover, one of the voice abnormality detections, called oropharyngeal dysphagia classification, is also required denoising method to improve the signal quality in elderly patients. We provide simple speech examples to demonstrate the prospects of our techniques.
引用
收藏
页码:987 / 1017
页数:31
相关论文
共 34 条
  • [1] Spatially adaptive wavelet thresholding with context modeling for image denoising
    Chang, SG
    Yu, B
    Vetterli, M
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (09) : 1522 - 1531
  • [2] Improved voice activity detection algorithm using wavelet and support vector machine
    Chen, Shi-Huang
    Guido, Rodrigo Capobianco
    Truong, Trieu-Kien
    Chang, Yaotsu
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (03) : 531 - 543
  • [3] MAXIMUM LIKELIHOOD ESTIMATION OF PARAMETERS OF GAMMA DISTRIBUTION AND THEIR BIAS
    CHOI, SC
    WETTE, R
    [J]. TECHNOMETRICS, 1969, 11 (04) : 683 - &
  • [4] Image denoising by sparse 3-D transform-domain collaborative filtering
    Dabov, Kostadin
    Foi, Alessandro
    Katkovnik, Vladimir
    Egiazarian, Karen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (08) : 2080 - 2095
  • [5] AN IMPROVED WAVELET DOMAIN DIGITAL WATERMARKING FOR IMAGE PROTECTION
    Dejey
    Rajesh, R. S.
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2010, 8 (01) : 19 - 31
  • [6] DE-NOISING BY SOFT-THRESHOLDING
    DONOHO, DL
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (03) : 613 - 627
  • [7] IDEAL SPATIAL ADAPTATION BY WAVELET SHRINKAGE
    DONOHO, DL
    JOHNSTONE, IM
    [J]. BIOMETRIKA, 1994, 81 (03) : 425 - 455
  • [8] *FESTV, 2007, CMU ARCTIC SPEECH SY
  • [9] FIGUEIREDO MT, 1999, SPIE INT C MATH MOD, V38, P97
  • [10] Wavelet time-frequency analysis and least squares support vector machines for the identification of voice disorders
    Fonseca, Everthon Silva
    Guido, Rodrigo Capobianco
    Scalassara, Paulo Rogerio
    Maciel, Carlos Dias
    Pereira, Jose Carlos
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (04) : 571 - 578