Signal processing representations of speech

被引:0
|
作者
Kleijn, W. Bastiaan [1 ]
机构
[1] Dept. of Signals, KTH (Royal Institute of Technology), Stockholm, Sweden
关键词
Convolution - Pattern recognition - Regression analysis - Signal encoding - Speech coding - Speech recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Synergies in processing requirements and knowledge of human speech production and perception have led to a similarity of the speech signal representations used for the tasks of recognition, coding, and modification. The representations are generally composed of a description of the vocal-tract transfer function and, in the case of coding and modification, a description of the excitation signal. This paper provides an overview of commonly used representations. For coding and modification, autoregressive models represented by line spectral frequencies perform well for the vocal tract, and pitch-synchronous filter banks and modulation-domain filters perform well for the excitation. For recognition, good representations are based on a smoothed magnitude response of the vocal tract.
引用
收藏
页码:359 / 376
相关论文
共 50 条
  • [41] Improving the Efficiency of Noise Resistance Processing of Speech Signal
    Abdulkhairov, Maulenbek T.
    Altay, Yeldos A.
    Zhumasheva, Zhadyra T.
    PROCEEDINGS OF THE 2017 IEEE RUSSIA SECTION YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING CONFERENCE (2017 ELCONRUS), 2017, : 618 - 620
  • [42] DIGITAL SIGNAL PROCESSOR (DSP) FOR TELECOMMUNICATIONS AND SPEECH PROCESSING
    HAGIWARA, Y
    ICHIKAWA, A
    SHIRASU, H
    JAPAN ANNUAL REVIEWS IN ELECTRONICS COMPUTERS & TELECOMMUNICATIONS, 1985, 20 : 355 - 363
  • [43] PROCESSING THE TELEPHONE SPEECH SIGNAL FOR THE HEARING-IMPAIRED
    TERRY, M
    BRIGHT, K
    DURIAN, M
    KEPLER, L
    SWEETMAN, R
    GRIM, M
    EAR AND HEARING, 1992, 13 (02): : 70 - 79
  • [44] SpeechLab: PC software for digital speech signal processing
    Eugen Diesch
    Behavior Research Methods, Instruments, & Computers, 1997, 29 : 302 - 302
  • [45] Optimal pitch bases expansions in speech signal processing
    Nickel, RM
    Oswal, SP
    CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 1885 - 1889
  • [46] Adaptive Signal Processing Method for Speech Organ Diagnostics
    A. Yu. Tychkov
    A. K. Alimuradov
    P. P. Churakov
    Measurement Techniques, 2016, 59 : 485 - 490
  • [47] SIGNAL PROCESSING TO IMPROVE SPEECH INTELLIGIBILITY IN PERCEPTIVE DEAFNESS
    VILLCHUR, E
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (06): : 1646 - 1657
  • [48] New orthogonal polynomials for speech signal and image processing
    Jassim, W. A.
    Raveendran, P.
    Mukundan, R.
    IET SIGNAL PROCESSING, 2012, 6 (08) : 713 - 723
  • [50] Wavelet network with OLS optimization for speech signal processing
    Chen, F
    Shi, DM
    Ng, GS
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2004, 3060 : 406 - 415