Psychoacoustical Approach to Sinusoidal Modeling of Speech

被引：0

作者：

Nagy, Martin Turi ^{[1
]}

Minarik, Ivan ^{[1
]}

机构：

[1] Slovak Univ Technol Bratislava, Dept Telecommun, Ilkovicova 3, Bratislava 81219, Slovakia

来源：

53RD INTERNATIONAL SYMPOSIUM ELMAR-2011 | 2011年

关键词：

Psychoacoustics; sinusoidal modeling; SN model; speech synthesis;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes the psychoacoustical approach to the sinusoidal analysis/synthesis of the speech. With the SN (sinusoids plus noise) model, the periodic part of the speech signal is described by time-varying sinusoids and the non-periodic part by noise. The number of sinusoids needed for reconstruction can be then reduced by application of the psychoacoustical principles. The proposed compression scheme allows us to reduce the amount of physical data needed to store the parameters. The whole method can be used in speech synthesis for storing speech corpuses that are prepared for easy prosodic manipulation.

引用

页码：217 / 220

页数：4

共 50 条

[1] Rate-distortion optimal sinusoidal modeling of audio and speech using psychoacoustical matching pursuits
Heusdens, R
van de Par, S
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1809 - 1812
[2] Sinusoidal modeling and modification of unvoiced speech
Macon, MW
Clements, MA
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (06): : 557 - 560
[3] SPEECH AS TEMPORAL PATTERN - A PSYCHOACOUSTICAL PROFILE
LAUTER, JL
HIRSH, IJ
SPEECH COMMUNICATION, 1985, 4 (1-3) : 41 - 54
[4] FREQUENCY-VARYING SINUSOIDAL MODELING OF SPEECH
MARQUES, JS
ALMEIDA, LB
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (05): : 763 - 765
[5] Exponential sinusoidal modeling of transitional speech segments
Aalborg Univ, Aalborg, Denmark
ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (473-476):
[6] Exponential sinusoidal modeling of transitional speech segments
Jensen, J
Jensen, SH
Hansen, E
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 473 - 476
[7] Psychoacoustical enhancement of speech based on multitaper spectrum
School of Information Science and Engineering, Southeast University, Nanjing 210096, China
不详
Shengxue Xuebao, 2007, 3 (275-281):
[8] A perceptual subspace method for sinusoidal speech and audio modeling
Jensen, J
Heusdens, R
Jensen, SH
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 401 - 404
[9] Single Channel Speech Separation Based on Sinusoidal Modeling
Wiem, Belhedi
anouar, Ben messaoud Mohamed
Aicha, Bouzid
2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 672 - 676
[10] Separation of harmonic and speech signals using sinusoidal modeling
Jancovic, Peter
Kokuer, Munevver
2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 261 - 264

← 1 2 3 4 5 →