SPEECH PROCESSING USING GROUP DELAY FUNCTIONS

Cited by: 33
Authors
MURTHY, HA
YEGNANARAYANA, B
Affiliation
[1] Department of Computer Science and Engineering, Indian Institute of Technology, Madras
Keywords
FOURIER TRANSFORM PHASE; GROUP DELAY FUNCTIONS; SPEECH PROCESSING; FORMANTS;
DOI
10.1016/0165-1684(91)90014-A
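Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract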
In this paper we demonstrate the feasibility of processing the Fourier transform (FT) phase of a speech signal to derive the smooth log magnitude spectrum corresponding to the vocal tract system. We exploit the additive property of the group delay function (negative derivative of the FT phase) to process the FT phase. We show that the rapid fluctuations in the log magnitude spectrum and the group delay function are caused by the zeroes of the z-transform of the excitation components of the speech signal. Zeroes close to the unit circle in the z-plane produce large amplitude spikes in the group delay function and mask the group delay information corresponding to the vocal tract system. We propose a technique to extract the vocal tract system component of the group delay function by using the spectral properties of the excitation signal.
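The two properties the abstract relies on, additivity of the group delay function and the spikes produced by zeros close to the unit circle, can be illustrated with a minimal sketch. It is not the technique proposed in the paper. It assumes the standard identity tau(w) = Re(Y(w) / X(w)), where X is the Fourier transform of x[n] and Y is the Fourier transform of n * x[n], which avoids phase unwrapping. The names `dft`, `group_delay`, and `convolve`, and the toy sequences `system` and `excitation`, are hypothetical and chosen only for illustration.

```python
import cmath


def dft(x, n_fft):
    """Direct DFT of x, implicitly zero-padded to n_fft points."""
    return [
        sum(v * cmath.exp(-2j * cmath.pi * k * n / n_fft) for n, v in enumerate(x))
        for k in range(n_fft)
    ]


def group_delay(x, n_fft=64, eps=1e-12):
    """Group delay in samples at n_fft uniformly spaced frequencies.

    Computes tau = (X_R * Y_R + X_I * Y_I) / |X|^2 with Y = DFT{n * x[n]}.
    """
    X = dft(x, n_fft)
    Y = dft([n * v for n, v in enumerate(x)], n_fft)
    tau = []
    for xk, yk in zip(X, Y):
        power = abs(xk) ** 2
        # eps only guards against division by zero; it does no smoothing.
        tau.append((xk.real * yk.real + xk.imag * yk.imag) / max(power, eps))
    return tau


def convolve(a, b):
    """Linear convolution of two finite sequences."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, av in enumerate(a):
        for j, bv in enumerate(b):
            out[i + j] += av * bv
    return out


# Toy stand-ins (assumptions, not data from the paper):
# a smooth "system" component and an "excitation" component whose
# zeros (z^4 = -0.98) lie very close to the unit circle.
system = [1.0, 0.5]
excitation = [1.0, 0.0, 0.0, 0.0, 0.98]
signal = convolve(system, excitation)

tau_sys = group_delay(system)
tau_exc = group_delay(excitation)
tau_sig = group_delay(signal)

# Additivity: convolution in time adds group delays in frequency.
max_err = max(abs(s - (a + b)) for s, a, b in zip(tau_sig, tau_sys, tau_exc))
print("max additivity error:", max_err)
print("largest |tau| of system:    ", max(abs(t) for t in tau_sys))
print("largest |tau| of excitation:", max(abs(t) for t in tau_exc))
```

In this toy case the excitation term dominates the group delay of the combined signal at the frequencies of its zeros, which is the masking effect the abstract describes.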
Pages: 259-267
Number of pages: 9