Pitch extraction and voiced/unvoiced detection of speech by cross-coupling multi-layered neural network with feedback architecture

Cited by: 0

Authors:

Miyabayashi, H [1]

Funada, T [1]

Affiliations:

[1] KANAZAWA UNIV, FAC ENGN, KANAZAWA, ISHIKAWA 920, JAPAN

Source:

ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE | 1997 / Vol. 80 / No. 9

Keywords:

speech detection; pitch extraction; multilayer neural network; feedback architecture; cross-coupling neural network;

DOI:

10.1002/(SICI)1520-6440(199709)80:9<48::AID-ECJC6>3.0.CO;2-W

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

Pitch frequency is one of the most important voice characteristics, and its accurate extraction is important not only in speech analysis and synthesis but also in speech coding, speech recognition, speaker recognition, and the like. Existing methods of improving extraction accuracy include waveform processing, correlation processing, and spectral processing. This paper describes the use of a neural network to extract pitch from voice features delivered by the bandpass filter pairs (BPFPs) proposed by Funada et al. Three types of multi-layered neural networks that have a recurrent structure and are able to learn time continuity and high-accuracy discrimination functions are tested. The cross-coupling multi-layered neural network with feedback architecture gives the best improvement over conventional neural networks and exhibits a superior ability to learn the time continuity of pitch and voiced/unvoiced (V/UV) information. (C) 1997 Scripta Technica, Inc.

Pages: 48-58

Number of pages: 11