Pitch frequency is one of the most important voice characteristics, and its accurate extraction is important not only in speech analysis and synthesis, but also in speech coding, speech recognition, speaker recognition, and the like. Existing methods of improving extraction accuracy include waveform processing, correlative processing, and spectral processing. This paper describes the use of a neural network to extract pitch from voice features delivered from the bandpass filter pairs (BPFPs) proposed by Fonda et al. Three types of multi-layered neutral networks able to learn time-continuity and high accuracy discrimination functions and have st recurrent structure are tested. The cross-coupling multi-layered neural network with feedback architecture gives the best improvement over conventional neural networks, and exhibits superior ability for learning time continuity of pitch and UN information. (C) 1997 Scripta Technica, Inc.