In this paper the idea of using the ARMA representation of speech for synthesis and recognition purposes is studied in some details. It is pointed out that the methods for tracking the formant frequencies may be improved within the ARMA approach, especially the method of formant - frequency extraction by moment calculations. Also a new distance measure based on the AR part of the ARMA model is proposed for speech recognition purposes.