Acoustic echo cancellation using adaptive filtering algorithms for Quranic accents (Qiraat) identification

被引：2

作者：

Kamarudin N. ^{[1
]}

Al-Haddad S.A.R. ^{[1
]}

Abushariah M.A.M. ^{[2
]}

Hashim S.J. ^{[1
]}

Hassan A.R.B. ^{[1
]}

机构：

[1] Department of Computer Engineering and Communication, Faculty of Engineering, Universiti Putra Malaysia, Serdang

[2] Department of Computer Information Systems, King Abdullah II School for Information Technology, The University of Jordan, Amman

来源：

International Journal of Speech Technology | 2016年 / 19卷 / 02期

关键词：

Acoustic echo cancellation; Adaptive filtering; Affine projection; Gaussian mixture model; K-nearest neighbor; Least mean squares; Probabilities principal component analysis; Quranic accents (Qiraat); Recursive least squares;

D O I：

10.1007/s10772-015-9319-z

中图分类号：

学科分类号：

摘要：

Echoed parts of Quranic accent (Qiraat) signals are exposed to reverberation of signals especially if they are listened to in a conference room or the Quranic recordings found in different media such as the web. Quranic verse rules identification/Tajweed are prone to additive noise and may reduce classification results. This research work aims to present our work towards Quranic accents (Qiraat) identification, which emphasizes on acoustic echo cancellation (AEC) of all echoed Quranic signals during the preprocessing phase of the system development. In order to conduct the AEC, three adaptive algorithms known as affine projection (AP), least mean square (LMS), and recursive least squares (RLS) are used during the preprocessing phase. Once clean Quranic signals are produced, they undergo feature extraction and pattern classification phases. The Mel Frequency Cepstral Coefficients is the most widely used technique for feature extraction and is adopted in this research work, whereas probabilities principal component analysis (PPCA), K-nearest neighbor (KNN) and gaussian mixture model (GMM) are used for pattern classification. In order to verify our methodology, audio files have been collected for Surat Ad-Duhaa for five different Quranic accents (Qiraat), namely: (1) Ad-Duri, (2) Al-Kisaie, (3) Hafs an A’asem, (4) IbnWardan, and (5) Warsh. Based on our experimental results, the AP algorithm achieved 93.9 % accuracy rate against all pattern classification techniques including PPCA, KNN, and GMM. For LMS and RLS, the achieved accuracy rates are different for PPCA, KNN, and GMM, whereby LMS with PPCA and GMM achieved the same accuracy rate of 96.9 %; however, LMS with KNN achieved 84.8 %. In addition, RLS with PPCA and GMM achieved the same accuracy rate of 90.9 %; however, RLS with KNN achieved 78.8 %. Therefore, the AP adaptive algorithm is able to reduce the echo of Quranic accents (Qiraat) signals in a consistent manner against all pattern classification techniques. © 2015, Springer Science+Business Media New York.

引用

页码：393 / 405

页数：12

共 47 条

[1] Abushariah M.A.M., A vector quantization approach to isolated-word automatic speech recognition, Master Dissertation, (2006)
[2] Adapa N.S., Bollu S., Performance analysis of different adaptive algorithms based on acoustic echo cancellation. Master Thesis, Blekinge Institute of Technology, 371 79 Karlskrona Sweden, (2012)
[3] Affandi A., Dobaie A.M., Husain M., Digital Filters Design using Matlab with Graphical User Interface (GUI), Life Science Journal, 11, 5, pp. 336-348, (2014)
[4] Al-Haddad S.A.R., Samad S.A., Hussain A., Ishak K.A., Noor A.O.A., Robust speech recognition using fusion techniques and adaptive filtering, American Journal of Applied Sciences, 6, 2, pp. 290-295, (2009)
[5] AnamulHaque M., Kamrul Islam A.K.M., Imdadul Islam M., Demystifying the digital adaptive filters conducts in acoustic echo cancellation, Journal of Multimedia, 5, 6, pp. 568-579, (2010)
[6] Ari C., Aksoy S., Arikan O., Maximum Likelihood Estimation of Gaussian Mixture Models Using Stochastic Search, Journal Pattern Recognition, 45, 7, pp. 2804-2816, (2012)
[7] Attarian A., Danis G., Gronsbell J., Iervolino G., Tran H., A Comparison of feature selection and classification algorithms in identifying baseball pitches. In Proceedings of the International MultiConference of Engineers and Computer Scientists (IMECS’2013), Hong Kong, 1, pp. 263-268, (2013)
[8] Balen J.V., Automatic Recognition of Samples in Musical Audio, Master Thesis, (2011)
[9] Chetouani M., Gas B., Zarader J.L., Chavy C., Neural predictive coding for speech discriminant feature extraction: The DFE-NPC. In Proceedings of European Symposium on Artificial Neural Networks (ESANN’2002), Bruges, Belgium, pp. 275-280, (2002)
[10] De Sena E., Antonello N., Moonen M., van Waterschoot T., On the modeling of rectangular geometries in room acoustic simulations, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23, 4, pp. 774-786, (2015)

← 1 2 3 4 5 →