Fuzzy-based algorithm for Fongbe continuous speech segmentation

Cited by: 3

Authors:

Laleye, Frejus A. A. [1]

Ezin, Eugene C. [2]

Motamed, Cina [1]

Affiliations:

[1] Univ Littoral Cote d'Opale, Lab Informat Signal & Image Cote Opale, 50 Rue F Buisson, BP 719, F-62228 Calais, France

[2] Univ Abomey Calavi, Inst Math & Sci Phys, BP 719, Porto Novo, Benin

Source:

PATTERN ANALYSIS AND APPLICATIONS | 2017 / Vol. 20 / Issue 03

Keywords:

Speech segmentation; Nonlinear speech analysis; Time-domain features; Fuzzy logic; Fongbe language;

DOI:

10.1007/s10044-016-0591-6

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Text-independent speech segmentation is a challenging topic in computer-based speech recognition systems. This paper proposes a novel time-domain algorithm, based on fuzzy knowledge, for the continuous speech segmentation task using nonlinear speech analysis. Short-term energy, zero-crossing rate and the singularity exponents are the time-domain features that we calculate at each point of the speech signal in order to extract the information needed to generate the significant segments. This is done to identify phonemes or syllables and the transition fronts. A fuzzy logic technique is used to fuzzify the calculated features into three complementary sets, namely low, medium and high, and to perform a matching phase using a set of fuzzy rules. The outputs of the proposed algorithm are silence, phonemes, or syllables. In evaluation, our algorithm achieved the best performance, with efficient results, on the Fongbe language (an African tonal language spoken especially in Benin, Togo and Nigeria).
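The feature extraction and fuzzification steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the frame-based computation, the normalization bounds `lo` and `hi`, and the piecewise-linear membership shapes are all assumptions made for illustration, and "complementary" is read here as "the three membership degrees sum to 1". The singularity exponents and the fuzzy rule base are not covered.

```python
import math


def frame_signal(samples, frame_len, hop):
    """Split a sequence of samples into frames of frame_len, advancing by hop."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_term_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def fuzzify(value, lo, hi):
    """Map a feature value to low/medium/high membership degrees.

    Assumed shapes (hypothetical, not taken from the paper): the value is
    normalized to [0, 1] over [lo, hi]; 'low' falls linearly to 0 at the
    midpoint, 'high' rises linearly from the midpoint, and 'medium' takes
    the remainder so that the three degrees sum to 1.
    """
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    t = min(max((value - lo) / (hi - lo), 0.0), 1.0)
    low = max(0.0, 1.0 - 2.0 * t)
    high = max(0.0, 2.0 * t - 1.0)
    medium = 1.0 - low - high
    return {"low": low, "medium": medium, "high": high}


if __name__ == "__main__":
    # Synthetic signal: 100 samples of silence, then 100 samples of a sine tone.
    signal = [0.0] * 100 + [math.sin(2 * math.pi * n / 20) for n in range(100)]
    for frame in frame_signal(signal, frame_len=50, hop=50):
        energy = short_term_energy(frame)
        zcr = zero_crossing_rate(frame)
        print(fuzzify(energy, 0.0, 0.5), fuzzify(zcr, 0.0, 1.0))
```

In a full system along the lines the abstract describes, membership degrees such as these would feed a set of fuzzy rules that decide between silence, phoneme and syllable segments; those rules are not reproduced in this record.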


Pages: 855-864

Page count: 10

