Handling high dimensional features by ensemble learning for emotion identification from speech signal

被引：0

作者：

Ashok Kumar, Konduru ^{[1
]}

Iqbal, J. L. Mazher ^{[2
]}

机构：

[1] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, Chennai, Tamil Nadu, India

[2] Veltech Rangarajan Dr Sagunthala R&D, ECE, Inst Sci & Technol, Chennai, Tamil Nadu, India

来源：

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2021年 / 25卷 / 4期

基金：

英国科研创新办公室;

关键词：

Distribution diversity measures; Ensemble learning; Speech technology; Emotion prediction; Acoustic features; Machine learning (ML); RECOGNITION; DIAGNOSIS; IMPROVE;

D O I：

10.1007/s10772-021-09916-x

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In the recent past, handling the curse of dimensionality observed in acoustic features of the speech signal in machine learning-based emotion detection has been considered a crucial objective. The contemporary emotion prediction methods are experiencing false alarming due to the high dimensionality of the features used in training phase of the machine learning models. The majority of the contemporary models have endeavored to handle the curse of high dimensionality of the training corpus. However, the contemporary models are focusing more on using fusion of multiple classifiers, which is barely improvising the decision accuracy, if the volume of the training corpus is high. The contribution of this manuscript endeavored to portray a novel ensemble model that using fusion of diversity measures to suggest the optimal features. Moreover, the proposed method attempts to reduce the impact of the high dimensionality in feature values by using a novel clustering process. The experimental study signifies the proposed method performance in term of emotion prediction from speech signals and compared to contemporary models of emotion detection using machine learning. The fourfold cross-validation of standard data corpus has used in performance analysis.

引用

页码：837 / 851

页数：15

共 47 条

[1] Recognizing Emotion from Speech Based on Age and Gender Using Hierarchical Models [J].

Abu Shaqra, Ftoon ;

Duwairi, Rehab ;

Al-Ayyoub, Mahmoud .

10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS, 2019, 151 :37-44

[2] New approach in quantification of emotional intensity from the speech signal: emotional temperature [J].

Alonso, Jesus B. ;

Cabrera, Josue ;

Medina, Manuel ;

Travieso, Carlos M. .

EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (24) :9554-9564

[3]

[Anonymous], 2002, 7 INT C SPOK LANG PR

[4] Effect of Nano Fillers on Mechanical Properties of Luffa Fiber Epoxy Composites [J].

Ashok, K. G. ;

Kalaichelvan, K. ;

Damodaran, Ajith .

JOURNAL OF NATURAL FIBERS, 2022, 19 (04) :1472-1489

[5]

Basu S, 2017, PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), P109, DOI 10.1109/ICICCT.2017.7975169

[6] Bagged support vector machines for emotion recognition from speech [J].

Bhavan, Anjali ;

Chauhan, Pankaj ;

Hitkul ;

Shah, Rajiv Ratn .

KNOWLEDGE-BASED SYSTEMS, 2019, 184

[7]

Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1007/BF00058655

[8]

Budak H., 2016, Anadolu University Journal of Science and Technology A - Applied Sciences and Engineering, V17, P845, DOI [10.18038/aubtda.279853, DOI 10.18038/AUBTDA.279853]

[9] Speaker-sensitive emotion recognition via ranking: Studies on acted and spontaneous speech [J].

Cao, Houwei ;

Verma, Ragini ;

Nenkova, Ani .

COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01) :186-202

[10]

Cong P, 2016, 2016 10 INT S CHIN S, P1

← 1 2 3 4 5 →