Feature selection methods and their combinations in high-dimensional classification of speaker likability, intelligibility and personality traits

被引:145
作者
Pohjalainen, Jouni [1 ]
Rasanen, Okko [1 ]
Kadioglu, Serdar [2 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
[2] Oracle Amer Inc, Burlington, MA 01803 USA
基金
芬兰科学院;
关键词
Feature selection; Pattern recognition; Machine learning; Computational paralinguistics; SPEECH;
D O I
10.1016/j.csl.2013.11.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study focuses on feature selection in paralinguistic analysis and presents recently developed supervised and unsupervised methods for feature subset selection and feature ranking. Using the standard k-nearest-neighbors (kNN) rule as the classification algorithm, the feature selection methods are evaluated individually and in different combinations in seven paralinguistic speaker trait classification tasks. In each analyzed data set, the overall number of features highly exceeds the number of data points available for training and evaluation, making a well-generalizing feature selection process extremely difficult. The performance of feature sets on the feature selection data is observed to be a poor indicator of their performance on unseen data. The studied feature selection methods clearly outperform a standard greedy hill-climbing selection algorithm by being more robust against overfitting. When the selection methods are suitably combined with each other, the performance in the classification task can be further improved. In general, it is shown that the use of automatic feature selection in paralinguistic analysis can be used to reduce the overall number of features to a fraction of the original feature set size while still achieving a comparable or even better performance than baseline support vector machine or random forest classifiers using the full feature set. The most typically selected features for recognition of speaker likability, intelligibility and five personality traits are also reported. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:145 / 171
页数:27
相关论文
共 53 条
[1]  
[Anonymous], IBM CPLEX REF MAN US
[2]  
[Anonymous], 2001, Pattern Classification
[3]   Whodunnit - Searching for the most important feature types signalling emotion-related user states in speech [J].
Batliner, Anton ;
Steidl, Stefan ;
Schuller, Bjoern ;
Seppi, Dino ;
Vogt, Thurid ;
Wagner, Johannes ;
Devillers, Laurence ;
Vidrascu, Laurence ;
Aharonson, Vered ;
Kessous, Loic ;
Amir, Noam .
COMPUTER SPEECH AND LANGUAGE, 2011, 25 (01) :4-28
[4]   OR-LIBRARY - DISTRIBUTING TEST PROBLEMS BY ELECTRONIC MAIL [J].
BEASLEY, JE .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1990, 41 (11) :1069-1072
[5]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[6]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Burkhardt F, 2010, LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1562
[9]  
Burkhardt F, 2011, 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, P1568
[10]   Algorithms for railway crew management [J].
Caprara, A ;
Fischetti, M ;
Toth, P ;
Vigo, D ;
Guida, PL .
MATHEMATICAL PROGRAMMING, 1997, 79 (1-3) :125-141