ATTITUDE RECOGNITION USING MULTI-RESOLUTION COCHLEAGRAM FEATURES

被引:0
|
作者
Haider, Fasih [1 ]
Luz, Saturnino [1 ]
机构
[1] Univ Edinburgh, Edinburgh Med Sch, Usher Inst Populat Hlth Sci & Informat, Edinburgh, Midlothian, Scotland
关键词
Feature Engineering; Attitude Recognition; Affect Recognition; Multi-Resolution Cochleagram; Video Blogs;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Attitudes play an important role in human communication. Models and algorithms for automatic recognition of attitudes therefore may have applications in areas where successful communication and interaction are crucial, such as health-care, education and digital entertainment. This paper focuses on the task of categorizing speaker attitudes using speech features. Data extracted from video recordings are employed in training and testing of predictive models consisting of different sets of speech features. A novel attitude recognition approach using Multi-Resolution Cochleagram (MRCG) features is proposed. The results show that MRCG feature set outperforms the feature sets most commonly used in computational paralinguistic tasks, including emobase, eGeMAPS and ComParE, in terms of attitude recognition accuracy for decision tree, 1-nearest neighbour and random forest classifiers. Analysis of the results suggests that MRCG features contribute information not captured by these existing feature sets. Indeed, while the ComParE feature set provides slightly better results than MRCG features for support vector machine classifiers, the fusion of the existing feature sets with the new MRCG features improves on those results. Overall, with the addition of MRCG, the attitude recognition method proposed in this study achieves accuracy scores approximately 11 points higher than reported in previous studies.
引用
收藏
页码:3737 / 3741
页数:5
相关论文
共 50 条
  • [21] Iris recognition basing on multi-resolution analysis
    Pan, L. L.
    Xie, M.
    2006 1ST IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-3, 2006, : 1236 - +
  • [22] Iris recognition basing on multi-resolution analysis
    Pan, L. L.
    Xie, M.
    ICIEA 2006: 1ST IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-3, PROCEEDINGS, 2006, : 392 - 396
  • [23] Face Recognition Through Multi-Resolution Images
    Mliki, Hazar
    Fendri, Emna
    Chebil, Ahmed
    INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2022, 10 (01)
  • [24] Multi-resolution dictionary learning for face recognition
    Luo, Xiaoling
    Xu, Yong
    Yang, Jian
    PATTERN RECOGNITION, 2019, 93 : 283 - 292
  • [25] Multi-resolution phonetic/segmental features and models for HMM-based speech recognition
    Vaseghi, S
    Harte, N
    Milner, B
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1263 - 1266
  • [26] Multi-resolution cepstral features for phoneme recognition across speech sub-bands
    McCourt, P
    Vaseghi, S
    Harte, N
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 557 - 560
  • [27] Early author profiling on Twitter using profile features with multi-resolution
    Pastor Lopez-Monroy, A.
    Gonzalez, Fabio A.
    Solorio, Thamar
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
  • [28] Multi-resolution feature fusion for face recognition
    Pong, Kuong-Hon
    Lam, Kin-Man
    PATTERN RECOGNITION, 2014, 47 (02) : 556 - 567
  • [29] Fast extraction of multi-resolution Gabor features
    Ilonen, Jarmo
    Kamarainen, Joni-Kristian
    Kalviainen, Heikki
    14TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2007, : 481 - +
  • [30] Scene classification using a multi-resolution bag-of-features model
    Zhou, Li
    Zhou, Zongtan
    Hu, Dewen
    PATTERN RECOGNITION, 2013, 46 (01) : 424 - 433