Speaker age classification and regression using i-vectors

被引：35

作者：

Grzybowska, Joanna ^{[1
]}

Kacprzak, Stanislaw ^{[1
]}

机构：

[1] AGH Univ Sci & Technol, Krakow, Poland

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

关键词：

speaker age recognition; regression; classification; computational paralinguistics; RECOGNITION;

D O I：

10.21437/Interspeech.2016-1118

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we examine the use of i-vectors both for age regression as well as for age classification. Although i-vectors have been previously used for age regression task, we extend this approach by applying fusion of i-vectors and acoustic features regression to estimate the speaker age. By our fusion we obtain a relative improvement of 12.6% comparing to solely i-vector system. We also use i-vectors for age classification, which to our knowledge is the first attempt to do so. Our best results reach unweighted accuracy 62.9%, which is a relative improvement of 16.7% comparing to the best results obtained in age classification task at Age Sub-Challenge at Interspeech 2010.

引用

页码：1402 / 1406

页数：5

共 21 条

[1]

[Anonymous], 2006, LINGUISTICS PHONETIC

[2]

[Anonymous], 2010, P INTERSPEECH

[3]

[Anonymous], 2010, P 11 ANN C INT SPEEC

[4]

[Anonymous], 2010, P LANG RES C LREC

[5] Speaker age estimation using i-vectors [J].

Bahari, Mohamad Hasan ;

McLaren, Mitchell ;

Hugo Van Hamme ;

van Leeuwen, David A. .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 34 :99-108

[6] i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition [J].

Behravan, Hamid ;

Hautamaki, Ville ;

Siniscalchi, Sabato Marco ;

Kinnunen, Tomi ;

Lee, Chin-Hui .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) :29-41

[7]

Cole R., 1994, LINGUISTIC DATA CONS

[8] Front-End Factor Analysis for Speaker Verification [J].

Dehak, Najim ;

Kenny, Patrick J. ;

Dehak, Reda ;

Dumouchel, Pierre ;

Ouellet, Pierre .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04) :788-798

[9] Supervector Dimension Reduction for Efficient Speaker Age Estimation Based on the Acoustic Speech Signal [J].

Dobry, Gil ;

Hecht, Ron M. ;

Avigal, Mireille ;

Zigel, Yaniv .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07) :1975-1985

[10]

Eyben F., 2009, AFF COMP INT INT WOR, P1

← 1 2 3 →