Music Genre Classification Using Spectral Analysis and Sparse Representation of the Signals

Cited by: 7

Authors:

Banitalebi-Dehkordi, Mehdi [1]

Banitalebi-Dehkordi, Amin [2]

Affiliations:

[1] Yazd Univ, Yazd, Iran

[2] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada

Source:

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2014 / Vol. 74 / No. 02

Keywords:

Feature extraction; Compressive sampling; Genre classification;

DOI:

10.1007/s11265-013-0797-4

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

In this paper, we propose a robust music genre classification method based on a sparse FFT-based feature extraction method, which combines the discriminating power of spectral analysis of non-stationary audio signals with the capability of sparse representation based classifiers. The feature extraction method combines two sets of features, namely short-term features (extracted from windowed signals) and long-term features (extracted from combinations of the extracted short-term features). Experimental results demonstrate that the proposed feature extraction method leads to a sparse representation of audio signals. As a result, a significant reduction in the dimensionality of the signals is achieved. The extracted features are then fed into a sparse representation based classifier (SRC). Our experimental results on the GTZAN database demonstrate that the proposed method outperforms other state-of-the-art SRC approaches. Moreover, the computational efficiency of the proposed method is better than that of other Compressive Sampling (CS)-based classifiers.
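The pipeline outlined in the abstract (windowed short-term spectral features, long-term features built from them, then a sparse representation based classifier) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the frame length, hop size, number of retained FFT bins, and the mean/standard-deviation aggregation are all assumptions made for illustration, and orthogonal matching pursuit is swapped in as a simple greedy stand-in for whatever sparse solver the paper uses.

```python
import numpy as np

def short_term_features(signal, frame_len=256, hop=128, keep=16):
    """Sparse magnitude spectrum per Hann-windowed frame (assumed parameters)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    feats = np.zeros((n_frames, frame_len // 2 + 1))
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        mag = np.abs(np.fft.rfft(frame))
        top = np.argsort(mag)[-keep:]   # indices of the `keep` largest bins
        feats[i, top] = mag[top]        # every other bin stays zero
    return feats

def long_term_features(short_feats):
    """Combine frame-level features into one vector (mean and std are assumptions)."""
    return np.concatenate([short_feats.mean(axis=0), short_feats.std(axis=0)])

def omp(D, y, n_nonzero):
    """Orthogonal matching pursuit: greedy sparse code of y over the columns of D."""
    x = np.zeros(D.shape[1])
    residual, support = y.copy(), []
    for _ in range(n_nonzero):
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:                # no new atom improves the fit
            break
        support.append(j)
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
        x[:] = 0.0
        x[support] = coef
    return x

def src_predict(D, labels, y, n_nonzero=5):
    """SRC decision rule: choose the class whose atoms leave the smallest residual."""
    D = D / (np.linalg.norm(D, axis=0, keepdims=True) + 1e-12)
    x = omp(D, y, n_nonzero)
    classes = np.unique(labels)
    residuals = [np.linalg.norm(y - D[:, labels == c] @ x[labels == c])
                 for c in classes]
    return classes[int(np.argmin(residuals))]
```

To use the sketch, one would stack `long_term_features(short_term_features(clip))` for each training clip as the columns of `D`, keep the matching genre labels in a NumPy array `labels`, and call `src_predict(D, labels, y)` on the feature vector `y` of a new clip.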

Pages: 273-280

Page count: 8
