Optical Music Recognition: Standard and Cost-Sensitive Learning with Imbalanced Data

被引:0
作者
Lesinski, Wojciech [1 ]
Jastrzebska, Agnieszka [2 ]
机构
[1] Univ Bialystok, Fac Math & Comp Sci, PL-15245 Bialystok, Poland
[2] Warsaw Univ Technol, Fac Math & Informat Sci, PL-00662 Warsaw, Poland
来源
COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT | 2015年 / 9339卷
关键词
Cost-sensitive learning; SVM; Random forest; Feature selection; Optical music recognition; Imbalanced data;
D O I
10.1007/978-3-319-24369-6_51
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The article is focused on a particular aspect of classification, namely the issue of class imbalance. Imbalanced data adversely affects the recognition ability and requires proper classifier's construction. In this work we present a case of music notation as an example of imbalanced data. Three classification algorithms - random forest, standard SVM and cost-sensitive SVM are described and tested. Feature selection based on random forest feature importance was used. Also, feature dimension reduction using PCA was studied.
引用
收藏
页码:601 / 612
页数:12
相关论文
共 15 条
[1]  
Abe N., 2004, P 10 ACM SIGKDD INT, P3, DOI DOI 10.1145/1014052.1014056
[2]   ON MULTI-CLASS COST-SENSITIVE LEARNING [J].
Zhou, Zhi-Hua ;
Liu, Xu-Ying .
COMPUTATIONAL INTELLIGENCE, 2010, 26 (03) :232-257
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[5]  
Elkan C., 2001, The Foundations of Cost-Sensitive Learning, P973
[6]  
Garcia V., 2007, 2 C ESP INF, P283
[7]   Learning from Imbalanced Data [J].
He, Haibo ;
Garcia, Edwardo A. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1263-1284
[8]  
Homenda W, 2005, ADV SOFT COMP, P835
[9]   PROCESSING UNCERTAIN-INFORMATION IN THE LINEAR-SPACE OF FUZZY-SETS [J].
HOMENDA, W ;
PEDRYCZ, W .
FUZZY SETS AND SYSTEMS, 1991, 44 (02) :187-198
[10]  
Homenda W., 2009, IMAGE PROCESSING COM, P120