Massively Parallel Feature Extraction Framework Application in Predicting Dangerous Seismic Events

Cited by: 11
Author
Grzegorowski, Marek [1]
Affiliation
[1] Univ Warsaw, Fac Math Informat & Mech, Banacha 2, PL-02097 Warsaw, Poland
Source
PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS) | 2016 / Vol. 8
DOI
10.15439/2016F90
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper we introduce an automated mechanism for knowledge discovery from data streams. As part of this work, we also present a new approach to creating classifier ensembles based on a wide variety of models. Furthermore, we describe an innovative, highly scalable feature extraction and selection framework designed to work with the MapReduce programming model. We apply this framework to build an ensemble of classifiers that takes into account both the quality and the diversity of the individual models. The effectiveness of the solution has been verified through participation in an open data mining competition on predicting periods of increased seismic activity that cause life-threatening accidents in coal mines. The submitted solution obtained the highest AUC score of all the solutions uploaded by 106 participating research teams.
Pages: 225-229
Page count: 5