Data mining and preprocessing application on component reports of an airline company in Turkey

Cited by: 17
Authors
Gurbuz, Feyza [1 ]
Ozbakir, Lale [1 ]
Yapici, Huseyin [2 ]
Affiliations
[1] Univ Erciyes Kayseri, Dept Ind Engn, TR-38039 Kayseri, Turkey
[2] Univ Erciyes Kayseri, Dept Mech Engn, TR-38039 Kayseri, Turkey
Keywords
Data mining; Preprocessing; Rough sets; Find laws;
DOI
10.1016/j.eswa.2010.11.076
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Risk and safety have always been important considerations in aviation. With the rapid growth in air travel, flight delays, cancellations and incidents/accidents have also increased dramatically in recent years (Nazeri & Jianping, 2002). The aviation industry has accumulated a large amount of knowledge and data, which may be stored as pilot reports, maintenance reports, incident reports or delay reports. This paper focuses on different preprocessing and feature selection techniques applied to the 15 component reports of an airline company in Turkey in order to understand and clean the data set. Regression analysis, anomaly detection analysis, find dependencies and rough sets are used to reduce the data set. Data mining classification techniques are also used to predict the warning level of a component, which serves as the class attribute. The Polyanalyst, SPSS Clementine, Minitab and Rosetta software tools are used for this purpose. The Find laws module of Polyanalyst is used to find relations and retrieve information about the components' warning levels. (C) 2010 Elsevier Ltd. All rights reserved.
Pages: 6618-6626
Number of pages: 9
Related papers
17 in total
[1] [Anonymous], 2005, USER MANUEL POLYANAL
[2] [Anonymous], 2005, CLEMENTINE 10 0 USER
[3] Chen, Wei-Chou; Tseng, Shian-Shyong; Hong, Tzung-Pei. An efficient bit-based feature selection method [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (04): 2858-2869
[4] Crone, Sven F.; Lessmann, Stefan; Stahlbock, Robert. The impact of preprocessing on data mining: An evaluation of classifier sensitivity in direct marketing [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, 173 (03): 781-800
[5] DUNHAM MH, 2002, DATAMINING INTRO ADV, V1
[6] Guyon I., 2003, J MACH LEARN RES, V3, P1157
[7] Hand D., 2001, ADAP COMP MACH LEARN
[8] Hu, XH. DB-HReduction: A data preprocessing algorithm for data mining applications [J]. APPLIED MATHEMATICS LETTERS, 2003, 16 (06): 889-895
[9] Jiawei H., 2001, DATA MINING CONCEPTS
[10] Kim Y., 2003, DATA MINING