Cost-sensitive learning using logical analysis of data

Cited by: 4
Authors
Osman, Hany [1 ]
Affiliations
[1] King Fahd Univ Petr & Minerals, KFUPM, Ind & Syst Engn Dept, Dhahran 31261, Saudi Arabia
Keywords
Classification; Cost-sensitive learning; Combinatorial optimization; Patterns selection; Logical analysis of data; Machine learning; PATTERN GENERATION; PROGNOSTIC METHODOLOGY; ALGORITHM; PREDICTION; DIAGNOSIS;
DOI
10.1007/s10115-024-02070-1
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Classification is a common data mining task that assigns a class label to an unseen observation. It has been widely used in decision making for various applications, and many machine learning algorithms have been developed to accomplish this task. Classification becomes critical when the problem at hand involves serious situations such as fraud detection, cancer diagnosis, and quality control. Learning in these situations is characterized by predetermined asymmetric costs of incorrect class prediction, or by critical consequences associated with erroneous class prediction. In this paper, a novel approach to cost-sensitive learning is proposed. The approach employs the theory of logical analysis of data (LAD) to build accurate cost-sensitive classifiers. Two classifiers are proposed. The first classifier is established by solving a proposed pattern selection model, the minimum misclassification cost model (MMCM), which aims at minimizing the asymmetric misclassification cost. The second classifier is established by solving another proposed pattern selection model, the maximum precision-recall model (MPRM), which maximizes precision and recall with the aim of reaching 100% accuracy. A comparative study is conducted using real datasets. The proposed MMCM enabled LAD to achieve up to a 32.22% reduction in misclassification cost relative to the traditional implementation of LAD. Moreover, MPRM provided up to a 19.15% increase in precision and up to a 37% increase in recall. MPRM also improved the performance of LAD relative to common machine learning algorithms by providing better combinations of recall and false positive rate. This enabled LAD to provide the point closest to the optimal point on the receiver operating characteristic (ROC) diagram when compared with existing machine learning methods.
Incorporating the MMCM and the MPRM models into LAD establishes a novel implementation of LAD that makes LAD a promising cost-sensitive learning classifier compared to other machine learning classifiers.
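The quantities named in the abstract can be illustrated with a minimal sketch. It is not the paper's MMCM or MPRM formulation: the cost weights `cost_fp` and `cost_fn`, the toy labels, and the use of Euclidean distance to the ROC point (FPR = 0, recall = 1) as the measure of closeness to the optimal point are all assumptions made here for illustration.

```python
from math import hypot


def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    return tp, fp, fn, tn


def summarize(y_true, y_pred, cost_fp=1.0, cost_fn=5.0):
    """Asymmetric cost, precision, recall, FPR, and ROC distance.

    cost_fp and cost_fn are hypothetical per-error costs; a missed
    positive is assumed to be five times as costly as a false alarm.
    """
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    fpr = fp / (fp + tn) if fp + tn else 0.0
    return {
        "cost": cost_fp * fp + cost_fn * fn,
        "precision": precision,
        "recall": recall,
        "fpr": fpr,
        # Assumed closeness measure: Euclidean distance to (FPR=0, recall=1).
        "roc_distance": hypot(fpr, 1.0 - recall),
    }


# Toy labels (made up): 3 positives, 5 negatives.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
result = summarize(y_true, y_pred)
print(result)
```

With unequal costs, two classifiers that make the same number of errors can differ in total cost, which is the asymmetry a cost-sensitive pattern selection model would target.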
Pages: 3571-3606
Number of pages: 36
Related papers
42 in total
[21]   Demurrage pattern analysis using logical analysis of data: A case study of the Ulsan Port Authority [J].
Kweon, Sang Jin ;
Hwang, Seong Wook ;
Lee, Seokgi ;
Jo, Min Ji .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 206
[22]  
Larose D. T., 2015, DATA MINING AND PREDICTIVE ANALYTICS, V2nd
[23]   Recent advances in the theory and practice of Logical Analysis of Data [J].
Lejeune, Miguel ;
Lozin, Vadim ;
Lozina, Irina ;
Ragab, Ahmed ;
Yacout, Soumaya .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 275 (01) :1-15
[24]   Extensions of Logical Analysis of Data for growth hormone deficiency diagnoses [J].
Lemaire, Pierre .
ANNALS OF OPERATIONS RESEARCH, 2011, 186 (01) :199-211
[25]   A Survey of Cost-Sensitive Decision Tree Induction Algorithms [J].
Lomax, Susan ;
Vadera, Sunil .
ACM COMPUTING SURVEYS, 2013, 45 (02)
[26]   A data-driven approach to predict the success of bank telemarketing [J].
Moro, Sergio ;
Cortez, Paulo ;
Rita, Paulo .
DECISION SUPPORT SYSTEMS, 2014, 62 :22-31
[27]   Fault diagnosis in power transformers using multi-class logical analysis of data [J].
Mortada, Mohamad-Ali ;
Yacout, Soumaya ;
Lakis, Aouni .
JOURNAL OF INTELLIGENT MANUFACTURING, 2014, 25 (06) :1429-1439
[28]   Diagnosis of rotor bearings using logical analysis of data [J].
Mortada, Mohamad-Ali ;
Yacout, Soumaya ;
Lakis, Aouni .
JOURNAL OF QUALITY IN MAINTENANCE ENGINEERING, 2011, 17 (04) :371-+
[29]  
Nanda S., 2001, International Journal of Intelligent Systems in Accounting, Finance and Management, V10, P155, DOI 10.1002/isaf.203
[30]   Condition-based monitoring of the rail wheel using logical analysis of data and ant colony optimization [J].
Osman, Hany ;
Yacout, Soumaya .
JOURNAL OF QUALITY IN MAINTENANCE ENGINEERING, 2023, 29 (02) :377-400