Bayesian Optimization Cost-Sensitive XGBoost Learning Algorithm for Imbalanced Data in Semiconductor Industry

被引:0
作者
Shamsudin, Haziqah [1 ]
Yusof, Umi Kalsom [1 ]
Kashif, Fizza [1 ]
Isa, Iza Sazanita [1 ,2 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, George Town, Malaysia
[2] Univ Teknol MARA, Coll Engn, Ctr Elect Engn Studies, George Town, Malaysia
来源
JORDAN JOURNAL OF ELECTRICAL ENGINEERING | 2023年 / 9卷 / 04期
关键词
XGBoost learning algorithm; Cost-sensitivity; Imbalanced data; Semiconductor classification; Ensembled model; CLASSIFICATION;
D O I
10.5455/jjee.204-1671971895
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an improved ensemble learning model based on extreme gradient boosting (XGBoost) with Bayesian optimization cost-sensitive learning algorithm for dealing with highly imbalanced data in the semiconductor process to achieve the highest possible pass and fail accuracy or recall for the classification performances. Most of the existing models are biased toward the majority class neglecting the minority class. The proposed Bayesian optimization cost-sensitive XGboost model is configured to be applied to the semiconductor dataset. The obtained experimental results - based on benchmarking semiconductor industry dataset - show 91.46% and 23.08% for the pass and fail accuracies, respectively. This confirms that the proposed model is significant for imbalanced cases in semiconductor applications. Moreover, this investigation reveals that the proposed model is able not only to maintain the performance of the majority class, but also to classify well the minority class.
引用
收藏
页码:552 / 565
页数:14
相关论文
共 50 条
[31]   Improved cost-sensitive representation of data for solving the imbalanced big data classification problem [J].
Mahboubeh Fattahi ;
Mohammad Hossein Moattar ;
Yahya Forghani .
Journal of Big Data, 9
[32]   Cost-Sensitive Broad Learning System for Imbalanced Classification and Its Medical Application [J].
Yao, Liang ;
Wong, Pak Kin ;
Zhao, Baoliang ;
Wang, Ziwen ;
Lei, Long ;
Wang, Xiaozheng ;
Hu, Ying .
MATHEMATICS, 2022, 10 (05)
[33]   A cost-sensitive multi-criteria quadratic programming model for imbalanced data [J].
Chao, Xiangrui ;
Peng, Yi .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2018, 69 (04) :500-516
[34]   Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data [J].
Castro, Cristiano L. ;
Braga, Antonio P. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (06) :888-899
[35]   COST-SENSITIVE SPARSE LINEAR REGRESSION FOR CROWD COUNTING WITH IMBALANCED TRAINING DATA [J].
Huang, Xiaolin ;
Zou, Yuexian ;
Wang, Yi .
2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
[36]   Efficient Utilization of Missing Data in Cost-Sensitive Learning [J].
Zhu, Xiaofeng ;
Yang, Jianye ;
Zhang, Chengyuan ;
Zhang, Shichao .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (06) :2425-2436
[37]   Cost-sensitive learning using logical analysis of data [J].
Osman, Hany .
KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (06) :3571-3606
[38]   A Differential Evolution-Based Method for Class-Imbalanced Cost-Sensitive Learning [J].
Qiu, Chen ;
Jiang, Liangxiao ;
Kong, Ganggang .
2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
[39]   Cost-Sensitive Hypergraph Learning With F-Measure Optimization [J].
Wang, Nan ;
Liang, Ruozhou ;
Zhao, Xibin ;
Gao, Yue .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (05) :2767-2778
[40]   Using Cost-Sensitive Learning and Feature Selection Algorithms to Improve the Performance of Imbalanced Classification [J].
Feng, Fang ;
Li, Kuan-Ching ;
Shen, Jun ;
Zhou, Qingguo ;
Yang, Xuhui .
IEEE ACCESS, 2020, 8 :69979-69996