Semi-Supervised Ensemble Learning for Expanding the Low Sample Size of Microarray Dataset

被引:1
作者
Alrefai, Nashat [1 ]
Ibrahim, Othman [2 ]
机构
[1] Univ Teknol Malaysia, Sch Comp, Fac Engn, Skudi, Johor, Malaysia
[2] Univ Teknol Malaysia, Azman Hashim Int Business Sch, Skudi, Johor, Malaysia
来源
INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021) | 2021年
关键词
Semi-Supervised learning; Ensemble Learning; Feature Selection; cancer classification; Self-labeling;
D O I
10.1109/ICECET52533.2021.9698770
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cancer is considered one of the most common causes of death in the world. Using a microarray dataset for cancer classification can provide insight into possible treatment strategies. However, it is hard and expensive to collect a full labeled dataset. Self-training can solve this issue by train the classifier on limited labeled data, then adding unlabeled data incrementally for classification to select the most confidence between them and adding it to the original labeled data. The current study aims to propose a framework that can reduce the high dimensionality by using aggregation ranker-filters and particle swarm intelligence with the ensemble method (PSO-ensemble). Afterward, An Adaptive Self-Training Method (ASTM) can boost the labeled set by repeatedly incrementing it with the most confident samples from the unlabeled datasets to solve the low sample size issue. Empirical results demonstrate that ASTM can effectively improve classification performance.
引用
收藏
页码:1140 / 1145
页数:6
相关论文
共 12 条
[1]   A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification [J].
Almugren, Nada ;
Alshamlan, Hala .
IEEE ACCESS, 2019, 7 :78533-78548
[2]  
Alrefai N., 2021, DEEP LEARNING COVID, V11, P4531, DOI [10.11591/ijece.v11i5.pp4531-4541, DOI 10.11591/IJECE.V11I5.PP4531-4541]
[3]  
alrefai nashat, 2021, J THEOR APPL INF TEC
[4]  
[Anonymous], CANCER
[5]  
Barbachan e Silva M., 2021, OPTIMISATION CANC ST, P442, DOI [10.1109/cec45853.2021.9504911, DOI 10.1109/CEC45853.2021.9504911]
[6]  
Cancer.Net Editorial Board, 2021, BRAIN TUM STAT
[7]   Multi-train: A semi-supervised heterogeneous ensemble classifier [J].
Gu, Shenkai ;
Jin, Yaochu .
NEUROCOMPUTING, 2017, 249 :202-211
[8]  
Hambali M.A., 2020, International Journal of Cognitive Computing in Engineering, V1, P78, DOI DOI 10.1016/J.IJCCE.2020.11.001
[9]   Semi-Supervised Self-Training Method Based on an Optimum-Path Forest [J].
Li, Junnan ;
Zhu, Qingsheng .
IEEE ACCESS, 2019, 7 :36388-36399
[10]   An enhanced feature selection filter for classification of microarray cancer data [J].
Mazumder, Dilwar Hussain ;
Veilumuthu, Ramachandran .
ETRI JOURNAL, 2019, 41 (03) :358-370