Semi-supervised Ensemble Learning for Efficient Cancer Sample Classification from miRNA Gene Expression Data

被引:0
作者
Dikme Chisil B. Marak
Anindya Halder
Ansuman Kumar
机构
[1] North-Eastern Hill University,Department of Computer Application, School of Technology
来源
New Generation Computing | 2021年 / 39卷
关键词
Semi-supervised learning; Ensemble learning; Cancer classification; miRNA gene expression data;
D O I
暂无
中图分类号
学科分类号
摘要
Traditional classifiers often fail to produce desired classification accuracy because of inadequate training samples present in microRNA (miRNA) gene expression cancer datasets. In this context, we propose a novel semi-supervised ensemble learning (SSEL) strategy combining the (advantages of) semi-supervised learning and ensemble learning which is able to produce better results than the individual constituent classifiers. The proposed method is validated using eight publicly available miRNA gene expression datasets of pancreatic and colorectal cancers with respect to classification accuracy, precision, recall, macro F1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{1}$$\end{document}-measure and kappa in comparison to six other state-of-the-art methods. The experimental results reveal that the proposed SSEL method significantly dominates other compared methods for cancer sample classification. The results of the statistical significance tests, receiver operating characteristic curve and area under curve justify the relevance of the better results in favor of the proposed method.
引用
收藏
页码:487 / 513
页数:26
相关论文
共 103 条
[1]  
Esquela-Kerscher E(2006)Oncomirs—microRNAs with a role in cancer Nat. Rev. cancer 6 259-269
[2]  
Slack FJ(2014)ncPred: ncRNA-disease association prediction through tripartite network-based inference Front. Bioeng. Biotechnol. 2 71-24
[3]  
Alaimo S(2020)Prediction of new associations between ncRNAs and diseases exploiting multi-type hierarchical clustering BMC Bioinform. 21 1-780
[4]  
Giugno R(2006)MicroRNAs in cell proliferation, cell death, and tumorigenesis Br. J. Cancer 96 776-297
[5]  
Pulvirenti A(2004)MicroRNAs: genomics, biogenesis, mechanism, and function Cell 116 281-13
[6]  
Barracchia EP(2008)A comparative study of different machine learning methods on microarray gene expression data BMC Genom. 9 1-159
[7]  
Pio G(2017)Gene expression based cancer classification Egypt. Inform. J. 18 151-129
[8]  
D’Elia D(2013)A survey of logic based classifiers Int. J. Future Comput. Commun. 2 126-24
[9]  
Ceci M(2007)Supervised machine learning: a review of classification techniques Emerg. Artif. Intell. Appl. Comput. Eng. 160 3-21
[10]  
Hwang HW(2015)Gene expression data classification using support vector machine and mutual information-based gene selection Procedia Comput. Sci. 47 13-329