Machine learning identifies 10 feature miRNAs for lung squamous cell carcinoma

Cited by: 11
Authors
Ye, Zheng [1]
Sun, Bo [1]
Xiao, Zhongdang [1]
Affiliations
[1] Southeast Univ, Sch Biol Sci & Med Engn, State Key Lab Bioelect, Nanjing 210096, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
LUSC; miRNA; Biomarker; Machine learning; MICRORNA EXPRESSION PROFILES; CLINICAL-OUTCOMES; FEATURE-SELECTION; CANCER; SURVIVAL; SIGNATURES; ADENOCARCINOMA; ASSOCIATIONS; RECURRENCE; PREDICTION;
DOI
10.1016/j.gene.2020.144669
Chinese Library Classification
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Lung squamous cell carcinoma (LUSC) is a common type of malignancy, and the mechanism behind its tumor progression is not yet clear. The aim of this study is to use machine learning to identify feature miRNAs that can be reliably used as biomarkers for diagnosing LUSC. We downloaded microRNA expression data and clinical data from The Cancer Genome Atlas (TCGA) database and the Gene Expression Omnibus (GEO) database to identify differences in microRNA expression between primary tumor tissues and para-carcinoma tissues from LUSC. Construction of a miRNA-mRNA interaction network, GO and KEGG pathway analysis, and Kaplan-Meier survival analysis were used to explore the biological functions of the identified microRNAs. Twenty-one feature miRNAs distinguishing LUSC tumor tissues from para-carcinoma tissues were identified with the support of support vector machine (SVM) and principal component analysis (PCA) methods. Among them, ten feature miRNAs (mir-143, mir-100, mir-101-1, mir-101-2, mir-182, mir-183, mir-205, mir-21, mir-30a, and mir-30d) were ultimately identified as a feature group that can separate cancer tissues from adjacent tissues, and cross-validation showed that this group achieves very high accuracy and recall. Using the KEGG, Reactome, and GO databases, these 10 miRNAs and their target genes were found to be highly correlated with cancer. Survival analysis found that this group of miRNAs had a significant relationship with the survival rate of cancer patients, and their expression differed significantly between tumor tissues and healthy tissues. The dysregulated feature miRNAs might be involved in the pathology of LUSC and could be used as potential diagnostic biomarkers or therapeutic targets for LUSC.
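The abstract reports cross-validated accuracy and recall for a 10-feature classifier but gives no implementation details. The following is a minimal sketch of how those two metrics can be computed under stratified k-fold cross-validation. Everything in it is an assumption for illustration: the data are synthetic (not TCGA or GEO), the function names are hypothetical, and a nearest-centroid classifier is swapped in for the SVM used in the study so that the example needs only the Python standard library.

```python
import random

def make_synthetic(n_per_class=40, n_features=10, shift=2.0, seed=0):
    """Toy expression profiles (synthetic): label 1 = tumor, 0 = adjacent tissue."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            X.append([rng.gauss(shift * label, 1.0) for _ in range(n_features)])
            y.append(label)
    return X, y

def centroid(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def predict(centroids, x):
    """Assign x to the class whose centroid is closest (stand-in for an SVM)."""
    return min(centroids, key=lambda label: sq_dist(centroids[label], x))

def stratified_folds(y, k, seed=0):
    """Split sample indices into k folds, keeping class proportions balanced."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for label in sorted(set(y)):
        idx = [i for i, v in enumerate(y) if v == label]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)
    return folds

def cross_validate(X, y, k=5):
    """Return (accuracy, recall for class 1) pooled over k held-out folds."""
    tp = fn = correct = 0
    for test_idx in stratified_folds(y, k):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held_out]
        # Fit on the training folds only, so held-out samples stay unseen.
        centroids = {
            label: centroid([X[i] for i in train_idx if y[i] == label])
            for label in (0, 1)
        }
        for i in test_idx:
            pred = predict(centroids, X[i])
            correct += pred == y[i]
            if y[i] == 1:
                tp += pred == 1
                fn += pred == 0
    return correct / len(y), tp / (tp + fn)

X, y = make_synthetic()
accuracy, recall = cross_validate(X, y)
print(f"accuracy={accuracy:.3f} recall={recall:.3f}")
```

Recall here is the fraction of tumor samples correctly labeled as tumor, which is the quantity that matters most for a diagnostic marker panel; accuracy is pooled over both classes.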
Pages: 11