Identification of Common Gene Signatures in Microarray and RNA-Sequencing Data Using Network-Based Regularization

Times cited: 0
Authors
Diegues, Ines [1]
Vinga, Susana [1]
Lopes, Marta B. [2, 3]
Affiliations
[1] Univ Lisbon, Inst Super Tecn, INESC ID, R Alves Redol 9, P-1000029 Lisbon, Portugal
[2] UNL, FCT, NOVA Lab Comp Sci & Informat NOVA LINCS, P-2829516 Caparica, Portugal
[3] UNL, FCT, CMA, P-2829516 Caparica, Portugal
Source
BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2020) | 2020 / Volume 12108
Keywords
Microarray; RNA-sequencing; Machine learning; Biomarkers; Network-based regularization; EXPRESSION OMNIBUS; CANCER; ASSOCIATION; SELECTION
DOI
10.1007/978-3-030-45385-5_2
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Microarray and RNA-sequencing (RNA-seq) gene expression data, combined with machine learning algorithms, are promising for the discovery of new cancer biomarkers. However, although the two techniques serve a similar purpose, they differ in some fundamental ways. We propose a methodology for cross-platform integration and biomarker discovery based on network-based regularization via the Twin Networks Recovery (twiner) penalty, as a strategy to enhance the selection of breast cancer gene signatures that have similar correlation patterns in both platforms. In a classification setting based on sparse logistic regression (LR), taking as classes tumor samples from both RNA-seq and microarray, and normal tissue samples, twiner achieved precision-recall accuracies of 99.71% and 99.57% in the training and test sets, respectively. Moreover, the survival analysis results validated the biological relevance of the signatures identified by twiner. Therefore, by leveraging the existing volume of microarray and RNA-seq data, a single biological conclusion can be reached, independent of each technology.
Pages: 15-26
Number of pages: 12