Multi-omics Data Integration Model based on Isomap and Convolutional Neural Network

被引:1
作者
Alkhateeb, Abedalrhman [1 ]
ElKarami, Bashier [2 ]
Qattous, Hazem [1 ]
Al-refai, Abdullah [1 ]
AlAfeshat, Noor [1 ]
Shahrrava, Behnam [2 ]
Azzeh, Mohammad [3 ]
机构
[1] Princess Sumaya Univ Technol, Software Engn Dept, Amman, Jordan
[2] Univ Windsor, Dept Elect & Comp Engn, Windsor, ON, Canada
[3] Princess Sumaya Technol, Data Sci Dept, Amman, Jordan
来源
2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA | 2022年
关键词
Multi-omics data integration; Data Embedding; Isomap; Deep learning; DIMENSIONALITY REDUCTION;
D O I
10.1109/ICMLA55696.2022.00218
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent advances in genetics technologies have led to high-throughput characterized different biological molecules' functionalities. The availability of heterogeneous omics sparked the challenge of integrating them for further analysis. This work incorporates the Isomap technique to embed multiomic data into a convolutional neural network (CNN). The deep learning model fuses three omics data, which are gene expression, copy number alteration (CNA), and DNA methylation data, for breast cancer stage prediction. Isomap is utilized to convert the high-dimensional data into 2-dimensional maps. The gene similarity network (GSN) map is created based on gene expression data to preserve gene relationships. The values from three omics for each sample are used to color the GSN map based on the RGB system. The created GSN maps for all samples are fed into the CNN for classification. The model was applied to TCGA breast invasive carcinoma data set to predict the stage of breast cancer. It outperformed the state-of-art iSOM-GSN model in performance metrics, including accuracy, precision, recall, f1-measure, and area under the curve (AUC). The results indicate that a combination of Isomap embedding technique and CNN can successfully integrate a multiomics data set for cancer outcome prediction, including the diagnosis and prognosis of the complex disease.
引用
收藏
页码:1381 / 1385
页数:5
相关论文
共 20 条
[1]  
Agarap A. F., 2018, arXiv
[2]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[3]   Prospects and challenges of multi-omics data integration in toxicology [J].
Canzler, Sebastian ;
Schor, Jana ;
Busch, Wibke ;
Schubert, Kristin ;
Rolle-Kampczyk, Ulrike E. ;
Seitz, Herve ;
Kamp, Hennicke ;
von Bergen, Martin ;
Buesen, Roland ;
Hackermueller, Joerg .
ARCHIVES OF TOXICOLOGY, 2020, 94 (02) :371-388
[4]   Gene clustering by using query-based self-organizing maps [J].
Chang, Ray-I ;
Chu, Chih-Chun ;
Wu, Yu-Ying ;
Chen, Yen-Liang .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (09) :6689-6694
[5]   Comprehensive Molecular Portraits of Invasive Lobular Breast Cancer [J].
Ciriello, Giovanni ;
Gatza, Michael L. ;
Beck, Andrew H. ;
Wilkerson, Matthew D. ;
Rhie, Suhn K. ;
Pastore, Alessandro ;
Zhang, Hailei ;
McLellan, Michael ;
Yau, Christina ;
Kandoth, Cyriac ;
Bowlby, Reanne ;
Shen, Hui ;
Hayat, Sikander ;
Fieldhouse, Robert ;
Lester, Susan C. ;
Tse, Gary M. K. ;
Factor, Rachel E. ;
Collins, Laura C. ;
Allison, Kimberly H. ;
Chen, Yunn-Yi ;
Jensen, Kristin ;
Johnson, Nicole B. ;
Oesterreich, Steffi ;
Mills, Gordon B. ;
Cherniack, Andrew D. ;
Robertson, Gordon ;
Benz, Christopher ;
Sander, Chris ;
Laird, Peter W. ;
Hoadley, Katherine A. ;
King, Tari A. ;
Perou, Charles M. .
CELL, 2015, 163 (02) :506-519
[6]   A review of UMAP in population genetics [J].
Diaz-Papkovich, Alex ;
Anderson-Trocme, Luke ;
Gravel, Simon .
JOURNAL OF HUMAN GENETICS, 2021, 66 (01) :85-91
[7]   Multi-omics Data Integration Model Based on UMAP Embedding and Convolutional Neural Network [J].
ElKarami, Bashier ;
Alkhateeb, Abedalrhman ;
Qattous, Hazem ;
Alshomali, Lujain ;
Shahrrava, Behnam .
CANCER INFORMATICS, 2022, 21
[8]   iSOM-GSN: an integrative approach for transforming multi-omic data into gene similarity networks via self-organizing maps [J].
Fatima, Nazia ;
Rueda, Luis .
BIOINFORMATICS, 2020, 36 (15) :4248-4254
[9]  
Huang J, 2016, FRONT GENET, V7, DOI [10.3389/fgene.2016.00084, 10.3389/fgene.2017.00084]
[10]   Dimensionality reduction and visualization in principal component analysis [J].
Ivosev, Gordana ;
Burton, Lyle ;
Bonner, Ron .
ANALYTICAL CHEMISTRY, 2008, 80 (13) :4933-4944