MoCaGCN: Cancer Subtype Classification by Developing Causal Graph Structure Learning

被引:0
作者
Zhang, Xiaobin [1 ]
Yang, Fan [2 ]
Yang, Xiaohui [1 ]
Li, Qian [2 ]
Li, Na [3 ]
Zhao, Yaoyao [1 ]
机构
[1] Jinan Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[2] Shandong Univ, Sch Publ Hlth, Cheeloo Coll Med, Jinan, Peoples R China
[3] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MEDICAL ARTIFICIAL INTELLIGENCE, MEDAI 2024 | 2024年
基金
中国国家自然科学基金;
关键词
multi-omits; cancer subtype classification; causal structure learning; graph convolutional network; data integration; SELECTION; NETWORK;
D O I
10.1109/MedAI62885.2024.00087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The classification of cancer subtypes is a crucial step towards more personalized treatment and has important implications for understanding the biological mechanisms of cancer subtypes. Multi-omits data present challenges in cancer research due to their high-dimensional feature spaces, significant noise, and limited sample sizes. However, the scale and heterogeneity of this data makes integration and classification a non-trivial task. Models based on correlations often contain confounding and redundant features, resulting in uninterpretability and weak generalization. Therefore, we proposed a novel multi-omits classification framework named Muti-omits Causal Graph Convolutional Network incorporated with causal structure learning and graph neural networks. Furthermore, the model can improve the performance and generalization capabilities of cancer classification by considering the prior knowledge of gene-gene interactions. This study constructed a more efficient framework for cancer subtype classification based on multi-omics data, which mainly include multi-omits data preprocessing, omits causal graph network construction, and multi-omits data integration methods. Finally, we demonstrated that our model outperforms the slate-of-the-art methods with fewer variables for higher accuracy on the real lung adenocarcinoma dataset from the cancer genome atlas program.
引用
收藏
页码:617 / 625
页数:9
相关论文
共 35 条
[1]   Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection [J].
Ang, Jun Chin ;
Mirzal, Andri ;
Haron, Habibollah ;
Hamed, Haza Nuzly Abdull .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) :971-989
[2]   Two hybrid wrapper-filter feature selection algorithms applied to high-dimensional microarray experiments [J].
Apolloni, Javier ;
Leguizamon, Guillermo ;
Alba, Enrique .
APPLIED SOFT COMPUTING, 2016, 38 :922-932
[3]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[4]  
Cancer Genome Atlas Research Network, 2018, Nature, V559, pE12, DOI [10.1038/nature13385, 10.1038/s41586-018-0228-6]
[5]   MOCSS: Multi-omics data clustering and cancer subtyping via shared and specific representation learning [J].
Chen, Yuxin ;
Wen, Yuqi ;
Xie, Chenyang ;
Chen, Xinjian ;
He, Song ;
Bo, Xiaochen ;
Zhang, Zhongnan .
ISCIENCE, 2023, 26 (08)
[6]  
Defferrard M, 2016, ADV NEUR IN, V29
[7]   The future of early cancer detection [J].
Fitzgerald, Rebecca C. ;
Antoniou, Antonis C. ;
Fruk, Ljiljana ;
Rosenfeld, Nitzan .
NATURE MEDICINE, 2022, 28 (04) :666-677
[8]  
Goswami Siddharth, 2024, 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI), P1, DOI 10.1109/IATMSI60426.2024.10502533
[9]   Interpreting Black-Box Models: A Review on Explainable Artificial Intelligence [J].
Hassija, Vikas ;
Chamola, Vinay ;
Mahapatra, Atmesh ;
Singal, Abhinandan ;
Goel, Divyansh ;
Huang, Kaizhu ;
Scardapane, Simone ;
Spinelli, Indro ;
Mahmud, Mufti ;
Hussain, Amir .
COGNITIVE COMPUTATION, 2024, 16 (01) :45-74
[10]   Gene Expression Classification of Lung Adenocarcinoma into Molecular Subtypes [J].
Hu, Fuyan ;
Zhou, Yuxuan ;
Wang, Qing ;
Yang, Zhiyuan ;
Shi, Yu ;
Chi, Qingjia .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) :1187-1197