Identification of Breast Cancer Metastasis Markers from Gene Expression Profiles Using Machine Learning Approaches

被引:8
作者
Jung, Jinmyung [1 ]
Yoo, Sunyong [2 ]
机构
[1] Univ Suwon, Coll Informat & Commun Technol, Div Data Sci, Hwaseong 18323, South Korea
[2] Chonnam Natl Univ, Dept ICT Convergence Syst Engn, Gwangju 61005, South Korea
基金
新加坡国家研究基金会;
关键词
metastasis marker; gene expression; machine learning; XGBoost; breast cancer; feature importance; PROTEIN; REGULATOR; RESOURCE;
D O I
10.3390/genes14091820
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Cancer metastasis accounts for approximately 90% of cancer deaths, and elucidating markers in metastasis is the first step in its prevention. To characterize metastasis marker genes (MGs) of breast cancer, XGBoost models that classify metastasis status were trained with gene expression profiles from TCGA. Then, a metastasis score (MS) was assigned to each gene by calculating the inner product between the feature importance and the AUC performance of the models. As a result, 54, 202, and 357 genes with the highest MS were characterized as MGs by empirical p-value cutoffs of 0.001, 0.005, and 0.01, respectively. The three sets of MGs were compared with those from existing metastasis marker databases, which provided significant results in most comparisons (p-value < 0.05). They were also significantly enriched in biological processes associated with breast cancer metastasis. The three MGs, SPPL2C, KRT23, and RGS7, showed highly significant results (p-value < 0.01) in the survival analysis. The MGs that could not be identified by statistical analysis (e.g., GOLM1, ELAVL1, UBP1, and AZGP1), as well as the MGs with the highest MS (e.g., ZNF676, FAM163B, LDOC2, IRF1, and STK40), were verified via the literature. Additionally, we checked how close the MGs were to each other in the protein-protein interaction networks. We expect that the characterized markers will help understand and prevent breast cancer metastasis.
引用
收藏
页数:11
相关论文
共 50 条
[41]   Predicting breast cancer metastasis by using serum biomarkers and clinicopathological data with machine learning technologies [J].
Tseng, Yi-Ju ;
Huang, Chuan-En ;
Wen, Chiao-Ni ;
Lai, Po-Yin ;
Wu, Min-Hsien ;
Sun, Yu-Chen ;
Wang, Hsin-Yao ;
Lu, Jang-Jih .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 128 :79-86
[42]   Neuroblastoma-derived secretory protein is a novel secreted factor overexpressed in neuroblastoma [J].
Vasudevan, Sanjeev A. ;
Shang, Xiaoyi ;
Chang, Shirong ;
Ge, Ningling ;
Diaz-Miron, Jose L. ;
Russell, Heidi V. ;
Hicks, M. John ;
Ludwig, Andrew D. ;
Wesson, Catherine L. ;
Burlingame, Susan M. ;
Kim, Eugene S. ;
Khan, Javed ;
Yang, Jianhua ;
Nuchtern, Jed G. .
MOLECULAR CANCER THERAPEUTICS, 2009, 8 (08) :2478-2489
[43]   A multigene support vector machine predictor for metastasis of cutaneous melanoma [J].
Wei, Dong .
MOLECULAR MEDICINE REPORTS, 2018, 17 (02) :2907-2914
[44]   Identification of key genes involved in the metastasis of clear cell renal cell carcinoma [J].
Wei, Wenhao ;
Lv, Yufeng ;
Gan, Zuhuan ;
Zhang, Yanxian ;
Han, Xueqiong ;
Xu, Zihai .
ONCOLOGY LETTERS, 2019, 17 (05) :4321-4328
[45]   Breast Cancer Migration and Invasion Depend on Proteasome Degradation of Regulator of G-Protein Signaling 4 [J].
Xie, Yan ;
Wolff, Dennis W. ;
Wei, Taotao ;
Wang, Bo ;
Deng, Caishu ;
Kirui, Joseph K. ;
Jiang, Haihong ;
Qin, Jianbing ;
Abel, Peter W. ;
Tu, Yaping .
CANCER RESEARCH, 2009, 69 (14) :5743-5751
[46]   AZGP1 suppresses epithelial-to-mesenchymal transition and hepatic carcinogenesis by blocking TGFβ-ERK2 pathways [J].
Xu, Ming-Yi ;
Chen, Rong ;
Yu, Jing-Xia ;
Liu, Ting ;
Qu, Ying ;
Lu, Lun-Gen .
CANCER LETTERS, 2016, 374 (02) :241-249
[47]   LDOC1 regulates Wnt5a expression and osteosarcoma cell metastasis and is correlated with the survival of osteosarcoma patients [J].
Yong, Bi-Cheng ;
Lu, Jin-Chang ;
Xie, Xian-Biao ;
Su, Qiao ;
Tan, Ping-Xian ;
Tang, Qing-Lian ;
Wang, Jing ;
Huang, Gang ;
Han, Ju ;
Xu, Hong-Wen ;
Shen, Jing-Nan .
TUMOR BIOLOGY, 2017, 39 (02)
[48]   Golgi Membrane Protein 1 (GOLM1) Promotes Growth and Metastasis of Breast Cancer Cells via Regulating Matrix Metalloproteinase-13 (MMP13) [J].
Zhang, Rui ;
Zhu, Zhi ;
Shen, Wenzhuang ;
Li, Xingrui ;
Dhoomun, Deenraj Kush ;
Tian, Yao .
MEDICAL SCIENCE MONITOR, 2019, 25 :847-855
[49]   A Feedback Loop Comprising EGF/TGFα Sustains TFCP2-Mediated Breast Cancer Progression [J].
Zhao, Yi ;
Kaushik, Neha ;
Kang, Jae-Hyeok ;
Kaushik, Nagendra Kumar ;
Son, Seung Han ;
Uddin, Nizam ;
Kim, Min-Jung ;
Kim, Chul Geun ;
Lee, Su-Jae .
CANCER RESEARCH, 2020, 80 (11) :2217-2229
[50]   HCMDB: the human cancer metastasis database [J].
Zheng, Guantao ;
Ma, Yijie ;
Zou, Yang ;
Yin, An ;
Li, Wushuang ;
Dong, Dong .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D950-D955