Colorectal Cancer Detection via Metabolites and Machine Learning

被引:1
作者
Yang, Rachel [1 ]
Tsigelny, Igor F. [2 ,3 ,4 ,5 ]
Kesari, Santosh [6 ]
Kouznetsova, Valentina L. [2 ,3 ,5 ]
机构
[1] Univ Calif San Diego, San Diego Supercomp Ctr, REHS Program, MC 0505, 9500 Gilman Dr, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, San Diego Supercomp Ctr, MC 0505, 9500 Gilman Dr, La Jolla, CA 92093 USA
[3] BiAna, POB 2525, La Jolla, CA 92038 USA
[4] Univ Calif San Diego, Dept Neurosci, MC00505, 9500 Gilman Dr, La Jolla, CA 92093 USA
[5] CureSci Inst, 5820 Oberlin Dr, STE 202, San Diego, CA 92121 USA
[6] Pacific Neurosci Inst, 2125 Arizona Ave, Santa Monica, CA 90404 USA
关键词
colorectal cancer; data mining; machine learning; metabolites; pathway analysis; DATABASE; SMILES;
D O I
10.3390/cimb46050254
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Today, colorectal cancer (CRC) diagnosis is performed using colonoscopy, which is the current, most effective screening method. However, colonoscopy poses risks of harm to the patient and is an invasive process. Recent research has proven metabolomics as a potential, non-invasive detection method, which can use identified biomarkers to detect potential cancer in a patient's body. The aim of this study is to develop a machine-learning (ML) model based on chemical descriptors that will recognize CRC-associated metabolites. We selected a set of metabolites found as the biomarkers of CRC, confirmed that they participate in cancer-related pathways, and used them for training a machine-learning model for the diagnostics of CRC. Using a set of selective metabolites and random compounds, we developed a range of ML models. The best performing ML model trained on Stage 0-2 CRC metabolite data predicted a metabolite class with 89.55% accuracy. The best performing ML model trained on Stage 3-4 CRC metabolite data predicted a metabolite class with 95.21% accuracy. Lastly, the best-performing ML model trained on Stage 0-4 CRC metabolite data predicted a metabolite class with 93.04% accuracy. These models were then tested on independent datasets, including random and unrelated-disease metabolites. In addition, six pathways related to these CRC metabolites were also distinguished: aminoacyl-tRNA biosynthesis; glyoxylate and dicarboxylate metabolism; glycine, serine, and threonine metabolism; phenylalanine, tyrosine, and tryptophan biosynthesis; arginine biosynthesis; and alanine, aspartate, and glutamate metabolism. Thus, in this research study, we created machine-learning models based on metabolite-related descriptors that may be helpful in developing a non-invasive diagnosis method for CRC.
引用
收藏
页码:4133 / 4146
页数:14
相关论文
共 37 条
[1]   Serine and glycine metabolism in cancer [J].
Amelio, Ivano ;
Cutruzzola, Francesca ;
Antonov, Alexey ;
Agostini, Massimiliano ;
Melino, Gerry .
TRENDS IN BIOCHEMICAL SCIENCES, 2014, 39 (04) :191-198
[2]   Metabolic Profiling of Formalin-Fixed Paraffin-Embedded Tissues Discriminates Normal Colon from Colorectal Cancer [J].
Arima, Kota ;
Lau, Mai Chan ;
Zhao, Melissa ;
Haruki, Koichiro ;
Kosumi, Keisuke ;
Mima, Kosuke ;
Gu, Mancang ;
Vayrynen, Juha P. ;
Twombly, Tyler S. ;
Baba, Yoshifumi ;
Fujiyoshi, Kenji ;
Kishikawa, Junko ;
Guo, Chunguang ;
Baba, Hideo ;
Richards, William G. ;
Chan, Andrew T. ;
Nishihara, Reiko ;
Meyerhardt, Jeffrey A. ;
Nowak, Jonathan A. ;
Giannakis, Marios ;
Fuchs, Charles S. ;
Ogino, Shuji .
MOLECULAR CANCER RESEARCH, 2020, 18 (06) :883-890
[3]   PathBIX-a web server for network-based pathway annotation with adaptive null models [J].
Castresana-Aguirre, Miguel ;
Persson, Emma ;
Sonnhammer, Erik L. L. .
BIOINFORMATICS ADVANCES, 2021, 1 (01)
[4]   Connections between metabolism and epigenetics: mechanisms and novel anti-cancer strategy [J].
Chen, Chen ;
Wang, Zehua ;
Qin, Yanru .
FRONTIERS IN PHARMACOLOGY, 2022, 13
[5]  
Chirasani Venkat R, 2022, Curr Res Chem Biol, V2, DOI 10.1016/j.crchbi.2022.100035
[6]   KEGGSum: Summarizing Genomic Pathways [J].
David, Chaim ;
Kondylakis, Haridimos .
INFORMATION, 2024, 15 (01)
[7]   ChemDes: an integrated web-based platform for molecular descriptor and fingerprint computation [J].
Dong, Jie ;
Cao, Dong-Sheng ;
Miao, Hong-Yu ;
Liu, Shao ;
Deng, Bai-Chuan ;
Yun, Yong-Huan ;
Wang, Ning-Ning ;
Lu, Ai-Ping ;
Zeng, Wen-Bin ;
Chen, Alex F. .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[8]   Arginine Metabolism and Its Potential in Treatment of Colorectal Cancer [J].
Du, Tao ;
Han, Junyi .
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
[9]   Serum Metabolomics Study of Papillary Thyroid Carcinoma Based on HPLC-Q-TOF-MS/MS [J].
Du, Yang ;
Fan, Peizhi ;
Zou, Lianhong ;
Jiang, Yu ;
Gu, Xiaowen ;
Yu, Jie ;
Zhang, Chaojie .
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
[10]   Radiomics in prostate cancer: an up-to-date review [J].
Ferro, Matteo ;
de Cobelli, Ottavio ;
Musi, Gennaro ;
del Giudice, Francesco ;
Carrieri, Giuseppe ;
Busetto, Gian Maria ;
Falagario, Ugo Giovanni ;
Sciarra, Alessandro ;
Maggi, Martina ;
Crocetto, Felice ;
Barone, Biagio ;
Caputo, Vincenzo Francesco ;
Marchioni, Michele ;
Lucarelli, Giuseppe ;
Imbimbo, Ciro ;
Mistretta, Francesco Alessandro ;
Luzzago, Stefano ;
Vartolomei, Mihai Dorin ;
Cormio, Luigi ;
Autorino, Riccardo ;
Tataru, Octavian Sabin .
THERAPEUTIC ADVANCES IN UROLOGY, 2022, 14