Prediction and Analysis of Hepatocellular Carcinoma Related Genes Using Gene Ontology and KEGG

被引:1
|
作者
Jiang, Min [1 ,2 ]
Li, Bi-Qing [3 ]
Huang, Tao [4 ]
Xu, Yao Chen [5 ]
Gu, Lei [6 ]
Kong, Xiang Yin [1 ,2 ]
机构
[1] Chinese Acad Sci, Shanghai Inst Biol Sci, Inst Hlth Sci, Shanghai 200031, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Med, Shanghai 200031, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Biol Sci, Key Lab Syst Biol, Shanghai 200031, Peoples R China
[4] Mt Sinai Sch Med, Dept Genet & Genom Sci, New York, NY USA
[5] E China Normal Univ, Inst Software Engn, Shanghai 200062, Peoples R China
[6] German Canc Res Ctr, Div Theoret Bioinformat BO80, Grp Computat Oncol, D-69120 Heidelberg, Germany
基金
中国国家自然科学基金;
关键词
Gene ontology; hepatocellular carcinoma (HCC); incremental feature selection (IFS); KEGG; maximum relevance minimum redundancy (mRMR); random forest (RF); HEPATITIS-B; EXPRESSION; DATABASE; IDENTIFICATION; RELEVANCE; PROTEINS; PATHWAY; GROWTH; SITES; CELLS;
D O I
10.2174/157489361001150309131453
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hepatocellular carcinoma (HCC) is the most common type of liver cancer worldwide and mostly occurs in viral hepatitis endemic areas such as China. Knowledge of HCC-related genes may lead to an early detection of HCC and develop molecularly targeted therapeutics, reducing mortality and improving a patient's prognosis significantly. Therefore, it is valuable and important for us to identify common characters of HCC related genes. In this study, we proposed a computational method to predict HCC related genes based on Gene Ontology terms and KEGG terms using Random Forest (RF), in which features were optimized by maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). 224 HCC gene candidates were compiled from some databases, while 11,200non-HCC gene candidates were randomly selected from Ensemble database. 10 candidate datasets were constructed by dividing non-HCC gene candidates into 10 groups. Each gene in datasets was encoded by 13,126 features including 12,887 Gene Ontology enrichment scores and 239 KEGG enrichment scores. Finally, an optimal feature set including 615 GO terms and 11 KEGG pathways was discovered. Through analysis, we found these features were closely related to HCC, which means our method is effective for discovering HCC related genes, and it is hopeful that it can also be used to predict and analyze genes for other types of cancer.
引用
收藏
页码:31 / 38
页数:8
相关论文
共 50 条
  • [1] Prediction and Analysis of Retinoblastoma Related Genes through Gene Ontology and KEGG
    Li, Zhen
    Li, Bi-Qing
    Jiang, Min
    Chen, Lei
    Zhang, Jian
    Liu, Lin
    Huang, Tao
    BIOMED RESEARCH INTERNATIONAL, 2013, 2013
  • [2] The use of Gene Ontology terms and KEGG pathways for analysis and prediction of oncogenes
    Xing, Zhihao
    Chu, Chen
    Chen, Lei
    Kong, Xiangyin
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2016, 1860 (11): : 2725 - 2734
  • [3] Analysis of cancer-related IncRNAs using gene ontology and KEGG pathways
    Chen, Lei
    Zhang, Yu-Hang
    Lu, Guohui
    Huang, Tao
    Cai, Yu-Dong
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 76 : 27 - 36
  • [4] Feature Classification and Analysis of Lung Cancer Related Genes Through Gene Ontology and KEGG Pathways
    Zhou, You
    Li, Biqing
    Zhang, Yuchao
    Chen, Lei
    Kong, Xiangyin
    CURRENT BIOINFORMATICS, 2016, 11 (01) : 40 - 50
  • [5] Prediction of Colorectal Cancer Related Genes Based on Gene Ontology
    Li, Bi-Qing
    Huang, Guo-Hua
    Huang, Tao
    Feng, Kai-Yan
    Liu, Lei
    Cai, Yu-Dong
    CURRENT BIOINFORMATICS, 2015, 10 (01) : 22 - 30
  • [6] Analysis of Protein-Protein Functional Associations by Using Gene Ontology and KEGG Pathway
    Yuan, Fei
    Pan, Xiaoyong
    Chen, Lei
    Zhang, Yu-Hang
    Huang, Tao
    Cai, Yu-Dong
    BIOMED RESEARCH INTERNATIONAL, 2019, 2019
  • [7] Analysis and prediction of protein stability based on interaction network, gene ontology, and KEGG pathway enrichment scores
    Huang, Feiming
    Fu, Minfei
    Li, JiaRui
    Chen, Lei
    Feng, KaiYan
    Huang, Tao
    Cai, Yu-Dong
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2023, 1871 (03):
  • [8] Analysis of key pathways and genes in nodal structure on rat skin surface using gene ontology and KEGG pathway
    Shin, Joonyoung
    Park, A. Yeong
    Ju, Suk
    Lee, Hyorin
    Kang, Hyung Won
    Han, Dongwoon
    Kim, Sungchul
    GENES & GENOMICS, 2025, 47 (01) : 71 - 85
  • [9] Prediction and analysis of weighted genes in hepatocellular carcinoma using bioinformatics analysis
    Zhang, Qifan
    Sun, Shibo
    Zhu, Chen
    Zheng, Yujian
    Cai, Qing
    Liang, Xiaolu
    Xie, Haorong
    Zhou, Jie
    MOLECULAR MEDICINE REPORTS, 2019, 19 (04) : 2479 - 2488
  • [10] The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach
    Hindumathi, V.
    Kranthi, T.
    Rao, S. B.
    Manimaran, P.
    MOLECULAR BIOSYSTEMS, 2014, 10 (06) : 1450 - 1460