Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways

Cited by: 313
Authors
Chen, Lei [1 ,2 ]
Zhang, Yu-Hang [3 ]
Wang, ShaoPeng [1 ]
Zhang, YunHua [4 ]
Huang, Tao [3 ]
Cai, Yu-Dong [1 ]
Affiliations
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Shanghai Maritime Univ, Coll Informat Engn, Shanghai, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Biol Sci, Inst Hlth Sci, Shanghai, Peoples R China
[4] Anhui Agr Univ, Sch Resources & Environm, Anhui Prov Key Lab Farmland Ecol Conversat & Poll, Hefei, Anhui, Peoples R China
Source
PLOS ONE | 2017 / Vol. 12 / Issue 09
Funding
National Natural Science Foundation of China; Natural Science Foundation of Shanghai
Keywords
CHRONIC LYMPHOCYTIC-LEUKEMIA; MESSENGER-RNA EXPRESSION; ACUTE LYMPHOBLASTIC-LEUKEMIA; ACUTE MYELOID-LEUKEMIA; AMINO-ACID TRANSPORTER; BACILLUS-SUBTILIS; FEATURE-SELECTION; ESCHERICHIA-COLI; BINDING PROTEIN; RIBOSOMAL-RNAS;
DOI
10.1371/journal.pone.0184129
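Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Subject classification codes
07; 0710; 09
Abstract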
Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, uncovering the links between core functions or pathways and these essential genes can provide deeper insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted the gene ontology (GO) terms and biological pathways that are important for determining essential genes. Using the enrichment theory of GO and KEGG pathways, we encoded each essential or non-essential gene as a vector in which each component represents the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) method was adopted. Then, incremental feature selection (IFS) and a support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways; it yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, in distinguishing essential from non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes that our prediction model predicted to be essential are reported in this study. We suggest that this study provides more functional and pathway information on essential genes and offers a new way to investigate related problems.
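The abstract reports model quality as a Matthews correlation coefficient (MCC). As a minimal sketch of how that metric is computed for a binary classifier (here assuming essential = 1 and non-essential = 0), the standard MCC formula can be written as below. The function names and the toy labels are illustrative assumptions and are not taken from the paper.

```python
import math


def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels, where 1 = essential (assumed coding)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def matthews_corrcoef(tp, tn, fp, fn):
    """Standard MCC: (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))."""
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0.0 when any marginal total is zero.
    return numerator / denominator if denominator else 0.0


# Toy labels, purely illustrative (not data from the study).
y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 1]
toy_mcc = matthews_corrcoef(*confusion_counts(y_true, y_pred))
```

MCC ranges from -1 (total disagreement) through 0 (no better than chance) to 1 (perfect prediction), and it remains informative when the two classes differ in size, which is a common reason to prefer it over plain accuracy.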
Pages: 22