A Review of Computational Approaches to Predict Gene Functions

被引:6
作者
Loh, Swee Kuan [1 ]
Low, Swee Thing [1 ]
Chai, Lian En [1 ]
Chan, Weng Howe [1 ]
Mohamad, Mohd Saberi [2 ,3 ]
Deris, Safaai [2 ,3 ]
Ibrahim, Zuwairie [4 ]
Kasim, Shahreen [5 ]
Shah, Zuraini Ali [1 ]
Jamil, Hamimah Mohd [1 ]
Zakaria, Zalmiyah [1 ]
Napis, Suhaimi [6 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Artificial Intelligence & Bioinformat Res Grp, Skudai 81310, Johor, Malaysia
[2] Univ Malaysia Kelantan, Inst Artificial Intelligence & Big Data, City Campus, Kota Baharu 16100, Kelantan, Malaysia
[3] Univ Malaysia Kelantan, Fac Bioengn & Technol, Jeli Campus,Lock Bag 100, Jeli, Kelantan, Malaysia
[4] Univ Malaysia Pahang, Fac Elect & Elect Engn, Pekan 26600, Pahang, Malaysia
[5] Univ Tun Hussein Onn, Fac Comp Sci & Informat Technol, Batu Pahat 86400, Johor, Malaysia
[6] Univ Putra Malaysia, Fac Biotechnol & Biomol Sci, Dept Cell & Mol Biol, Upm Serdang 43400, Selangor, Malaysia
关键词
Artificial intelligence; gene function; functional prediction; classifier; computational biology; HETEROGENEOUS DATA SOURCES; PROTEIN FUNCTION; EXPRESSION; CLASSIFICATION; INTEGRATION; ENSEMBLES; SEQUENCE; MODEL;
D O I
10.2174/1574893612666171002113742
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recently, novel high-throughput biotechnologies have provided rich data about different genomes. However, manual annotation of gene function is time consuming. It is also very expensive and infeasible for the growing amounts of data. At present there are numerous functions in certain species that remain unknown or only partially known. Hence, the use of computational approaches to predicting gene function is becoming widespread. Computational approaches are time saving and less costly. Prediction analysis provided can be used in hypotheses to drive the biological validation of gene function. Objective: This paper reviews computational approaches such as the support vector machine, clustering, hierarchical ensemble and network-based approaches. Methods: Comparisons between these approaches are also made in the discussion portion. Results: In addition, the advantages and disadvantages of these computational approaches are discussed. Conclusion: With the emergence of omics data, the focus should be continued on integrating newly added data for gene functions prediction field.
引用
收藏
页码:373 / 386
页数:14
相关论文
共 77 条
  • [1] Allmer J, 2014, METHODS MOL BIOL, V1107, P157, DOI 10.1007/978-1-62703-748-8_9
  • [2] [Anonymous], 2005, FINDING GROUPS DATA, DOI DOI 10.1002/9780470316801
  • [3] Arkova O.V., 2016, RUSS J GENET APPL RE, V6, P785, DOI [10.1134/S2079059716070029, DOI 10.1134/S2079059716070029]
  • [4] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [5] Parkinson's disease prediction using gene expression - A projection based learning meta-cognitive neural classifier approach
    Babu, G. Sateesh
    Suresh, S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1519 - 1529
  • [6] A New Exon Derived from a Mammalian Apparent LTR Retrotransposon of the SUPT16H Gene
    Bae, Min-In
    Kim, Yun-Ji
    Lee, Ja-Rang
    Jung, Yi-Deun
    Kim, Heui-Soo
    [J]. INTERNATIONAL JOURNAL OF GENOMICS, 2013, 2013
  • [7] Hierarchical multi-label prediction of gene function
    Barutcuoglu, Z
    Schapire, RE
    Troyanskaya, OG
    [J]. BIOINFORMATICS, 2006, 22 (07) : 830 - 836
  • [8] Basler Georg, 2015, Methods Mol Biol, V1279, P183, DOI 10.1007/978-1-4939-2398-4_12
  • [9] Benhui C, 2010, 2010 2 WORLD C NAT B
  • [10] The advantage of functional prediction based on clustering of yeast genes and its correlation with non-sequence based classifications
    Bilu, Y
    Linial, M
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) : 193 - 210