SFM: A novel sequence-based fusion method for disease genes identification and prioritization

被引:10
|
作者
Yousef, Abdulaziz [1 ]
Charkari, Nasrollah Moghadam [1 ]
机构
[1] Tarbiat Modares Univ, Fac Elect & Comp Engn, Tehran, Iran
关键词
Classification; Disease gene; Protein; Physicochemical properties of amino acid; Fusion method; PROTEIN-PROTEIN INTERACTIONS; PREDICTION; FEATURES; AUTOCORRELATION; CLASSIFICATION; SIMILARITY; SURFACE;
D O I
10.1016/j.jtbi.2015.07.010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The identification of disease genes from human genome is of great importance to improve diagnosis and treatment of disease. Several machine learning methods have been introduced to identify disease genes. However, these methods mostly differ in the prior knowledge used to construct the feature vector for each instance (gene), the ways of selecting negative data (non-disease genes) where there is no investigational approach to find them and the classification methods used to make the final decision. In this work, a novel Sequence-based fusion method (SFM) is proposed to identify disease genes. In this regard, unlike existing methods, instead of using a noisy and incomplete prior-knowledge, the amino acid sequence of the proteins which is universal data has been carried out to present the genes (proteins) into four different feature vectors. To select more likely negative data from candidate genes, the intersection set of four negative sets which are generated using distance approach is considered. Then, Decision Tree (C4.5) has been applied as a fusion method to combine the results of four independent state-of the-art predictors based on support vector machine (SVM) algorithm, and to make the final decision. The experimental results of the proposed method have been evaluated by some standard measures. The results indicate the precision, recall and F-measure of 82.6%, 85.6% and 84, respectively. These results confirm the efficiency and validity of the proposed method. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:12 / 19
页数:8
相关论文
共 50 条
  • [1] A Novel Candidate Disease Genes Prioritization Method Based on Module Partition and Rank Fusion
    Chen, Xing
    Yan, Gui-Ying
    Liao, Xiao-Ping
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2010, 14 (04) : 337 - 356
  • [2] Nucleotide sequence-based identification of a novel circovirus of canaries
    Todd, D
    Weston, J
    Ball, NW
    Borghmans, BJ
    Smyth, JA
    Gelmini, L
    Lavazza, A
    AVIAN PATHOLOGY, 2001, 30 (04) : 321 - 325
  • [3] An improved method for DNA sequence-based identification of nematodes
    Hanson, S.
    Solano, F.
    Thomas, S.
    Beacham, J.
    PHYTOPATHOLOGY, 2012, 102 (07) : 50 - 50
  • [4] The identification of three novel MICA alleles by sequence-based typing
    Quiroga, I
    Sweeney, D
    Sutton, PM
    Ahmad, T
    Walton, R
    Barnardo, MCNM
    Fuggle, SV
    TISSUE ANTIGENS, 2006, 67 (04): : 321 - 325
  • [5] Sequence-based prioritization of nonsynonymous single-nucleotide polymorphisms for the study of disease mutations
    Jiang, Rui
    Yang, Hua
    Zhou, Linqi
    Kuo, C.-C. Jay
    Sun, Fengzhu
    Chen, Ting
    AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (02) : 346 - 360
  • [6] Sequence-based identification of polyamine oxidase genes in Sorghum bicolor L.
    Ebeed, Heba T.
    El-Zonkorany, Mohamed
    Habib, Eman
    Ali, Esraa
    Zahran, Ahmed
    Ragab, Aya Hamdy
    GENOME, 2025, 68 : 1 - 5
  • [7] Reliability of Sequence-Based Identification of Microorganism
    Sung, Heungsup
    Kim, Mi-Na
    INFECTION AND CHEMOTHERAPY, 2008, 40 (06): : 355 - 356
  • [8] Identification of a novel allele MICA*008:05 by sequence-based typing
    Du, Z. -H.
    Jiao, S. -X.
    Du, B.
    Liu, X. -H.
    Wu, Z. -J.
    TISSUE ANTIGENS, 2013, 82 (03): : 217 - 218
  • [9] Identification of a novel allele HLA-A*9208 by sequence-based typing
    Yan, L. X.
    Zhu, F. M.
    Zhang, W.
    Xu, X. G.
    TISSUE ANTIGENS, 2007, 70 (02): : 170
  • [10] Sequence-based classification and identification of Fungi
    Hibbett, David
    Abarenkov, Kessy
    Koljalg, Urmas
    Opik, Maarja
    Chai, Benli
    Cole, James
    Wang, Qiong
    Crous, Pedro
    Robert, Vincent
    Helgason, Thorunn
    Herr, Joshua R.
    Kirk, Paul
    Lueschow, Shiloh
    O'Donnell, Kerry
    Nilsson, R. Henrik
    Oono, Ryoko
    Schoch, Conrad
    Smyth, Christopher
    Walker, Donald M.
    Porras-Alfaro, Andrea
    Taylor, John W.
    Geiser, David M.
    MYCOLOGIA, 2016, 108 (06) : 1049 - 1068