Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method

被引:37
作者
Goktepe, Yunus Emre [1 ]
Kodaz, Halife [2 ]
机构
[1] Necmettin Erbakan Univ, Seydisehir Vocat High Sch, Seydisehir, Konya, Turkey
[2] Selcuk Univ, Program Comp Sci, Dept Comp Engn, Fac Engn, Konya, Turkey
关键词
Protein-protein interactions; Sequence-based prediction; Feature extraction; Conjoint triads; Principal component analyses; Support vector machines; AMINO-ACID-COMPOSITION; STRUCTURAL CLASS; FEATURES; DATABASE; FUSION; LOCALIZATION; CONSERVATION; HYPERPLANES; MACHINE;
D O I
10.1016/j.neucom.2018.03.062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Proteins and their interactions play a key role in the realization of all cellular biological activities of organisms. Therefore, prediction of protein-protein interactions is crucial for elucidating biological processes. Experimental studies are inadequate for some reasons such as the time required to reveal interactions, the fact that it is an expensive way and the number of yet unknown interactions is too great. Thus, a number of computational methods have been developed to predict protein-protein interactions. Generally, many of these methods that produce good results cannot be used without additional biological information such as protein domains, protein structural information, gene neighborhoods, gene expressions, and phylogenetic profiles. Therefore, there is a need for computational methods that can successfully predict interactions using only protein sequences. In this study, we present a novel sequence-based computational model. We applied a new technique called weighted skip-sequential conjoint triads in the proposed method. The results of this research were evaluated on generally used databases and demonstrated its success in this field. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:68 / 74
页数:7
相关论文
共 52 条
  • [1] BIND - The Biomolecular Interaction Network Database
    Bader, GD
    Donaldson, I
    Wolting, C
    Ouellette, BFF
    Pawson, T
    Hogue, CWV
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 242 - 245
  • [2] Whole-proteome interaction mining
    Bock, JR
    Gough, DA
    [J]. BIOINFORMATICS, 2003, 19 (01) : 125 - 134
  • [3] The use of the area under the roc curve in the evaluation of machine learning algorithms
    Bradley, AP
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1145 - 1159
  • [4] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [5] Prediction of protein cellular attributes using pseudo-amino acid composition
    Chou, KC
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03): : 246 - 255
  • [6] Protein-protein interaction prediction by combined analysis of genomic and conservation information
    Emamjomeh, Abbasali
    Goliaei, Bahram
    Torkamani, Ali
    Ebrahimpour, Reza
    Mohammadi, Nima
    Parsian, Ahmad
    [J]. GENES & GENETIC SYSTEMS, 2014, 89 (06) : 259 - 272
  • [7] A NOVEL GENETIC SYSTEM TO DETECT PROTEIN PROTEIN INTERACTIONS
    FIELDS, S
    SONG, OK
    [J]. NATURE, 1989, 340 (6230) : 245 - 246
  • [8] Functional organization of the yeast proteome by systematic analysis of protein complexes
    Gavin, AC
    Bösche, M
    Krause, R
    Grandi, P
    Marzioch, M
    Bauer, A
    Schultz, J
    Rick, JM
    Michon, AM
    Cruciat, CM
    Remor, M
    Höfert, C
    Schelder, M
    Brajenovic, M
    Ruffner, H
    Merino, A
    Klein, K
    Hudak, M
    Dickson, D
    Rudi, T
    Gnau, V
    Bauch, A
    Bastuck, S
    Huhse, B
    Leutwein, C
    Heurtier, MA
    Copley, RR
    Edelmann, A
    Querfurth, E
    Rybin, V
    Drewes, G
    Raida, M
    Bouwmeester, T
    Bork, P
    Seraphin, B
    Kuster, B
    Neubauer, G
    Superti-Furga, G
    [J]. NATURE, 2002, 415 (6868) : 141 - 147
  • [9] Predicting protein-protein interactions by weighted pseudo amino acid composition
    Goktepe, Yunus Emre
    Ilhan, Ilhan
    Kahramanli, Sirzat
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 15 (03) : 272 - 290
  • [10] PROFILE ANALYSIS - DETECTION OF DISTANTLY RELATED PROTEINS
    GRIBSKOV, M
    MCLACHLAN, AD
    EISENBERG, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (13) : 4355 - 4358