Delayed Comparison and Apriori Algorithm (DCAA): A Tool for Discovering Protein-Protein Interactions From Time-Series Phosphoproteomic Data

被引:5
|
作者
Ding, Lianhong [1 ]
Xie, Shaoshuai [2 ]
Zhang, Shucui [3 ,4 ]
Shen, Hangyu [5 ]
Zhong, Huaqiang [5 ]
Li, Daoyuan [2 ]
Shi, Peng [5 ]
Chi, Lianli [2 ]
Zhang, Qunye [3 ,4 ]
机构
[1] Beijing Wuzi Univ, Sch Informat, Beijing, Peoples R China
[2] Shandong Univ, Natl Glycoengn Res Ctr, Qingdao, Peoples R China
[3] Qilu Hosp Shandong Univ, Chinese Natl Hlth Commiss, Chinese Minist Educ, Key Lab Cardiovasc Remodeling & Funct Res, Jinan, Peoples R China
[4] Qilu Hosp Shandong Univ, Chinese Acad Med Sci, Jinan, Peoples R China
[5] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
protein– protein interactions; phosphoproteomics; delayed comparison; Apriori; DCAA; INTERACTION PREDICTION; PHOSPHORYLATION; MAP;
D O I
10.3389/fmolb.2020.606570
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Analysis of high-throughput omics data is one of the most important approaches for obtaining information regarding interactions between proteins/genes. Time-series omics data are a series of omics data points indexed in time order and normally contain more abundant information about the interactions between biological macromolecules than static omics data. In addition, phosphorylation is a key posttranslational modification (PTM) that is indicative of possible protein function changes in cellular processes. Analysis of time-series phosphoproteomic data should provide more meaningful information about protein interactions. However, although many algorithms, databases, and websites have been developed to analyze omics data, the tools dedicated to discovering molecular interactions from time-series omics data, especially from time-series phosphoproteomic data, are still scarce. Moreover, most reported tools ignore the lag between functional alterations and the corresponding changes in protein synthesis/PTM and are highly dependent on previous knowledge, resulting in high false-positive rates and difficulties in finding newly discovered protein-protein interactions (PPIs). Therefore, in the present study, we developed a new method to discover protein-protein interactions with the delayed comparison and Apriori algorithm (DCAA) to address the aforementioned problems. DCAA is based on the idea that there is a lag between functional alterations and the corresponding changes in protein synthesis/PTM. The Apriori algorithm was used to mine association rules from the relationships between items in a dataset and find PPIs based on time-series phosphoproteomic data. The advantage of DCAA is that it does not rely on previous knowledge and the PPI database. The analysis of actual time-series phosphoproteomic data showed that more than 68% of the protein interactions/regulatory relationships predicted by DCAA were accurate. As an analytical tool for PPIs that does not rely on a priori knowledge, DCAA should be useful to predict PPIs from time-series omics data, and this approach is not limited to phosphoproteomic data.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] MSCA: A spectral comparison algorithm between time series to identify protein-protein interactions
    Universidad del Quindío, Gepamol, Carrera 15 Calle 12N, Armenia, Colombia
    不详
    460, Colombia
    BMC Bioinform., 1
  • [2] MSCA: a spectral comparison algorithm between time series to identify protein-protein interactions
    Ailan F Arenas
    Gladys E Salcedo
    Andrey M Montoya
    Jorge E Gomez-Marin
    BMC Bioinformatics, 16
  • [3] MSCA: a spectral comparison algorithm between time series to identify protein-protein interactions
    Arenas, Ailan F.
    Salcedo, Gladys E.
    Montoya, Andrey M.
    Gomez-Marin, Jorge E.
    BMC BIOINFORMATICS, 2015, 16
  • [4] Discovering protein complexes from protein-protein interaction data by local cluster detecting algorithm
    Liu, Juan
    Liu, Bin
    Li, Deyi
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 4, PROCEEDINGS, 2007, : 280 - +
  • [5] Discovering patterns to extract protein-protein interactions from full texts
    Huang, ML
    Zhu, XY
    Hao, Y
    Payan, DG
    Qu, KB
    Li, M
    BIOINFORMATICS, 2004, 20 (18) : 3604 - 3612
  • [6] Discovering Protein Complexes from Protein-Protein Interaction Data by Dense Subgraph
    LIU Bin1
    2. State Key Laboratory of Software Engineering
    Wuhan University Journal of Natural Sciences, 2011, 16 (01) : 64 - 68
  • [7] Discovering ecosystem models from time-series data
    George, D
    Saito, K
    Langley, P
    Bay, S
    Arrigo, KR
    DISCOVERY SCIENCE, PROCEEDINGS, 2003, 2843 : 141 - 152
  • [8] Discovering patterns to extract protein-protein interactions from the literature: Part II
    Hao, Y
    Zhu, XY
    Huang, ML
    Li, M
    BIOINFORMATICS, 2005, 21 (15) : 3294 - 3300
  • [9] Discovering novel protein-protein interactions by measuring the protein semantic similarity from the biomedical literature
    Chiang, Jung-Hsien
    Ju, Jiun-Huang
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2014, 12 (06)
  • [10] Uncovering the rules for protein-protein interactions from yeast genomic data
    Wang, Jin
    Li, Chunhe
    Wang, Erkang
    Wang, Xidi
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (10) : 3752 - 3757