Delayed Comparison and Apriori Algorithm (DCAA): A Tool for Discovering Protein-Protein Interactions From Time-Series Phosphoproteomic Data

Cited by: 5
Authors
Ding, Lianhong [1 ]
Xie, Shaoshuai [2 ]
Zhang, Shucui [3 ,4 ]
Shen, Hangyu [5 ]
Zhong, Huaqiang [5 ]
Li, Daoyuan [2 ]
Shi, Peng [5 ]
Chi, Lianli [2 ]
Zhang, Qunye [3 ,4 ]
Affiliations
[1] Beijing Wuzi Univ, Sch Informat, Beijing, Peoples R China
[2] Shandong Univ, Natl Glycoengn Res Ctr, Qingdao, Peoples R China
[3] Qilu Hosp Shandong Univ, Chinese Natl Hlth Commiss, Chinese Minist Educ, Key Lab Cardiovasc Remodeling & Funct Res, Jinan, Peoples R China
[4] Qilu Hosp Shandong Univ, Chinese Acad Med Sci, Jinan, Peoples R China
[5] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
protein-protein interactions; phosphoproteomics; delayed comparison; Apriori; DCAA; INTERACTION PREDICTION; PHOSPHORYLATION; MAP
DOI
10.3389/fmolb.2020.606570
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010 ; 081704 ;
Abstract
Analysis of high-throughput omics data is one of the most important approaches for obtaining information about interactions between proteins/genes. Time-series omics data are omics data points indexed in time order, and they normally contain more information about the interactions between biological macromolecules than static omics data. In addition, phosphorylation is a key posttranslational modification (PTM) that indicates possible changes in protein function during cellular processes. Analysis of time-series phosphoproteomic data should therefore provide more meaningful information about protein interactions. However, although many algorithms, databases, and websites have been developed to analyze omics data, tools dedicated to discovering molecular interactions from time-series omics data, especially time-series phosphoproteomic data, are still scarce. Moreover, most reported tools ignore the lag between functional alterations and the corresponding changes in protein synthesis/PTM and depend heavily on prior knowledge, which results in high false-positive rates and makes it difficult to discover novel protein-protein interactions (PPIs). To address these problems, in the present study we developed a new method for discovering PPIs, the delayed comparison and Apriori algorithm (DCAA). DCAA is based on the idea that there is a lag between functional alterations and the corresponding changes in protein synthesis/PTM. The Apriori algorithm is used to mine association rules from the relationships between items in a dataset and thereby find PPIs in time-series phosphoproteomic data. The advantage of DCAA is that it does not rely on prior knowledge or on PPI databases. Analysis of actual time-series phosphoproteomic data showed that more than 68% of the protein interactions/regulatory relationships predicted by DCAA were accurate. As an analytical tool for PPIs that does not rely on a priori knowledge, DCAA should be useful for predicting PPIs from time-series omics data, and the approach is not limited to phosphoproteomic data.
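The two ideas named in the abstract, comparing one protein's changes against another protein's changes a few time points later and then mining association rules in the Apriori style, can be illustrated with a small Python sketch. This is not the authors' implementation: the abstract does not specify how the data are discretized, how the lag is chosen, or which thresholds are used, so the up/down trend encoding, the fixed `lag`, the function names (`discretize`, `delayed_transactions`, `mine_rules`), and the support and confidence cutoffs below are all assumptions made for illustration.

```python
def discretize(series, tol=0.0):
    """Encode a time series as trends between consecutive points:
    +1 (up), -1 (down), 0 (flat). The encoding is an assumption."""
    trends = []
    for prev, cur in zip(series, series[1:]):
        diff = cur - prev
        trends.append(1 if diff > tol else -1 if diff < -tol else 0)
    return trends


def delayed_transactions(trends, lag):
    """Build one transaction per time step t. Each protein contributes a
    'lead' item (its trend at t) and a 'follow' item (its trend at t + lag).
    Pairing lead and follow items is the delayed comparison (assumed form)."""
    n_steps = len(next(iter(trends.values())))
    transactions = []
    for t in range(n_steps - lag):
        items = set()
        for name, tr in trends.items():
            if tr[t] != 0:
                items.add((name, "lead", tr[t]))
            if tr[t + lag] != 0:
                items.add((name, "follow", tr[t + lag]))
        transactions.append(items)
    return transactions


def mine_rules(transactions, min_support, min_confidence):
    """Apriori-style mining restricted to rules 'lead item -> follow item'.
    Pairs are only formed from items that are frequent on their own."""
    n = len(transactions)
    counts = {}
    for tx in transactions:
        for item in tx:
            counts[item] = counts.get(item, 0) + 1
    # Level 1: keep frequent single items (Apriori pruning step).
    frequent = [i for i, c in counts.items() if c / n >= min_support]
    leads = [i for i in frequent if i[1] == "lead"]
    follows = [i for i in frequent if i[1] == "follow"]
    rules = {}
    # Level 2: count co-occurrence of each lead/follow pair.
    for lead in leads:
        for follow in follows:
            if lead[0] == follow[0]:
                continue  # skip a protein paired with itself
            both = sum(1 for tx in transactions if lead in tx and follow in tx)
            support = both / n
            confidence = both / counts[lead]
            if support >= min_support and confidence >= min_confidence:
                rules[(lead[0], lead[2], follow[0], follow[2])] = (support, confidence)
    return rules


# Toy data (invented): protein B repeats protein A's pattern one step later.
trends = {
    "A": discretize([1, 2, 3, 2, 1, 2, 3, 2, 1]),
    "B": discretize([5, 1, 2, 3, 2, 1, 2, 3, 2]),
}
rules = mine_rules(delayed_transactions(trends, lag=1), 0.3, 0.9)
for rule, (support, confidence) in sorted(rules.items()):
    print(rule, round(support, 3), round(confidence, 3))
```

In this toy example the rule "A rises, then B rises one step later" is recovered with confidence 1.0. Because the toy series are periodic, other lagged rules also pass the thresholds, which shows why candidate rules from such a procedure still need validation.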
Pages: 12