Delayed Comparison and Apriori Algorithm (DCAA): A Tool for Discovering Protein-Protein Interactions From Time-Series Phosphoproteomic Data

被引:5
|
作者
Ding, Lianhong [1 ]
Xie, Shaoshuai [2 ]
Zhang, Shucui [3 ,4 ]
Shen, Hangyu [5 ]
Zhong, Huaqiang [5 ]
Li, Daoyuan [2 ]
Shi, Peng [5 ]
Chi, Lianli [2 ]
Zhang, Qunye [3 ,4 ]
机构
[1] Beijing Wuzi Univ, Sch Informat, Beijing, Peoples R China
[2] Shandong Univ, Natl Glycoengn Res Ctr, Qingdao, Peoples R China
[3] Qilu Hosp Shandong Univ, Chinese Natl Hlth Commiss, Chinese Minist Educ, Key Lab Cardiovasc Remodeling & Funct Res, Jinan, Peoples R China
[4] Qilu Hosp Shandong Univ, Chinese Acad Med Sci, Jinan, Peoples R China
[5] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
protein– protein interactions; phosphoproteomics; delayed comparison; Apriori; DCAA; INTERACTION PREDICTION; PHOSPHORYLATION; MAP;
D O I
10.3389/fmolb.2020.606570
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Analysis of high-throughput omics data is one of the most important approaches for obtaining information regarding interactions between proteins/genes. Time-series omics data are a series of omics data points indexed in time order and normally contain more abundant information about the interactions between biological macromolecules than static omics data. In addition, phosphorylation is a key posttranslational modification (PTM) that is indicative of possible protein function changes in cellular processes. Analysis of time-series phosphoproteomic data should provide more meaningful information about protein interactions. However, although many algorithms, databases, and websites have been developed to analyze omics data, the tools dedicated to discovering molecular interactions from time-series omics data, especially from time-series phosphoproteomic data, are still scarce. Moreover, most reported tools ignore the lag between functional alterations and the corresponding changes in protein synthesis/PTM and are highly dependent on previous knowledge, resulting in high false-positive rates and difficulties in finding newly discovered protein-protein interactions (PPIs). Therefore, in the present study, we developed a new method to discover protein-protein interactions with the delayed comparison and Apriori algorithm (DCAA) to address the aforementioned problems. DCAA is based on the idea that there is a lag between functional alterations and the corresponding changes in protein synthesis/PTM. The Apriori algorithm was used to mine association rules from the relationships between items in a dataset and find PPIs based on time-series phosphoproteomic data. The advantage of DCAA is that it does not rely on previous knowledge and the PPI database. The analysis of actual time-series phosphoproteomic data showed that more than 68% of the protein interactions/regulatory relationships predicted by DCAA were accurate. As an analytical tool for PPIs that does not rely on a priori knowledge, DCAA should be useful to predict PPIs from time-series omics data, and this approach is not limited to phosphoproteomic data.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Application of Genetic Programming (GP) Formalism for Building Disease Predictive Models from Protein-Protein Interactions (PPI) Data
    Vyas, Renu
    Bapat, Sanket
    Goel, Purva
    Karthikeyan, Muthukumarasamy
    Tambe, Sanjeev S.
    Kulkarni, Bhaskar D.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (01) : 27 - 37
  • [42] Predicting Protein-Protein Interactions from Multimodal Biological Data Sources via Nonnegative Matrix Tri-Factorization
    Wang, Hua
    Huang, Heng
    Ding, Chris
    Nie, Feiping
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2013, 20 (04) : 344 - 358
  • [43] Inducing pairwise gene interactions from time-series data by EDA based Bayesian Network
    Dai, Chao
    Liu, Juan
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 7746 - 7749
  • [44] Uncertainty quantification of the effects of biotic interactions on community dynamics from nonlinear time-series data
    Cenci, Simone
    Saavedra, Serguei
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2018, 15 (147)
  • [45] On drilling speed of London Clay from MWD data with time-series algorithm for ground characterisation
    Wu, Si Yuan
    Yue, Wendal Victor
    Yue, Zhongqi Quentin
    GEOTECHNIQUE, 2023,
  • [46] A Frequency Domain Algorithm to Identify Recurrent Sedentary Behaviors from Activity Time-Series Data
    He, Qian
    Agu, Emmanuel O.
    2016 3RD IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, 2016, : 45 - 48
  • [47] A Boolean network inference from time-series gene expression data using a genetic algorithm
    Barman, Shohag
    Kwon, Yung-Keun
    BIOINFORMATICS, 2018, 34 (17) : 927 - 933
  • [48] Extraction Algorithm of Similar Parts from Multiple Time-Series Data of Cerebral Blood Flow
    Hiroyasu, Tomoyuki
    Fukushma, Arika
    Yamamoto, Utako
    BRAIN AND HEALTH INFORMATICS, 2013, 8211 : 138 - 146
  • [49] The detection of masstransport effects using SPRTool, a software environment for the analysis of protein-protein interactions from surface plasmon resonance data
    Ober, RJ
    Lai, XM
    Ward, ES
    BIOPHYSICAL JOURNAL, 2004, 86 (01) : 514A - 514A
  • [50] Discovering reliable protein interactions from high-throughput experimental data using network topology
    Chen, J
    Hsu, W
    Lee, ML
    Ng, SK
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 35 (1-2) : 37 - 47