Data mining methods for protein-protein interactions

被引:0
|
作者
Nafar, Zahra [1 ]
Golshani, Ashkan [2 ]
机构
[1] Carleton Univ, Fac Sci, Ottawa, ON K1S 5B6, Canada
[2] Carleton Univ, Dept Biol, Ottawa, ON K1S 5B6, Canada
关键词
bioinformatics; data mining; protein-protein interaction; protein interaction network; system biology; genomics; functional proteomics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, recent bioinformatics methods using data mining techniques are presented to analyze protein-protein interaction data gathered from recent large-scale biological studies. Novel approaches are suggested to tackle some of the challenges in this area. Protein-protein interaction data can provide a wealth of information to better understand the biology of a cell. The analysis of these interactions is also important for the discovery of disease-associated proteins. The data can also he used for the identification of novel cellular sites that are crucial for the development of new and improved pharmaceutical drugs. Knowledge Discovery and Data Mining (KDD) is the process of extracting implicit information from large amounts of data using mathematical and statistical methods. It grows in synergy with computer technology, creating new analytical tools and using them for knowledge discovery in large volume of data. A multidisciplinary science and technology with links in statistics, machine learning, data base systems, and computer programming and visualization, KDD has proved to be a promising solution to various problems in molecular biology, and gene analysis. An overview of various data mining techniques is presented in this paper with specific examples of their applications in protein-protein interaction data analysis. While some of the most widely used data mining techniques for exploring protein interaction data sets are clustering (including supervised and unsupervised), classification and association rule discovery, others are based on methods for mining interaction information from scientific sources such as PubMed and MedLine. There are areas such as prediction and profiling that have not been explored much for mining information in protein-protein interactions. We propose methods to employ these novel techniques to analyze protein-protein interaction data.
引用
收藏
页码:2090 / +
页数:2
相关论文
共 50 条
  • [1] DAPPER: a data-mining resource for protein-protein interactions
    Haider, Syed
    Lipinszki, Zoltan
    Przewloka, Marcin R.
    Ladak, Yaseen
    D'Avino, Pier Paolo
    Kimata, Yuu
    Lio, Pietro
    Glover, David M.
    BIODATA MINING, 2015, 8
  • [2] DAPPER: a data-mining resource for protein-protein interactions
    Syed Haider
    Zoltan Lipinszki
    Marcin R. Przewloka
    Yaseen Ladak
    Pier Paolo D’Avino
    Yuu Kimata
    Pietro Lio’
    David M. Glover
    BioData Mining, 8
  • [3] Mining new protein-protein interactions
    Mamitsuka, H
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2005, 24 (03): : 103 - 108
  • [4] Mining from protein-protein interactions
    Mamitsuka, Hiroshi
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (05) : 400 - 410
  • [5] Mining literature for protein-protein interactions
    Marcotte, EM
    Xenarios, I
    Eisenberg, D
    BIOINFORMATICS, 2001, 17 (04) : 359 - 363
  • [6] Mining protein-protein interaction data
    Haasl, Ryan J.
    Fang, Jianwen
    CURRENT BIOINFORMATICS, 2006, 1 (02) : 197 - 205
  • [7] Predicting protein-protein interactions by association mining
    Kotlyar, M
    Jurisica, I
    INFORMATION SYSTEMS FRONTIERS, 2006, 8 (01) : 37 - 46
  • [8] Predicting Protein-Protein Interactions by Association Mining
    Information Systems Frontiers, 2006, 8 : 37 - 47
  • [9] Methods of study of protein-protein interactions
    Wildova, Marcela
    Rumlova, Michaela
    CHEMICKE LISTY, 2008, 102 (01): : 28 - 34
  • [10] Efficient mining from heterogeneous data sets for predicting protein-protein interactions
    Mamitsuka, H
    14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 32 - 36