The principled design of large-scale recursive neural network architectures-DAG-RNNs and the protein structure prediction problem

被引:57
作者
Baldi, P [1 ]
Pollastri, G [1 ]
机构
[1] Univ Calif Irvine, Sch Informat & Comp Sci, Inst Genom & Bioinformat, Irvine, CA 92697 USA
关键词
recursive neural networks; recurrent neural networks; directed acyclic graphs; graphical models; lateral processing; protein structure; contact maps; Bayesian networks;
D O I
10.1162/153244304773936054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a general methodology for the design of large-scale recursive neural network architectures (DAG-RNNs) which comprises three fundamental steps: (1) representation of a given domain using suitable directed acyclic graphs (DAGs) to connect visible and hidden node variables; (2) parameterization of the relationship between each variable and its parent variables by feedforward neural networks; and (3) application of weight-sharing within appropriate subsets of DAG connections to capture stationarity and control model complexity. Here we use these principles to derive several specific classes of DAG-RNN architectures based on lattices, trees, and other structured graphs. These architectures can process a wide range of data structures with variable sizes and dimensions. While the overall resulting models remain probabilistic, the internal deterministic dynamics allows efficient propagation of information, as well as training by gradient descent, in order to tackle large-scale problems. These methods are used here to derive state-of-the-art predictors for protein structural features such as secondary structure (1D) and both fine- and coarse-grained contact maps (2D). Extensions, relationships to graphical models, and implications for the design of neural architectures are briefly discussed. The protein prediction servers are available over the Web at: www.igb.uci.edu/tools.htm.
引用
收藏
页码:575 / 602
页数:28
相关论文
共 35 条
  • [1] Do aligned sequences share the same fold?
    Abagyan, RA
    Batalov, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (01) : 355 - 368
  • [2] GLOBAL FOLD DETERMINATION FROM A SMALL NUMBER OF DISTANCE RESTRAINTS
    ASZODI, A
    GRADWELL, MJ
    TAYLOR, WR
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1995, 251 (02) : 308 - 326
  • [3] Protein structure prediction and structural genomics
    Baker, D
    Sali, A
    [J]. SCIENCE, 2001, 294 (5540) : 93 - 96
  • [4] Exploiting the past and the future in protein secondary structure prediction
    Baldi, P
    Brunak, S
    Frasconi, P
    Soda, G
    Pollastri, G
    [J]. BIOINFORMATICS, 1999, 15 (11) : 937 - 946
  • [5] Hybrid modeling, HMM/NN architectures, and protein applications
    Baldi, P
    Chauvin, Y
    [J]. NEURAL COMPUTATION, 1996, 8 (07) : 1541 - 1565
  • [6] Baldi P., 2001, Bioinformatics: the machine learning approach
  • [7] BALDI P, 2002, IEEE INTELLIGENT SYS, V17
  • [8] Input-output HMM's for sequence processing
    Bengio, Y
    Frasconi, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (05): : 1231 - 1249
  • [9] Prediction of contact maps with neural networks and correlated mutations
    Fariselli, P
    Olmea, O
    Valencia, A
    Casadio, R
    [J]. PROTEIN ENGINEERING, 2001, 14 (11): : 835 - 843
  • [10] Fariselli P, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P146