The principled design of large-scale recursive neural network architectures-DAG-RNNs and the protein structure prediction problem

被引:57
作者
Baldi, P [1 ]
Pollastri, G [1 ]
机构
[1] Univ Calif Irvine, Sch Informat & Comp Sci, Inst Genom & Bioinformat, Irvine, CA 92697 USA
关键词
recursive neural networks; recurrent neural networks; directed acyclic graphs; graphical models; lateral processing; protein structure; contact maps; Bayesian networks;
D O I
10.1162/153244304773936054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a general methodology for the design of large-scale recursive neural network architectures (DAG-RNNs) which comprises three fundamental steps: (1) representation of a given domain using suitable directed acyclic graphs (DAGs) to connect visible and hidden node variables; (2) parameterization of the relationship between each variable and its parent variables by feedforward neural networks; and (3) application of weight-sharing within appropriate subsets of DAG connections to capture stationarity and control model complexity. Here we use these principles to derive several specific classes of DAG-RNN architectures based on lattices, trees, and other structured graphs. These architectures can process a wide range of data structures with variable sizes and dimensions. While the overall resulting models remain probabilistic, the internal deterministic dynamics allows efficient propagation of information, as well as training by gradient descent, in order to tackle large-scale problems. These methods are used here to derive state-of-the-art predictors for protein structural features such as secondary structure (1D) and both fine- and coarse-grained contact maps (2D). Extensions, relationships to graphical models, and implications for the design of neural architectures are briefly discussed. The protein prediction servers are available over the Web at: www.igb.uci.edu/tools.htm.
引用
收藏
页码:575 / 602
页数:28
相关论文
共 35 条
  • [11] A neural network based predictor of residue contacts in proteins
    Fariselli, P
    Casadio, R
    [J]. PROTEIN ENGINEERING, 1999, 12 (01): : 15 - 21
  • [12] A general framework for adaptive processing of data structures
    Frasconi, P
    Gori, M
    Sperduti, A
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (05): : 768 - 786
  • [13] CORRELATED MUTATIONS AND RESIDUE CONTACTS IN PROTEINS
    GOBEL, U
    SANDER, C
    SCHNEIDER, R
    VALENCIA, A
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 18 (04): : 309 - 317
  • [14] GOLLER C, 1997, THESIS TU MUNICH
  • [15] Gorodkin J, 1999, Proc Int Conf Intell Syst Mol Biol, P95
  • [16] HOBOHM U, 1992, PROTEIN SCI, V1, P409
  • [17] DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES
    KABSCH, W
    SANDER, C
    [J]. BIOPOLYMERS, 1983, 22 (12) : 2577 - 2637
  • [18] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [19] Lesk AM, 2001, PROTEINS, P98
  • [20] Protein distance constraints predicted by neural networks and probability density functions
    Lund, O
    Frimand, K
    Gorodkin, J
    Bohr, H
    Bohr, J
    Hansen, J
    Brunak, S
    [J]. PROTEIN ENGINEERING, 1997, 10 (11): : 1241 - 1248