Stochastic k-Tree Grammar and Its Application in Biomolecular Structure Modeling

被引:0
|
作者
Ding, Liang [1 ]
Samad, Abdul [1 ]
Xue, Xingran [1 ]
Huang, Xiuzhen [4 ]
Malmberg, Russell L. [2 ,3 ]
Cai, Liming [1 ,2 ]
机构
[1] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA
[2] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[3] Univ Georgia, Dept Plant Biol, Athens, GA 30602 USA
[4] Arkansas State Univ, Dept Comp Sci, Jonesboro, AR 72467 USA
关键词
stochastic grammar; context-sensitive language; k-tree; dynamic programming; biomolecule; RNA tertiary structure; SECONDARY STRUCTURE PREDICTION; RNA; ALGORITHMS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stochastic context-free grammar (SCFG) has been successful in modeling biomolecular structures, typically RNA secondary structure, for statistical analysis and structure prediction. Context-free grammar rules specify parallel and nested co-occurren-ces of terminals, and thus are ideal for modeling nucleotide canonical base pairs that constitute the RNA secondary structure. Stochastic grammars have been sought, which may adequately model biomolecular tertiary structures that are beyond context-free. Some of the existing linguistic grammars, developed mostly for natural language processing, appear insufficient to account for crossing relationships incurred by distant interactions of bio-residues, while others are overly powerful and cause excessive computational complexity. This paper introduces a novel stochastic grammar, called stochastic k-tree grammar (SkTG), for the analysis of context-sensitive languages. With the new grammar rules, co-occurrences of distant terminals are characterized and recursively organized into k-tree graphs. The new grammar offers a viable approach to modeling context-sensitive interactions between bioresidues because such relationships are often constrained by k-trees, for small values of k, as demonstrated by earlier investigations. In this paper it is shown, for the first time, that probabilistic analysis of k-trees over strings are computable in polynomial time n(O(k)). Hence, SkTG permits not only modeling of biomolecular tertiary structures but also efficient analysis and prediction of such structures.
引用
收藏
页码:308 / 322
页数:15
相关论文
共 50 条
  • [1] Clustering with Random Indexing K-tree and XML Structure
    De Vries, Christopher M.
    Geva, Shlomo
    De Vine, Lance
    FOCUSED RETRIEVAL AND EVALUATION, 2010, 6203 : 407 - 415
  • [2] Stochastic textures modeling and its application in texture structure decomposition
    Khawaled, Samah
    Zeevi, Yehoshua Y.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 107
  • [3] Efficient computation of the convex hull on sets of points stored in a k-tree compact data structure
    Felipe Castro, Juan
    Romero, Miguel
    Gutierrez, Gilberto
    Caniupan, Monica
    Quijada-Fuentes, Carlos
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (10) : 4091 - 4111
  • [4] Efficient computation of the convex hull on sets of points stored in a k-tree compact data structure
    Juan Felipe Castro
    Miguel Romero
    Gilberto Gutiérrez
    Mónica Caniupán
    Carlos Quijada-Fuentes
    Knowledge and Information Systems, 2020, 62 : 4091 - 4111
  • [5] Smoothing techniques for tree-k-grammar-based natural language modeling
    Verdú-Mas, JL
    Calera-Rubio, J
    Carrasco, RC
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2003, 2652 : 1057 - 1065
  • [6] Modeling the Electrochemical Response of Mesoporous Materials Toward Its Application to Biomolecular Detection
    Peinetti, Ana Sol
    Gonzalez, Graciela A.
    Battaglini, Fernando
    ELECTROANALYSIS, 2010, 22 (12) : 1329 - 1336
  • [7] Data-based stochastic modeling of tree growth and structure formation
    Potapov, Ilya
    Jarvenpaa, Marko
    Akerblom, Markku
    Raumonen, Pasi
    Kaasalainen, Mikko
    SILVA FENNICA, 2016, 50 (01)
  • [8] Bayesian Synchronous Tree-Substitution Grammar Induction and its Application to Sentence Compression
    Yamangil, Elif
    Shieber, Stuart M.
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 937 - 947
  • [9] MATHEMATIZATION OF GRAMMAR AND ITS APPLICATION TO THE VERBAL MORPHOLOGICAL STRUCTURE OF THE BASQUE LANGUAGE
    Domene Verdu, Jose Fernando
    ESTUDIOS DE LINGUISTICA-UNIVERSIDAD DE ALICANTE-ELUA, 2007, (21):
  • [10] Stable foliage cluster (FC), a basic unit of the crown structure of tree species and its application to Modeling of tree and forest structure: Configuration of the FC model
    Sumida, Akihiro
    Umeki, Kiyoshi
    PMA 2006: SECOND INTERNATIONAL SYMPOSIUM ON PLANT GROWTH MODELING, SIMULATION, VISUALIZATION AND APPLICATIONS, PROCEEDINGS, 2007, : 51 - +