Sequence-specific recognition of nucleic-acid motifs is critical to many cellular processes. We have developed a new and general method called Neighborhood Inference (NI) that predicts sequences with activity in regulating a biochemical process based on the local density of known sites in sequence space. Applied to the problem of RNA splicing regulation, NI was used to predict hundreds of new exonic splicing enhancer (ESE) and silencer (ESS) hexanucleotides from known human ESEs and ESSs. These predictions were supported by cross-validation analysis, by analysis of published splicing regulatory activity data, by sequence-conservation analysis, and by measurement of the splicing regulatory activity of 24 novel predicted ESEs, ESSs, and neutral sequences using an in vivo splicing reporter assay. These results demonstrate the ability of NI to accurately predict splicing regulatory activity and show that the scope of exonic splicing regulatory elements is substantially larger than previously anticipated. Analysis of orthologous exons in four mammals showed that the NI score of ESEs, a measure of function, is much more highly conserved above background than ESE primary sequence. This observation indicates a high degree of selection for ESE activity in mammalian exons, with surprisingly frequent interchangeability between ESE sequences.
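The idea of scoring a sequence by the local density of known sites in sequence space can be illustrated with a minimal sketch. This is not the published NI scoring function, which the abstract does not specify. The sketch assumes that "neighborhood" means Hamming distance at most `radius`, that known ESEs contribute positively and known ESSs negatively, and that each contribution is weighted by `1 / (1 + distance)`. The function names, the weighting, and the toy hexamers are all hypothetical.

```python
from itertools import product


def hamming(a: str, b: str) -> int:
    """Number of mismatched positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def neighborhood_score(seq, enhancers, silencers, radius=1):
    """Illustrative density score (assumed form, not the published NI formula).

    Known enhancers within `radius` mismatches add to the score and known
    silencers subtract from it, each weighted by 1 / (1 + distance).
    """
    score = 0.0
    for known_sites, sign in ((enhancers, 1.0), (silencers, -1.0)):
        for site in known_sites:
            d = hamming(seq, site)
            if d <= radius:
                score += sign / (1 + d)
    return score


# Toy training sets, invented for illustration only (not real ESE/ESS data).
toy_ese = {"GAAGAA", "GAAGAT", "GAAGAC"}
toy_ess = {"TTTGGG", "TTAGGG"}

# Score every possible hexanucleotide (4**6 = 4096 sequences).
scores = {
    "".join(h): neighborhood_score("".join(h), toy_ese, toy_ess)
    for h in product("ACGT", repeat=6)
}

print(scores["GAAGAG"])  # one mismatch from all three toy enhancers
print(scores["TTCGGG"])  # one mismatch from both toy silencers
print(scores["CCCCCC"])  # no known site within the radius
```

In this toy setting, a hexamer that sits in a dense cluster of known enhancers receives a positive score even though it is not itself in the training set, which is the intuition behind predicting new elements from known ones.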
Affiliations:
Univ Calif Los Angeles, Howard Hughes Med Inst, Dept Microbiol Immunol & Mol Genet, Los Angeles, CA 90095 USA
Penn State Univ, Ctr Comparat Genom & Bioinformat, University Pk, PA 16802 USA
Univ Toronto, Charles H Best Inst, Dept Med Res, Toronto, ON M5G 1L6, Canada
Blanchette, M; Kent, WJ; Riemer, C; Elnitski, L; Smit, AFA; Roskin, KM; Baertsch, R; Rosenbloom, K; Clawson, H; Green, ED; Haussler, D; Miller, W