CHANCE AND STATISTICAL SIGNIFICANCE IN PROTEIN AND DNA-SEQUENCE ANALYSIS

Cited by: 142
Authors
KARLIN, S
BRENDEL, V
Affiliation
[1] Department of Mathematics, Stanford University, Stanford
DOI
10.1126/science.1621093
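Chinese Library Classification (CLC)
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Subject classification codes
07; 0710; 09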
Abstract
Statistical approaches help in the determination of significant configurations in protein and nucleic acid sequence data. Three recent statistical methods are discussed: (i) score-based sequence analysis that provides a means for characterizing anomalies in local sequence text and for evaluating sequence comparisons; (ii) quantile distributions of amino acid usage that reveal general compositional biases in proteins and evolutionary relations; and (iii) r-scan statistics that can be applied to the analysis of spacings of sequence markers.
Pages: 39-49
Number of pages: 11