WEIGHTING ALIGNED PROTEIN OR NUCLEIC-ACID SEQUENCES TO CORRECT FOR UNEQUAL REPRESENTATION

Cited by: 62
Authors
SIBBALD, PR
ARGOS, P
Affiliation
[1] European Molecular Biology Laboratory, 6900 Heidelberg, Postfach 10 22 09
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
DOI
10.1016/S0022-2836(99)80003-5
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Aligned sequences from the same family (e.g. the haemoglobins) are seldom representative of the entire family. This is because (1) the sequence databases are heavily skewed toward a small number of organisms and (2) only a minute fraction of all the different family members have been sequenced. For many applications, such as using alignments or profiles to perform database searches for distantly related family members, such unequal representation requires correction. An algorithm to perform appropriate weighting of individual sequences is presented along with examples illustrating its efficacy. © 1990 Academic Press Limited.
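The abstract does not spell out the weighting algorithm. As a rough illustration of how down-weighting over-represented sequences can work, the sketch below uses a Monte Carlo nearest-sequence (Voronoi-style) scheme: random probe sequences are drawn column by column from the residues seen in the alignment, and each probe credits the aligned sequence closest to it. The function names, the uniform sampling rule, the Hamming distance, and the toy alignment are all assumptions made for illustration and are not taken from this record.

```python
import random


def hamming(a, b):
    """Number of aligned positions at which two sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def voronoi_weights(alignment, n_samples=5000, seed=0):
    """Estimate per-sequence weights by Monte Carlo sampling (illustrative sketch).

    Assumptions: all sequences have equal length, probes are drawn uniformly
    from the distinct residues observed in each column, and ties between
    equally close sequences are split evenly.
    """
    rng = random.Random(seed)
    n_cols = len(alignment[0])
    # Distinct residues observed in each column (sorted for reproducibility).
    columns = [sorted({seq[j] for seq in alignment}) for j in range(n_cols)]
    counts = [0.0] * len(alignment)
    for _ in range(n_samples):
        probe = [rng.choice(col) for col in columns]
        dists = [hamming(probe, seq) for seq in alignment]
        best = min(dists)
        nearest = [i for i, d in enumerate(dists) if d == best]
        # Split the probe's credit evenly among the closest sequences.
        for i in nearest:
            counts[i] += 1.0 / len(nearest)
    total = sum(counts)
    return [c / total for c in counts]


# Hypothetical toy alignment: three near-identical sequences and one outlier.
alignment = ["ACDEFGHIKL", "ACDEFGHIKL", "ACDEFGHIKM", "VWYTSRQPNM"]
weights = voronoi_weights(alignment)
for seq, w in zip(alignment, weights):
    print(seq, round(w, 3))
```

In this toy case the two identical sequences share their credit and so receive the smallest weights, while the divergent sequence receives the largest, which is the kind of correction for unequal representation the abstract calls for.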
Pages: 813-818
Page count: 6