Reconciling the numbers: ESTs versus protein-coding genes

Times cited: 13
Authors
Nekrutenko, A [1]
Affiliations
[1] Penn State Univ, Huck Inst Life Sci, Dept Biochem & Mol Biol, University Pk, PA 16802 USA
[2] Penn State Univ, Ctr Comparat Genomics & Bioinformat, University Pk, PA 16802 USA
Keywords
protein-coding genes; human; mouse; comparative genome analysis; ESTs
DOI
10.1093/molbev/msh125
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular biology]
Discipline classification code
071010 ; 081704 ;
Abstract
The number of expressed sequences greatly surpasses the estimated number of protein-coding genes in mammalian genomes. An evolutionary approach reveals that only 9% to 14% of human-expressed and mouse-expressed sequences are able to code for proteins. Clustering of these sequences using cross-species relationships suggests that millions of expressed sequences may correspond to only approximately 20,000 distinct protein-coding transcripts.
Pages: 1278-1282
Number of pages: 5