Orphans and new gene origination, a structural and evolutionary perspective

Cited by: 21
Authors
Light, Sara [1 ,2 ]
Basile, Walter [1 ,2 ]
Elofsson, Arne [1 ,2 ,3 ]
Affiliations
[1] Stockholm Univ, Sci Life Lab, SE-17121 Solna, Sweden
[2] Stockholm Univ, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
[3] Swedish eSci Res Ctr SeRC, Stockholm, Sweden
Funding
Swedish Research Council
Keywords
DE-NOVO; SEGMENTAL DUPLICATIONS; EXPANSION; FAMILIES; PROTEINS; TRANSCRIPTION; SELECTION; SEQUENCE; DATABASE; DOMAINS;
DOI
10.1016/j.sbi.2014.05.006
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010 ; 081704 ;
Abstract
The frequency of de novo creation of proteins has been debated. Early on, it was assumed that de novo creation should be extremely rare and that the vast majority of all protein-coding genes were created early in the history of life. However, the early genomics era led to the insight that protein-coding genes do appear to be lineage-specific. Today, with thousands of completely sequenced genomes, this impression remains. It has even been proposed that the creation of novel genes, a continuous process in which most de novo genes are short-lived, is as frequent as gene duplication. There are reports with strongly indicative evidence for de novo gene emergence in many organisms, ranging from Bacteria, where new genes are sometimes generated through bacteriophages, to humans, where orphans appear to be overexpressed in brain and testis. In contrast, research on protein evolution indicates that many very distantly related proteins appear to share partial homology. Here, we discuss recent results on de novo gene emergence, as well as important technical challenges that limit our ability to reach a definite answer on the extent of de novo protein creation.
Pages: 73-83
Number of pages: 11