APPRIS: annotation of principal and alternative splice isoforms

被引:156
作者
Manuel Rodriguez, Jose [1 ]
Maietta, Paolo [2 ]
Ezkurdia, Iakes [2 ]
Pietrelli, Alessandro [1 ]
Wesselink, Jan-Jaap [2 ]
Lopez, Gonzalo [2 ]
Valencia, Alfonso [1 ,2 ]
Tress, Michael L. [2 ]
机构
[1] Spanish Natl Canc Res Ctr CNIO, Spanish Natl Bioinformat Inst INB, Madrid 28029, Spain
[2] Spanish Natl Canc Res Ctr CNIO, Struct Biol & Biocomp Programme, Madrid 28029, Spain
关键词
PROTEIN ISOFORMS; DATABASE; PREDICTION; PROTEOMICS; RNA; SEQUENCES; ALIGNMENT; TOPOLOGY; GENOME; DECAY;
D O I
10.1093/nar/gks1058
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. APPRIS has been designed to provide value to manual annotations of the human genome by adding reliable protein structural and functional data and information from cross-species conservation. The visual representation of the annotations provided by APPRIS for each gene allows annotators and researchers alike to easily identify functional changes brought about by splicing events. In addition to collecting, integrating and analyzing reliable predictions of the effect of splicing events, APPRIS also selects a single reference sequence for each gene, here termed the principal isoform, based on the annotations of structure, function and conservation for each transcript. APPRIS identifies a principal isoform for 85% of the protein-coding genes in the GENCODE 7 release for ENSEMBL. Analysis of the APPRIS data shows that at least 70% of the alternative (non-principal) variants would lose important functional or structural information relative to the principal isoform.
引用
收藏
页码:D110 / D117
页数:8
相关论文
共 58 条
[1]   Global analysis of exon creation versus loss and the role of alternative splicing in 17 vertebrate genomes [J].
Alekseyenko, Alexander V. ;
Kim, Namshin ;
Lee, Christopher J. .
RNA, 2007, 13 (05) :661-670
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Reorganizing the protein space at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Antunes, Ricardo ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bower, Lawrence ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Da Silva, Alan ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Fazzini, Francesco ;
Fedotov, Alexander ;
Garavelli, John ;
Castro, Leyla Garcia ;
Gardner, Michael ;
Hieta, Reija ;
Huntley, Rachael ;
Jacobsen, Julius ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pontikos, Nikolas ;
Pundir, Sangya ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Wardell, Tony ;
Watkins, Xavier ;
Corbett, Matt ;
Donnelly, Mike ;
van Rensburg, Pieter ;
Goujon, Mickael ;
McWilliam, Hamish ;
Lopez, Rodrigo ;
Xenarios, Ioannis ;
Bougueleret, Lydie ;
Bridge, Alan ;
Poux, Sylvain ;
Redaschi, Nicole .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D71-D75
[4]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[5]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[6]   ProSAS:: a database for analyzing alternative splicing in the context of protein structures [J].
Birzele, Fabian ;
Kueffner, Robert ;
Meier, Franziska ;
Oefinger, Florian ;
Potthast, Christian ;
Zimmer, Ralf .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D63-D68
[7]   Protein diversity from alternative splicing: A challenge for bioinformatics and post-genome biology [J].
Black, DL .
CELL, 2000, 103 (03) :367-370
[8]   Aligning multiple genomic sequences with the threaded blockset aligner [J].
Blanchette, M ;
Kent, WJ ;
Riemer, C ;
Elnitski, L ;
Smit, AFA ;
Roskin, KM ;
Baertsch, R ;
Rosenbloom, K ;
Clawson, H ;
Green, ED ;
Haussler, D ;
Miller, W .
GENOME RESEARCH, 2004, 14 (04) :708-715
[9]   Shotgun proteomics aids discovery of novel protein-coding genes, alternative splicing, and "resurrected" pseudogenes in the mouse genome [J].
Brosch, Markus ;
Saunders, Gary I. ;
Frankish, Adam ;
Collins, Mark O. ;
Yu, Lu ;
Wright, James ;
Verstraten, Ruth ;
Adams, David J. ;
Harrow, Jennifer ;
Choudhary, Jyoti S. ;
Hubbard, Tim .
GENOME RESEARCH, 2011, 21 (05) :756-767
[10]   Discovery and revision of Arabidopsis genes by proteogenomics [J].
Castellana, Natalie E. ;
Payne, Samuel H. ;
Shen, Zhouxin ;
Stanke, Mario ;
Bafna, Vineet ;
Briggs, Steven P. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (52) :21034-21038