Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes

Authors
Corentin Meyer
Nicolas Scalzitti
Anne Jeannin-Girardon
Pierre Collet
Olivier Poch
Julie D. Thompson
Affiliation
[1] University of Strasbourg, Department of Computer Science, ICube, CNRS
Source
BMC Bioinformatics, vol. 21
Keywords
Genome annotation; Primates; Gene prediction; Protein sequence errors; Error correction
DOI
Not available