Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes


Authors:

Corentin Meyer

Nicolas Scalzitti

Anne Jeannin-Girardon

Pierre Collet

Olivier Poch

Julie D. Thompson

Affiliation:

[1] University of Strasbourg, Department of Computer Science, ICube, CNRS

Source:

BMC Bioinformatics, vol. 21

Keywords:

Genome annotation; Primates; Gene prediction; Protein sequence errors; Error correction
