Controversies in modern evolutionary biology: the imperative for error detection and quality control

Cited by: 26
Authors
Prosdocimi, Francisco [1, 2]
Linard, Benjamin [1]
Pontarotti, Pierre [3]
Poch, Olivier [1]
Thompson, Julie D. [1]
Affiliations
[1] Univ Strasbourg, Dept Integrated Struct Biol, IGBMC, CNRS,INSERM, F-67404 Illkirch Graffenstaden, France
[2] Univ Fed Rio de Janeiro, Dept Med Biochem, BR-21941902 Rio De Janeiro, Brazil
[3] Univ Aix Marseille 1, UMR CNRS Evolut Biol & Modelisat 6632, F-13331 Marseille, France
Keywords
gene duplication; asymmetric evolution; gene prediction; error detection; quality control; MULTIPLE ALIGNMENT; GENES; IDENTIFICATION; PHYLOGENOMICS; DUPLICATIONS; ORTHOLOGS; ONTOLOGY; PAIRWISE; PROGRESS; GENOMES;
DOI
10.1186/1471-2164-13-5
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: The data from high throughput genomics technologies provide unique opportunities for studies of complex biological systems, but also pose many new challenges. The shift to the genome scale in evolutionary biology, for example, has led to many interesting, but often controversial studies. It has been suggested that part of the conflict may be due to errors in the initial sequences. Most gene sequences are predicted by bioinformatics programs and a number of quality issues have been raised, concerning DNA sequencing errors or badly predicted coding regions, particularly in eukaryotes.
Results: We investigated the impact of these errors on evolutionary studies and specifically on the identification of important genetic events. We focused on the detection of asymmetric evolution after duplication, which has been the subject of controversy recently. Using the human genome as a reference, we established a reliable set of 688 duplicated genes in 13 complete vertebrate genomes, where significantly different evolutionary rates are observed. We estimated the rates at which protein sequence errors occur and are accumulated in the higher-level analyses. We showed that the majority of the detected events (57%) are in fact artifacts due to the putative erroneous sequences and that these artifacts are sufficient to mask the true functional significance of the events.
Conclusions: Initial errors are accumulated throughout the evolutionary analysis, generating artificially high rates of event predictions and leading to substantial uncertainty in the conclusions. This study emphasizes the urgent need for error detection and quality control strategies in order to efficiently extract knowledge from the new genome data.
Pages: 16