Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment

被引:16
作者
Welker, F. [1 ,2 ]
机构
[1] Max Planck Inst Evolutionary Anthropol, Dept Human Evolut, Leipzig, Germany
[2] Univ Copenhagen, Nat Hist Museum Denmark, Copenhagen, Denmark
关键词
Palaeoproteomics; Error-tolerant proteomics; Bioinformatics experiment; Single amino acid polymorphisms; Hominidae; MASS-SPECTROMETRY; BRACHYLOPHOSAURUS-CANADENSIS; EVOLUTIONARY HISTORY; GENOME SEQUENCE; ANCIENT BONE; PROTEINS; PRESERVATION; SOFTWARE; SURVIVAL; REVEALS;
D O I
10.1186/s12862-018-1141-1
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from "cross-species proteomic effects": the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. Results: Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. Conclusions: The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (approximate to 90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts.
引用
收藏
页数:11
相关论文
共 46 条
[1]   Cross-species proteomics in analysis of mammalian sperm proteins [J].
Bayram, Helen L. ;
Claydon, Amy J. ;
Brownridge, Philip J. ;
Hurst, Jane L. ;
Mileham, Alan ;
Stockley, Paula ;
Beynon, Robert J. ;
Hammond, Dean E. .
JOURNAL OF PROTEOMICS, 2016, 135 :38-50
[2]  
Bern M, 2012, CURR PROTOC BIOINFOR, V40
[3]   Identification of a new hominin bone from Denisova Cave, Siberia using collagen fingerprinting and mitochondrial DNA analysis [J].
Brown, Samantha ;
Higham, Thomas ;
Slon, Viviane ;
Paeaebo, Svante ;
Meyer, Matthias ;
Douka, Katerina ;
Brock, Fiona ;
Comeskey, Daniel ;
Procopio, Noemi ;
Shunkov, Michael ;
Derevianko, Anatoly ;
Buckley, Michael .
SCIENTIFIC REPORTS, 2016, 6
[4]   Proteome degradation in ancient bone: Diagenesis and phylogenetic potential [J].
Buckley, M. ;
Wadsworth, C. .
PALAEOGEOGRAPHY PALAEOCLIMATOLOGY PALAEOECOLOGY, 2014, 416 :69-79
[5]   Collagen Sequence Analysis of the Extinct Giant Ground Sloths Lestodon and Megatherium (vol 10, e0139611, 2015) [J].
Buckley, Michael ;
Farina, Richard A. ;
Lawless, Craig ;
Tambusso, P. Sebastian ;
Varela, Luciano ;
Carlini, Alfredo A. ;
Powell, Jaime E. ;
Martinez, Jorge G. .
PLOS ONE, 2015, 10 (12)
[6]   Ancient collagen reveals evolutionary history of the endemic South American 'ungulates' [J].
Buckley, Michael .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2015, 282 (1806)
[7]   Unlocking Ancient Protein Palimpsests [J].
Cappellini, Enrico ;
Collins, Matthew J. ;
Gilbert, M. Thomas P. .
SCIENCE, 2014, 343 (6177) :1320-1322
[8]   Proteomic Analysis of a Pleistocene Mammoth Femur Reveals More than One Hundred Ancient Bone Proteins [J].
Cappellini, Enrico ;
Jensen, Lars J. ;
Szklarczyk, Damian ;
Ginolhac, Aurelien ;
da Fonseca, Rute A. R. ;
Stafford, Thomas W., Jr. ;
Holen, Steven R. ;
Collins, Matthew J. ;
Orlando, Ludovic ;
Willerslev, Eske ;
Gilbert, M. Thomas P. ;
Olsen, Jesper V. .
JOURNAL OF PROTEOME RESEARCH, 2012, 11 (02) :917-926
[9]   Patterns of coding variation in the complete exomes of three Neandertals [J].
Castellano, Sergi ;
Parra, Genis ;
Sanchez-Quinto, Federico A. ;
Racimo, Fernando ;
Kuhlwilm, Martin ;
Kircher, Martin ;
Sawyer, Susanna ;
Fu, Qiaomei ;
Heinze, Anja ;
Nickel, Birgit ;
Dabney, Jesse ;
Siebauer, Michael ;
White, Louise ;
Burbano, Hernan A. ;
Renaud, Gabriel ;
Stenzel, Udo ;
Lalueza-Fox, Carles ;
de la Rasilla, Marco ;
Rosas, Antonio ;
Rudan, Pavao ;
Brajkovic, Dejana ;
Kucan, Zeljko ;
Gusic, Ivan ;
Shunkov, Michael V. ;
Derevianko, Anatoli P. ;
Viola, Bence ;
Meyer, Matthias ;
Kelso, Janet ;
Andres, Aida M. ;
Paeaebo, Svante .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (18) :6666-6671
[10]   Peptide sequences from the first Castoroides ohioensis skull and the utility of old museum collections for palaeoproteomics [J].
Cleland, Timothy P. ;
Schroeter, Elena R. ;
Feranec, Robert S. ;
Vashishth, Deepak .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2016, 283 (1832)