Identifying errors in avian influenza virus gene sequences and implications for data usage of public databases

被引:4
|
作者
Li, Jinling [1 ]
Dohna, Heinrich Zu
Miller, Joy [2 ]
Cardona, Carol J.
Carpenter, Tim E.
机构
[1] Univ Calif Davis, Ctr Anim Dis Modeling & Surveillance, Sch Vet Med, Davis, CA 95616 USA
[2] Natl Ctr Med Intelligence, Ft Detrick, MD USA
关键词
Avian influenza virus; Sequence; Database; Hemagglutinin; Neuraminidase; A VIRUS; AMINO-ACID; HEMAGGLUTININ; SUBTYPE; CHINA; POULTRY; ORIGIN;
D O I
10.1016/j.ygeno.2009.09.005
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Public gene sequence databases have become important research tools to understand viruses and other organisms. Evidence Suggests that the identifying information for some of the sequences in these databases might not belong to the sequences they are associated with. We developed two tests to conduct a comprehensive analysis of all published sequences of the hemaglutinin and neuramidase genes of avian influenza viruses (AIVs) to identify sequences that may have been misclassified. One test identified sequence pairs with highly similar nucleotide sequences despite a difference of several years between their sampling dates. Another test, which was applied to samples sequenced and deposited more than once, detected sequences with more nucleotide differences to their own than to their closest relatives. All sequences identified as misclassified were further traced to relevant publications to assess the likelihood of contamination and determine if any conclusions were associated with the use of these sequences. Our results suggested that among 4040 published gene sequences examined, approximately 0.8% might be misclassified and that publications using these sequences may include inaccurate statements. Findings from this report suggest that using laboratory-adapted strains and handling multiple samples simultaneously increases the risk of contamination. The tests reported here may be useful for screening new submissions to public sequence databases. Published by Elsevier Inc.
引用
收藏
页码:29 / 36
页数:8
相关论文
共 50 条
  • [1] TreeGeneBrowser: phylogenetic data mining of gene sequences from public databases
    Jakobsen, IB
    Saleeba, JA
    Poidinger, M
    Littlejohn, TG
    BIOINFORMATICS, 2001, 17 (06) : 535 - 540
  • [2] An evaluation of errors in the mitochondrial COI sequences of Hydrachnidia (Acari, Parasitengona) in public databases
    Pelaez, Maria L.
    Horreo, Jose L.
    Garcia-Jimenez, Ricardo
    Valdecasas, Antonio G.
    EXPERIMENTAL AND APPLIED ACAROLOGY, 2022, 86 (03) : 371 - 384
  • [3] Implications of Public Understanding of Avian Influenza for Fostering Effective Risk Communication
    Elledge, Brenda L.
    Brand, Michael
    Regens, James L.
    Boatright, Daniel T.
    HEALTH PROMOTION PRACTICE, 2008, 9 (04) : 54S - 59S
  • [4] Construction and expression of avian influenza virus HA gene eukaryotic vectors
    Zhang, QZ
    Qin, XM
    Liang, R
    Deng, MJ
    He, HX
    Zhang, YM
    Zheng, CX
    Duan, MX
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2003, 30 (03) : 483 - 487
  • [5] Myxovirus resistance (Mx) Gene Diversity in Avian Influenza Virus Infections
    Alam, Jahangir
    Rahman, Md Mostafizer
    Halder, Joyanta
    Islam, Md Rezuanul
    Sarkar, Nandini
    Jabeen, Ishrat
    Hossain, Mridha Md Kamal
    Rubaya, Rubaya
    Alim, Md Abdul
    Bhuyan, Anjuman Ara
    Jahan, Nusrat
    Rahman, Md Masudur
    Ashour, Hossam M.
    BIOMEDICINES, 2022, 10 (11)
  • [6] Surveillance and control measures of avian influenza in birds.: Implications for public health
    Rodriguez, Alejandro Arteaga
    Izquierdo, Mercedes Pilar
    Moros, Maria Jose Sierra
    Heras, Carmen Amela
    REVISTA ESPANOLA DE SALUD PUBLICA, 2006, 80 (06): : 621 - 630
  • [7] The effect of avian influenza virus NS1 allele on virus replication and innate gene expression in avian cells
    Adams, Sean
    Xing, Zheng
    Li, Jinling
    Mendoza, Kristelle
    Perez, Daniel
    Reed, Kent
    Cardona, Carol
    MOLECULAR IMMUNOLOGY, 2013, 56 (04) : 358 - 368
  • [8] Sequences in influenza A virus PB2 protein that determine productive infection for an avian influenza virus in mouse and human cell lines
    Yao, YX
    Mingay, LJ
    McCauley, JW
    Barclay, WS
    JOURNAL OF VIROLOGY, 2001, 75 (11) : 5410 - 5415
  • [9] Data mining of DNA sequences submitted by Peruvian institutions to public genetic databases
    Eduardo Romero, Pedro
    Castillo-Vilcahuaman, Camila
    REVISTA PERUANA DE BIOLOGIA, 2021, 28 (01):
  • [10] Gene transfer mediated by influenza virus peptides: The role of peptide sequences
    Mechtler, K
    Wagner, E
    NEW JOURNAL OF CHEMISTRY, 1997, 21 (01) : 105 - 111