The porcine translational research database: a manually curated, genomics and proteomics-based research resource

被引:42
作者
Dawson, Harry D. [1 ]
Chen, Celine [1 ]
Gaynor, Brady [2 ]
Shao, Jonathan [2 ]
Urban, Joseph F., Jr. [1 ]
机构
[1] ARS, USDA, Beltsville Human Nutr Res Ctr, Diet Genom & Immunol Lab, Beltsville, MD 20705 USA
[2] ARS, USDA, Beltsville Agr Res Ctr, Mol Plant Pathol Lab, Beltsville, MD 20705 USA
来源
BMC GENOMICS | 2017年 / 18卷
关键词
Porcine; Database; Comparative genomics; FUNCTIONAL-CHARACTERIZATION; TRANSPORTER FAMILY; EXPRESSION ATLAS; PIG; GENE; MODELS; IDENTIFICATION; CONSTRUCTION; NOMENCLATURE; PROTEINS;
D O I
10.1186/s12864-017-4009-7
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The use of swine in biomedical research has increased dramatically in the last decade. Diverse genomic- and proteomic databases have been developed to facilitate research using human and rodent models. Current porcine gene databases, however, lack the robust annotation to study pig models that are relevant to human studies and for comparative evaluation with rodent models. Furthermore, they contain a significant number of errors due to their primary reliance on machine-based annotation. To address these deficiencies, a comprehensive literature-based survey was conducted to identify certain selected genes that have demonstrated function in humans, mice or pigs. Results: The process identified 13,054 candidate human, bovine, mouse or rat genes/proteins used to select potential porcine homologs by searching multiple online sources of porcine gene information. The data in the Porcine Translational Research Database ((http://www.ars.usda.gov/Services/docs.htm?docid=6065) is supported by > 5800 references, and contains 65 data fields for each entry, including > 9700 full length (5' and 3') unambiguous pig sequences, > 2400 real time PCR assays and reactivity information on > 1700 antibodies. It also contains gene and/or protein expression data for > 2200 genes and identifies and corrects 8187 errors (gene duplications artifacts, mis-assemblies, mis-annotations, and incorrect species assignments) for 5337 porcine genes. Conclusions: This database is the largest manually curated database for any single veterinary species and is unique among porcine gene databases in regard to linking gene expression to gene function, identifying related gene pathways, and connecting data with other porcine gene databases. This database provides the first comprehensive description of three major Super-families or functionally related groups of proteins (Cluster of Differentiation (CD) Marker genes, Solute Carrier Superfamily, ATP binding Cassette Superfamily), and a comparative description of porcine microRNAs.
引用
收藏
页数:13
相关论文
共 59 条
  • [1] Evolutionary analysis of a cluster of ATP-binding cassette (ABC) genes
    Annilo, T
    Chen, ZQ
    Shulenin, S
    Dean, M
    [J]. MAMMALIAN GENOME, 2003, 14 (01) : 7 - 20
  • [2] Structured RNAs and synteny regions in the pig genome
    Anthon, Christian
    Tafer, Hakim
    Havgaard, Jakob H.
    Thomsen, Bo
    Hedegaard, Jakob
    Seemann, Stefan E.
    Pundhir, Sachin
    Kehr, Stephanie
    Bartschat, Sebastian
    Nielsen, Mathilde
    Nielsen, Rasmus O.
    Fredholm, Merete
    Stadler, Peter F.
    Gorodkin, Jan
    [J]. BMC GENOMICS, 2014, 15
  • [3] MicroRNA Buffering and Altered Variance of Gene Expression in Response to Salmonella Infection
    Bao, Hua
    Kommadath, Arun
    Plastow, Graham S.
    Tuggle, Christopher K.
    Guan, Le Luo
    Stothard, Paul
    [J]. PLOS ONE, 2014, 9 (04):
  • [4] From evolutionary genetics to human immunology: how selection shapes host defence genes
    Barreiro, Luis B.
    Quintana-Murci, Lluis
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (01) : 17 - 30
  • [5] The uncoupling protein 1 gene (UCP1) is disrupted in the pig lineage:: A genetic explanation for poor thermoregulation in piglets
    Berg, Frida
    Gustafson, Ulla
    Andersson, Leif
    [J]. PLOS GENETICS, 2006, 2 (08): : 1178 - 1181
  • [6] InnateDB: systems biology of innate immunity and beyond-recent updates and continuing curation
    Breuer, Karin
    Foroushani, Amir K.
    Laird, Matthew R.
    Chen, Carol
    Sribnaia, Anastasia
    Lo, Raymond
    Winsor, Geoffrey L.
    Hancock, Robert E. W.
    Brinkman, Fiona S. L.
    Lynn, David J.
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D1228 - D1233
  • [7] Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse
    Church, Deanna M.
    Goodstadt, Leo
    Hillier, LaDeana W.
    Zody, Michael C.
    Goldstein, Steve
    She, Xinwe
    Bult, Carol J.
    Agarwala, Richa
    Cherry, Joshua L.
    DiCuccio, Michael
    Hlavina, Wratko
    Kapustin, Yuri
    Meric, Peter
    Maglott, Donna
    Birtle, Zoe
    Marques, Ana C.
    Graves, Tina
    Zhou, Shiguo
    Teague, Brian
    Potamousis, Konstantinos
    Churas, Christopher
    Place, Michael
    Herschleb, Jill
    Runnheim, Ron
    Forrest, Daniel
    Amos-Landgraf, James
    Schwartz, David C.
    Cheng, Ze
    Lindblad-Toh, Kerstin
    Eichler, Evan E.
    Ponting, Chris P.
    [J]. PLOS BIOLOGY, 2009, 7 (05):
  • [8] Nomenclature of CD molecules from the Tenth Human Leucocyte Differentiation Antigen Workshop
    Clark, Georgina
    Stockinger, Hannes
    Balderas, Robert
    van Zelm, Menno C.
    Zola, Heddy
    Hart, Derek
    Engel, Pablo
    [J]. CLINICAL & TRANSLATIONAL IMMUNOLOGY, 2016, 5
  • [9] Finishing the euchromatic sequence of the human genome
    Collins, FS
    Lander, ES
    Rogers, J
    Waterston, RH
    [J]. NATURE, 2004, 431 (7011) : 931 - 945
  • [10] Siglecs and their roles in the immune system
    Crocker, Paul R.
    Paulson, James C.
    Varki, Ajit
    [J]. NATURE REVIEWS IMMUNOLOGY, 2007, 7 (04) : 255 - 266