Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

被引:38
作者
Al-Swailem, Abdulaziz M. [1 ]
Shehata, Maher M. [1 ]
Abu-Duhier, Faisel M. [1 ]
Al-Yamani, Essam J. [1 ]
Al-Busadah, Khalid A. [2 ]
Al-Arawi, Mohammed S. [1 ]
Al-Khider, Ali Y. [1 ]
Al-Muhaimeed, Abdullah N. [1 ]
Al-Qahtani, Fahad H. [1 ]
Manee, Manee M. [1 ]
Al-Shomrani, Badr M. [1 ]
Al-Qhtani, Saad M. [1 ]
Al-Harthi, Amer S. [1 ]
Akdemir, Kadir C. [3 ]
Inan, Mehmet S. [1 ]
Otu, Hasan H. [1 ,3 ]
机构
[1] King Abdulaziz City Sci & Technol, Biotechnol Res Ctr, Nat Resources & Environm Res Inst, Riyadh, Saudi Arabia
[2] King Faisal Univ, Fac Vet Med & Anim Resources, Al Hasa, Saudi Arabia
[3] Harvard Univ, Sch Med, Dept Med, BIDMC Genom Ctr, Boston, MA USA
关键词
IMMUNOGLOBULIN HEAVY-CHAIN; FATTY-ACID COMPOSITION; CONSTANT-REGION GENES; ALPHA-GENES; STRESS; ORGANIZATION; P27(KIP1); ALIGNMENT; GLUCOSE; BLOOD;
D O I
10.1371/journal.pone.0010720
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and similar to 40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism.
引用
收藏
页数:18
相关论文
共 50 条
[31]   Expression analysis of psychological stress-associated genes in peripheral blood leukocytes [J].
Morita, K ;
Saito, T ;
Ohta, M ;
Ohmori, T ;
Kawai, K ;
Teshima-Kondo, S ;
Rokutan, K .
NEUROSCIENCE LETTERS, 2005, 381 (1-2) :57-62
[32]  
Muyldermans S, 2001, J Biotechnol, V74, P277
[33]   Transcriptional regulation of the antioxidant response element - Activation by Nrf2 and repression by MafK [J].
Nguyen, T ;
Huang, HC ;
Pickett, CB .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (20) :15466-15473
[34]  
Parkinson John, 2009, V533, P1, DOI 10.1007/978-1-60327-136-3_1
[35]   TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets [J].
Pertea, G ;
Huang, XQ ;
Liang, F ;
Antonescu, V ;
Sultana, R ;
Karamycheva, S ;
Lee, Y ;
White, J ;
Cheung, F ;
Parvizi, B ;
Tsai, J ;
Quackenbush, J .
BIOINFORMATICS, 2003, 19 (05) :651-652
[36]   MEMBERS OF THE ZINC FINGER PROTEIN GENE FAMILY SHARING A CONSERVED N-TERMINAL MODULE [J].
ROSATI, M ;
MARINO, M ;
FRANZE, A ;
TRAMONTANO, A ;
GRIMALDI, G .
NUCLEIC ACIDS RESEARCH, 1991, 19 (20) :5661-5667
[37]   Oxidative stress in blood of camels (Camelus dromedaries) naturally infected with Trypanosoma evansi [J].
Saleh, Mostafa A. ;
Al-Salahy, M. Bassam ;
Sanousi, Samera A. .
VETERINARY PARASITOLOGY, 2009, 162 (3-4) :192-199
[38]  
Schmidt-Nielsen K., 1979, Desert animals: Physiological problems of heat and water
[39]   ORGANIZATION OF THE CONSTANT-REGION GENE FAMILY OF THE MOUSE IMMUNOGLOBULIN HEAVY-CHAIN [J].
SHIMIZU, A ;
TAKAHASHI, N ;
YAOITA, Y ;
HONJO, T .
CELL, 1982, 28 (03) :499-506
[40]  
SHIRAZIBEECHY SP, 1994, RUM PHYS DIG MET GRO