An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora

被引:61
作者
Mondego, Jorge M. C. [2 ]
Vidal, Ramon O. [1 ,3 ]
Carazzolle, Marcelo F. [1 ,4 ]
Tokuda, Eric K. [1 ]
Parizzi, Lucas P. [1 ]
Costa, Gustavo G. L. [1 ]
Pereira, Luiz F. P. [5 ]
Andrade, Alan C. [6 ]
Colombo, Carlos A. [2 ]
Vieira, Luiz G. E. [7 ]
Pereira, Goncalo A. G. [1 ]
机构
[1] Univ Estadual Campinas, Lab Genom & Expressao, Dept Genet Evolucao & Bioagentes, Inst Biol, BR-13083970 Campinas, SP, Brazil
[2] Inst Agron Estado Sao Paulo, Ctr Recursos Genet Vegetais, BR-13001970 Campinas, SP, Brazil
[3] Lab Nacl Biociencias LNBio, BR-13083970 Campinas, SP, Brazil
[4] Univ Estadual Campinas, Ctr Nacl Processamento Alto Desempenho Sao Paulo, BR-13083970 Campinas, SP, Brazil
[5] Embrapa Cafe Inst Agron Parana, Lab Biotecnol Vegetal, BR-86001970 Londrina, PR, Brazil
[6] Embrapa Recursos Genet & Biotecnol, Nucleo Biotecnol NTBio, BR-70770900 Brasilia, DF, Brazil
[7] Inst Agron Parana, Lab Biotecnol Vegetal, BR-86001970 Londrina, PR, Brazil
来源
BMC PLANT BIOLOGY | 2011年 / 11卷
基金
巴西圣保罗研究基金会;
关键词
CELLULOSE SYNTHASE-LIKE; NBS-LRR PROTEINS; MOLECULAR CHARACTERIZATION; SOMATIC EMBRYOGENESIS; FUNCTIONAL-ANALYSIS; GENOMIC ANALYSIS; PEPTIDE-HORMONE; SEQUENCE TAGS; PLANT; L;
D O I
10.1186/1471-2229-11-30
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Background: Coffee is one of the world's most important crops; it is consumed worldwide and plays a significant role in the economy of producing countries. Coffea arabica and C. canephora are responsible for 70 and 30% of commercial production, respectively. C. arabica is an allotetraploid from a recent hybridization of the diploid species, C. canephora and C. eugenioides. C. arabica has lower genetic diversity and results in a higher quality beverage than C. canephora. Research initiatives have been launched to produce genomic and transcriptomic data about Coffea spp. as a strategy to improve breeding efficiency. Results: Assembling the expressed sequence tags (ESTs) of C. arabica and C. canephora produced by the Brazilian Coffee Genome Project and the Nestle-Cornell Consortium revealed 32,007 clusters of C. arabica and 16,665 clusters of C. canephora. We detected different GC3 profiles between these species that are related to their genome structure and mating system. BLAST analysis revealed similarities between coffee and grape (Vitis vinifera) genes. Using KA/KS analysis, we identified coffee genes under purifying and positive selection. Protein domain and gene ontology analyses suggested differences between Coffea spp. data, mainly in relation to complex sugar synthases and nucleotide binding proteins. OrthoMCL was used to identify specific and prevalent coffee protein families when compared to five other plant species. Among the interesting families annotated are new cystatins, glycine-rich proteins and RALF-like peptides. Hierarchical clustering was used to independently group C. arabica and C. canephora expression clusters according to expression data extracted from EST libraries, resulting in the identification of differentially expressed genes. Based on these results, we emphasize gene annotation and discuss plant defenses, abiotic stress and cup quality-related functional categories. Conclusion: We present the first comprehensive genome-wide transcript profile study of C. arabica and C. canephora, which can be freely assessed by the scientific community at http://www.lge.ibi.unicamp.br/ coffea. Our data reveal the presence of species-specific/prevalent genes in coffee that may help to explain particular characteristics of these two crops. The identification of differentially expressed transcripts offers a starting point for the correlation between gene expression profiles and Coffea spp. developmental traits, providing valuable insights for coffee breeding and biotechnology, especially concerning sugar metabolism and stress tolerance.
引用
收藏
页数:22
相关论文
共 136 条
  • [51] Efficient digestion and structural characteristics of cell walls of coffee beans
    Kasai, Naoya
    Konishi, Ayako
    Iwai, Kazuya
    Maeda, Gou
    [J]. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2006, 54 (17) : 6336 - 6342
  • [52] Phylogenomic analysis of the cystatin superfamily in eukaryotes and prokaryotes
    Kordis, Dusan
    Turk, Vito
    [J]. BMC EVOLUTIONARY BIOLOGY, 2009, 9
  • [53] KOSHINO LL, 2008, J AGR FOOD CHEM
  • [54] AutoFACT: An (Auto)under-barmatic (F)under-barunctional (A)under-barnnotation and (C)under-barlassification (T)under-barool
    Koski, LB
    Gray, MW
    Lang, BF
    Burger, G
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [55] Characterization of chitinases able to rescue somatic embryos of the temperature-sensitive carrot variant ts11
    Kragh, KM
    Hendriks, T
    deJong, AJ
    LoSchiavo, F
    Bucherna, N
    Hojrup, P
    Mikkelsen, JD
    deVries, SC
    [J]. PLANT MOLECULAR BIOLOGY, 1996, 31 (03) : 631 - 645
  • [56] Caffeine, trigonelline, chlorogenic acids and sucrose diversity in wild Coffea arabica L. and C-canephora P. accessions
    Ky, CL
    Louarn, J
    Dussert, S
    Guyot, B
    Hamon, S
    Noirot, M
    [J]. FOOD CHEMISTRY, 2001, 75 (02) : 223 - 230
  • [57] Molecular characterisation and origin of the Coffea arabica L. genome
    Lashermes, P
    Combes, MC
    Robert, J
    Trouslot, P
    D'Hont, A
    Anthony, F
    Charrier, A
    [J]. MOLECULAR AND GENERAL GENETICS, 1999, 261 (02): : 259 - 266
  • [58] Automated SNP detection in expressed sequence tags:: statistical considerations and application to maritime pine sequences
    Le Dantec, L
    Chagné, D
    Pot, D
    Cantin, O
    Garnier-Géré, P
    Bedon, F
    Frigerio, JM
    Chaumeil, P
    Léger, P
    Garcia, V
    Laigret, F
    de Daruvar, A
    Plomion, C
    [J]. PLANT MOLECULAR BIOLOGY, 2004, 54 (03) : 461 - 470
  • [59] LEHLE L, 1973, EUR J BIOCH
  • [60] LELOUP V, 1995, P INT C ASIC, V16, P192