Automated identification of reference genes based on RNA-seq data

Cited by: 22
Authors
Carmona, Rosario [1 ]
Arroyo, Macarena [2 ]
Jose Jimenez-Quesada, Maria [1 ]
Seoane, Pedro [3 ]
Zafra, Adoracion [1 ]
Larrosa, Rafael [4 ]
de Dios Alche, Juan [1 ]
Gonzalo Claros, M. [3 ]
Affiliations
[1] CSIC, Estn Expt Zaidin, Dept Biochem Cell & Mol Biol Plants, Plant Reprod Biol Lab, Granada, Spain
[2] Hosp Reg Univ Malaga, Serv Neumol, Avda Carlos Haya S-N, Malaga, Spain
[3] Univ Malaga, Dept Biol Mol & Bioquim, Malaga, Spain
[4] Univ Malaga, Dept Arquitectura Comp, Malaga, Spain
Source
Keywords
Reference genes; Normalization; Real-time PCR; Quantitative PCR; Olive (Olea europaea L.); Cancer; QUANTITATIVE RT-PCR; GENOME-WIDE IDENTIFICATION; RELIABLE REFERENCE GENES; HOUSEKEEPING GENES; EXPRESSION ANALYSIS; PROSTATE-CANCER; INTERNAL CONTROL; OLIVE FRUIT; VALIDATION; SELECTION;
DOI
10.1186/s12938-017-0356-5
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831 ;
Abstract
Background: Gene expression analyses require appropriate reference genes (RGs) for normalization in order to obtain reliable assessments. Ideally, RG expression levels should remain constant in all cells, tissues or experimental conditions under study. Housekeeping genes traditionally fulfilled this requirement, but they have been reported to be less invariant than expected; therefore, RGs should be tested and validated for every particular situation. Microarray data have been used to propose new RGs, but only a limited set of model species and conditions is available; in contrast, RNA-seq experiments are increasingly common and constitute a new source of candidate RGs.
Results: An automated workflow based on mapped NGS reads has been constructed to obtain highly and invariantly expressed RGs, based on normalized expression in reads per mapped million and on the coefficient of variation. This workflow has been tested with Roche/454 reads from reproductive tissues of olive tree (Olea europaea L.), as well as with Illumina paired-end reads from two different accessions of Arabidopsis thaliana and three different human cancers (prostate, small-cell lung cancer and lung adenocarcinoma). Candidate RGs have been proposed for each species, and many of them have been previously reported as RGs in the literature. Experimental validation of significant RGs in olive tree is provided to support the algorithm.
Conclusion: Regardless of sequencing technology, number of replicates, and library sizes, when RNA-seq experiments are designed and performed, the same datasets can be analyzed with our workflow to extract suitable RGs for subsequent PCR validation. Moreover, different subsets of experimental conditions can provide different suitable RGs.
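The selection criterion named in the abstract (high normalized expression in reads per mapped million, combined with a low coefficient of variation) can be illustrated with a minimal sketch. This is not the authors' workflow: the function names, the input layout (raw mapped read counts per gene per sample) and the thresholds `min_mean_rpm` and `max_cv` are assumptions chosen only for illustration.

```python
from statistics import mean, pstdev


def reads_per_million(counts_by_sample):
    """Scale raw mapped read counts to reads per mapped million (RPM).

    counts_by_sample: {sample_name: {gene_id: raw_mapped_read_count}}
    (hypothetical input layout, not taken from the paper).
    """
    rpm = {}
    for sample, counts in counts_by_sample.items():
        total = sum(counts.values())
        if total == 0:
            raise ValueError(f"sample {sample!r} has no mapped reads")
        rpm[sample] = {g: c * 1_000_000 / total for g, c in counts.items()}
    return rpm


def candidate_reference_genes(counts_by_sample, min_mean_rpm=100.0, max_cv=0.2):
    """Return (gene, mean RPM, CV) tuples for highly and stably expressed genes.

    The thresholds are illustrative assumptions, not values from the paper.
    Results are sorted by ascending CV, then by descending mean RPM.
    """
    rpm = reads_per_million(counts_by_sample)
    # Only genes quantified in every sample can be compared across samples.
    shared_genes = set.intersection(*(set(v) for v in rpm.values()))
    results = []
    for gene in shared_genes:
        values = [rpm[sample][gene] for sample in rpm]
        m = mean(values)
        if m <= 0 or m < min_mean_rpm:
            continue  # not expressed highly enough to be a useful RG
        cv = pstdev(values) / m  # coefficient of variation = sd / mean
        if cv <= max_cv:
            results.append((gene, m, cv))
    return sorted(results, key=lambda r: (r[2], -r[1]))


# Toy counts for two samples with different library sizes (invented data).
toy_counts = {
    "sample_1": {"STABLE": 500, "VARIABLE": 100, "OTHER": 400},
    "sample_2": {"STABLE": 1000, "VARIABLE": 900, "OTHER": 100},
}
candidates = candidate_reference_genes(toy_counts)
```

In this toy input, `STABLE` takes the same share of both libraries, so it is the only gene that passes both filters. Any candidate obtained this way would still need experimental PCR validation, as the abstract states.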
Pages: 23
Related papers
50 in total
  • [1] Automated identification of reference genes based on RNA-seq data
    Rosario Carmona
    Macarena Arroyo
    María José Jiménez-Quesada
    Pedro Seoane
    Adoración Zafra
    Rafael Larrosa
    Juan de Dios Alché
    M. Gonzalo Claros
    BioMedical Engineering OnLine, 16
  • [2] Identification of reference genes in lung cancer from RNA-seq data
    Varela, Macarena Arroyo
    Moreno, Rocio Bautista
    Munoz, Rosario Carmona
    Jimenez, Rafael Larrosa
    Rios, Jose Luis De la Cruz
    Cobo, Manuel
    Claros, M. G.
    EUROPEAN RESPIRATORY JOURNAL, 2017, 50
  • [3] Identification of potential genes for human ischemic cardiomyopathy based on RNA-Seq data
    Li, Wan
    Li, Liansheng
    Zhang, Shiying
    Zhang, Ce
    Huang, Hao
    Li, Yiran
    Hu, Erqiang
    Deng, Gui
    Guo, Shanshan
    Wang, Yahui
    Li, Weimin
    Chen, Lina
    ONCOTARGET, 2016, 7 (50) : 82063 - 82073
  • [4] Robust identification of differentially expressed genes from RNA-seq data
    Shahjaman, Md
    Mollah, Md Manir Hossain
    Rahman, Md Rezanur
    Islam, S. M. Shahinul
    Mollah, Md Nurul Haque
    GENOMICS, 2020, 112 (02) : 2000 - 2010
  • [5] Systematic Selection of Reference Genes for the Normalization of Circulating RNA Transcripts in Pregnant Women Based on RNA-Seq Data
    Chim, Stephen S. C.
    Wong, Karen K. W.
    Chung, Claire Y. L.
    Lam, Stephanie K. W.
    Kwok, Jamie S. L.
    Lai, Chit-Ying
    Cheng, Yvonne K. Y.
    Hui, Annie S. Y.
    Meng, Meng
    Chan, Oi-Ka
    Tsui, Stephen K. W.
    Lee, Keun-Young
    Chan, Ting-Fung
    Leung, Tak-Yeung
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (08)
  • [6] Identification of CNAs from RNA-Seq data
    Iwamoto, Eisuke
    Sanada, Masashi
    Yasuda, Takahiko
    CANCER SCIENCE, 2022, 113 : 1446 - 1446
  • [7] Fully automated pipeline for detection of sex linked genes using RNA-Seq data
    Michalovova, Monika
    Kubat, Zdenek
    Hobza, Roman
    Vyskot, Boris
    Kejnovsky, Eduard
    BMC BIOINFORMATICS, 2015, 16
  • [8] Fully automated pipeline for detection of sex linked genes using RNA-Seq data
    Monika Michalovova
    Zdenek Kubat
    Roman Hobza
    Boris Vyskot
    Eduard Kejnovsky
    BMC Bioinformatics, 16
  • [9] Semblans: automated assembly and processing of RNA-seq data
    Woodcock-Girard, Miles D.
    Bretz, Eric C.
    Robertson, Holly M.
    Ramanauskas, Karolis
    Hampton-Marcell, Jarrad T.
    Walker, Joseph F.
    BIOINFORMATICS, 2025, 41 (01)
  • [10] Identification of azadirachtin responsive genes in Spodoptera frugiperda larvae based on RNA-seq
    Shu, Benshui
    Yu, Haikuo
    Li, Yuning
    Zhong, Hongxin
    Li, Xiangli
    Cao, Liang
    Lin, Jintian
    PESTICIDE BIOCHEMISTRY AND PHYSIOLOGY, 2021, 172