Automated identification of reference genes based on RNA-seq data

被引:22
|
作者
Carmona, Rosario [1 ]
Arroyo, Macarena [2 ]
Jose Jimenez-Quesada, Maria [1 ]
Seoane, Pedro [3 ]
Zafra, Adoracion [1 ]
Larrosa, Rafael [4 ]
de Dios Alche, Juan [1 ]
Gonzalo Claros, M. [3 ]
机构
[1] CSIC, Estn Expt Zaidin, Dept Biochem Cell & Mol Biol Plants, Plant Reprod Biol Lab, Granada, Spain
[2] Hosp Reg Univ Malaga, Serv Neumol, Avda Carlos Haya S-N, Malaga, Spain
[3] Univ Malaga, Dept Biol Mol & Bioquim, Malaga, Spain
[4] Univ Malaga, Dept Arquitectura Comp, Malaga, Spain
来源
关键词
Reference genes; Normalization; Real-time PCR; Quantitative PCR; Olive (Olea europaea L.); Cancer; QUANTITATIVE RT-PCR; GENOME-WIDE IDENTIFICATION; RELIABLE REFERENCE GENES; HOUSEKEEPING GENES; EXPRESSION ANALYSIS; PROSTATE-CANCER; INTERNAL CONTROL; OLIVE FRUIT; VALIDATION; SELECTION;
D O I
10.1186/s12938-017-0356-5
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Background: Gene expression analyses demand appropriate reference genes (RGs) for normalization, in order to obtain reliable assessments. Ideally, RG expression levels should remain constant in all cells, tissues or experimental conditions under study. Housekeeping genes traditionally fulfilled this requirement, but they have been reported to be less invariant than expected; therefore, RGs should be tested and validated for every particular situation. Microarray data have been used to propose new RGs, but only a limited set of model species and conditions are available; on the contrary, RNA-seq experiments are more and more frequent and constitute a new source of candidate RGs. Results: An automated workflow based on mapped NGS reads has been constructed to obtain highly and invariantly expressed RGs based on a normalized expression in reads per mapped million and the coefficient of variation. This workflow has been tested with Roche/454 reads from reproductive tissues of olive tree (Olea europaea L.), as well as with Illumina paired-end reads from two different accessions of Arabidopsis thaliana and three different human cancers (prostate, small-cell cancer lung and lung adenocarcinoma). Candidate RGs have been proposed for each species and many of them have been previously reported as RGs in literature. Experimental validation of significant RGs in olive tree is provided to support the algorithm. Conclusion: Regardless sequencing technology, number of replicates, and library sizes, when RNA-seq experiments are designed and performed, the same datasets can be analyzed with our workflow to extract suitable RGs for subsequent PCR validation. Moreover, different subset of experimental conditions can provide different suitable RGs.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] A Clustering Approach to Identify Candidates to Housekeeping Genes Based on RNA-seq Data
    Franco, Edian F.
    Maues, Dener
    Alves, Ronnie
    Guimaraes, Luis
    Azevedo, Vasco
    Silva, Artur
    Ghosh, Preetam
    Morais, Jefferson
    Ramos, Rommel T. J.
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2019, 2020, 11347 : 83 - 95
  • [22] RNA-Seq is not required to determine stable reference genes for qPCR normalization
    Sampathkumar, Nirmal Kumar
    SundaramID, Venkat Krishnan
    Danthi, Prakroothi S.
    Barakat, Rasha
    Solomon, Shiden
    Mondal, Mrityunjoy
    Carre, Ivo
    El Jalkh, Tatiana
    Padilla-Ferrer, Aida
    Grenier, Julien
    Massaad, Charbel
    Mitchell, Jacqueline C.
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (02)
  • [23] SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis
    Benjamin K. Johnson
    Matthew B. Scholz
    Tracy K. Teal
    Robert B. Abramovitch
    BMC Bioinformatics, 17
  • [24] SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis
    Johnson, Benjamin K.
    Scholz, Matthew B.
    Teal, Tracy K.
    Abramovitch, Robert B.
    BMC BIOINFORMATICS, 2016, 17
  • [25] Systematic identification and validation of the reference genes from 60 RNA-Seq libraries in the scallop Mizuhopecten yessoensis
    Yajuan Li
    Lingling Zhang
    Ruojiao Li
    Meiwei Zhang
    Yangping Li
    Hao Wang
    Shi Wang
    Zhenmin Bao
    BMC Genomics, 20
  • [26] Systematic identification and validation of the reference genes from 60 RNA-Seq libraries in the scallop Mizuhopecten yessoensis
    Li, Yajuan
    Zhang, Lingling
    Li, Ruojiao
    Zhang, Meiwei
    Li, Yangping
    Wang, Hao
    Wang, Shi
    Bao, Zhenmin
    BMC GENOMICS, 2019, 20 (1)
  • [27] Clinker: visualizing fusion genes detected in RNA-seq data
    Schmidt, Breon M.
    Davidson, Nadia M.
    Hawkins, Anthony D. K.
    Bartolo, Ray
    Majewski, Ian J.
    Ekert, Paul G.
    Oshlack, Alicia
    GIGASCIENCE, 2018, 7 (07):
  • [28] Identification of hub genes and regulatory factors of glioblastoma multiforme subgroups by RNA-seq data analysis
    Li, Yanan
    Min, Weijie
    Li, Mengmeng
    Han, Guosheng
    Dai, Dongwei
    Zhang, Lei
    Chen, Xin
    Wang, Xinglai
    Zhang, Yuhui
    Yue, Zhijian
    Liu, Jianmin
    INTERNATIONAL JOURNAL OF MOLECULAR MEDICINE, 2016, 38 (04) : 1170 - 1178
  • [29] Identification Of Lung Specific Genes By Meta-Analysis Of Multiple Tissue Rna-Seq Data
    Xiong, M.
    Heruth, D.
    Zhang, L. Q.
    Ye, S. Q.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2016, 193
  • [30] A probabilistic approach for automated discovery of perturbed genes using expression data from microarray or RNA-Seq
    Sundaramurthy, Gopinath
    Eghbalnia, Hamid R.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 67 : 29 - 40