Automated identification of reference genes based on RNA-seq data

Cited by: 22
Authors
Carmona, Rosario [1]
Arroyo, Macarena [2]
Jose Jimenez-Quesada, Maria [1]
Seoane, Pedro [3]
Zafra, Adoracion [1]
Larrosa, Rafael [4]
de Dios Alche, Juan [1]
Gonzalo Claros, M. [3]
Affiliations
[1] CSIC, Estn Expt Zaidin, Dept Biochem Cell & Mol Biol Plants, Plant Reprod Biol Lab, Granada, Spain
[2] Hosp Reg Univ Malaga, Serv Neumol, Avda Carlos Haya S-N, Malaga, Spain
[3] Univ Malaga, Dept Biol Mol & Bioquim, Malaga, Spain
[4] Univ Malaga, Dept Arquitectura Comp, Malaga, Spain
Source
Keywords
Reference genes; Normalization; Real-time PCR; Quantitative PCR; Olive (Olea europaea L.); Cancer; QUANTITATIVE RT-PCR; GENOME-WIDE IDENTIFICATION; RELIABLE REFERENCE GENES; HOUSEKEEPING GENES; EXPRESSION ANALYSIS; PROSTATE-CANCER; INTERNAL CONTROL; OLIVE FRUIT; VALIDATION; SELECTION;
DOI
10.1186/s12938-017-0356-5
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Background: Gene expression analyses demand appropriate reference genes (RGs) for normalization in order to obtain reliable assessments. Ideally, RG expression levels should remain constant in all cells, tissues or experimental conditions under study. Housekeeping genes traditionally fulfilled this requirement, but they have been reported to be less invariant than expected; therefore, RGs should be tested and validated for every particular situation. Microarray data have been used to propose new RGs, but they are available only for a limited set of model species and conditions; in contrast, RNA-seq experiments are increasingly frequent and constitute a new source of candidate RGs. Results: An automated workflow based on mapped NGS reads has been constructed to obtain highly and invariantly expressed RGs, using a normalized expression in reads per mapped million and the coefficient of variation. This workflow has been tested with Roche/454 reads from reproductive tissues of olive tree (Olea europaea L.), as well as with Illumina paired-end reads from two different accessions of Arabidopsis thaliana and three different human cancers (prostate cancer, small-cell lung cancer and lung adenocarcinoma). Candidate RGs have been proposed for each species, and many of them have been previously reported as RGs in the literature. Experimental validation of significant RGs in olive tree is provided to support the algorithm. Conclusion: Regardless of sequencing technology, number of replicates, and library sizes, when RNA-seq experiments are designed and performed, the same datasets can be analyzed with our workflow to extract suitable RGs for subsequent PCR validation. Moreover, different subsets of experimental conditions can provide different suitable RGs.
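The selection criterion named in the abstract (high, invariant expression judged by reads per mapped million and the coefficient of variation) can be illustrated with a minimal sketch. This is not the authors' workflow: the function names, the `min_mean` and `max_cv` thresholds, and the toy counts are all hypothetical, chosen only to show the arithmetic.

```python
from statistics import mean, pstdev


def reads_per_million(counts):
    """Scale one sample's raw mapped-read counts to reads per million mapped reads."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no mapped reads")
    return {gene: n * 1_000_000 / total for gene, n in counts.items()}


def candidate_reference_genes(samples, min_mean=100.0, max_cv=0.2):
    """Return (gene, mean, cv) tuples for highly and invariantly expressed genes.

    min_mean and max_cv are illustrative thresholds, not values from the paper.
    """
    normalized = [reads_per_million(s) for s in samples]
    # Only genes observed in every sample can be compared across samples.
    shared = set.intersection(*(set(s) for s in samples))
    kept = []
    for gene in shared:
        values = [s[gene] for s in normalized]
        avg = mean(values)
        if avg < min_mean:
            continue  # expression too low to serve as a reference
        cv = pstdev(values) / avg  # coefficient of variation
        if cv <= max_cv:
            kept.append((gene, avg, cv))
    # Most stable (lowest CV) first; gene name breaks ties deterministically.
    return sorted(kept, key=lambda row: (row[2], row[0]))


# Toy counts (hypothetical): three libraries of different sizes.
samples = [
    {"geneA": 500, "geneB": 50, "geneC": 450},
    {"geneA": 1000, "geneB": 900, "geneC": 100},
    {"geneA": 250, "geneB": 10, "geneC": 240},
]
result = candidate_reference_genes(samples)
```

In this toy input, `geneA` takes the same share of every library even though the raw counts differ, so normalization removes the library-size effect and its coefficient of variation is zero; `geneB` and `geneC` vary strongly between samples and are filtered out.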
Pages: 23
Related papers
50 in total
  • [31] Identification of key pathways and genes in carotid atherosclerosis through bioinformatics analysis of RNA-seq data
    Li, Zhongchen
    Hao, Jiheng
    Chen, Kun
    Jiang, Qunlong
    Wang, Peijian
    Xing, Xiaohui
    Wang, Jiyue
    Zhang, Yinjiang
    Xiao, Yilei
    Zhang, Liyong
    AGING-US, 2021, 13 (09) : 12733 - 12747
  • [32] Identification of Differentially Expressed Genes in RNA-seq Data of Arabidopsis thaliana: A Compound Distribution Approach
    Anjum, Arfa
    Jaggi, Seema
    Varghese, Eldho
    Lall, Shwetank
    Bhowmik, Arpan
    Rai, Anil
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2016, 23 (04) : 239 - 247
  • [33] Identification of prognostic genes in kidney renal clear cell carcinoma by RNA-seq data analysis
    Gu, Yanqin
    Lu, Linfeng
    Wu, Lingfeng
    Chen, Hao
    Zhu, Wei
    He, Yi
    MOLECULAR MEDICINE REPORTS, 2017, 15 (04) : 1661 - 1667
  • [34] Identification and Validation of Reference Genes in Clostridium beijerinckii NRRL B-598 for RT-qPCR Using RNA-Seq Data
    Jureckova, Katerina
    Raschmanova, Hana
    Kolek, Jan
    Vasylkivska, Maryna
    Branska, Barbora
    Patakova, Petra
    Provaznik, Ivo
    Sedlar, Karel
    FRONTIERS IN MICROBIOLOGY, 2021, 12
  • [35] Reliable Identification of Genomic Variants from RNA-Seq Data
    Piskol, Robert
    Ramaswami, Gokul
    Li, Jin Billy
    AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 93 (04) : 641 - 651
  • [36] Exploring Splicing Variants and Novel Genes in Sacred Lotus Based on RNA-seq Data
    Zhang, Xinyi
    Yu, Zimeng
    Yang, Pingfang
    PHYTON-INTERNATIONAL JOURNAL OF EXPERIMENTAL BOTANY, 2023, 92 (06) : 1665 - 1679
  • [37] Identification of Differentially Expressed Genes in Pelvic Organ Prolapse by RNA-Seq
    Xie, Ruoyun
    Xu, Ying
    Fan, Shuixiu
    Song, Yanfeng
    MEDICAL SCIENCE MONITOR, 2016, 22 : 4218 - 4225
  • [38] Identification of differentially expressed genes in the development of osteosarcoma using RNA-seq
    Yang, Yihao
    Zhang, Ya
    Qu, Xin
    Xia, Junfeng
    Li, Dongqi
    Li, Xiaojuan
    Wang, Yu
    He, Zewei
    Li, Su
    Zhou, Yonghong
    Xie, Lin
    Yang, Zuozhang
    ONCOTARGET, 2016, 7 (52) : 87194 - 87205
  • [39] Identification of nuclear genes controlling chlorophyll synthesis in barley by RNA-seq
    Shmakov, Nickolay A.
    Vasiliev, Gennadiy V.
    Shatskaya, Natalya V.
    Doroshkov, Alexey V.
    Gordeeva, Elena I.
    Afonnikov, Dmitry A.
    Khlestkina, Elena K.
    BMC PLANT BIOLOGY, 2016, 16
  • [40] Identification of nuclear genes controlling chlorophyll synthesis in barley by RNA-seq
    Nickolay A. Shmakov
    Gennadiy V. Vasiliev
    Natalya V. Shatskaya
    Alexey V. Doroshkov
    Elena I. Gordeeva
    Dmitry A. Afonnikov
    Elena K. Khlestkina
    BMC Plant Biology, 16