Assessment of kinship detection using RNA-seq data

Cited by: 9
Authors
Blay, Natalia [1, 2, 3]
Casas, Eduard [1, 2, 4]
Galvan-Femenia, Ivan [1, 5]
Graffelman, Jan [6, 7]
de Cid, Rafael [1, 5]
Vavouri, Tanya [1, 2]
Affiliations
[1] Germans Trias & Pujol Res Inst PMPPC IGTP, Program Predict & Personalized Med Canc, Badalona 08916, Spain
[2] Univ Autonoma Barcelona, Josep Carreras Leukaemia Res Inst IJC, Campus ICO Germans Trias & Pujol, Badalona 08916, Spain
[3] Univ Oberta Catalunya, Masters Programme Bioinformat & Biostat, Barcelona 08035, Spain
[4] Univ Barcelona, Doctoral Programme Biomed, Barcelona 08007, Spain
[5] Germans Trias & Pujol Res Inst, Genomes Life GCAT Lab Grp, Can Ruti Campus,Cami Escoles S-N, Barcelona 08916, Spain
[6] Univ Politecn Cataluna, Dept Stat & Operat Res, Barcelona 08028, Spain
[7] Univ Washington, Dept Biostat, Seattle, WA 98105 USA
Keywords
IDENTIFICATION; RELATEDNESS; INHERITANCE; LIKELIHOODS; EXPRESSION; VARIANTS; FORMAT; TOOL;
DOI
10.1093/nar/gkz776
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Analysis of RNA sequencing (RNA-seq) data from related individuals is widely used in clinical and molecular genetics studies. Predicting kinship from RNA-seq data would be useful for confirming the expected relationships in family-based studies and for flagging samples from related individuals in case-control or population-based studies. Currently, pedigree reconstruction is largely based on SNPs or microsatellites obtained from genotyping arrays, whole-genome sequencing and whole-exome sequencing. Potential problems with using RNA-seq data for kinship detection are the low proportion of the genome that it covers, the highly skewed coverage of exons across genes, which depends on expression level, and allele-specific expression. In this study we assess the use of RNA-seq data to detect kinship between individuals through pairwise identity-by-descent (IBD) estimates. First, we obtained high-quality SNPs by applying successive filters that minimize the effects of allelic imbalance as well as errors in sequencing, mapping and genotyping. Then, we used these SNPs to calculate pairwise IBD estimates. By analysing both real and simulated RNA-seq data, we show that it is possible to identify relationships up to the second degree using RNA-seq data of even low to moderate sequencing depth.
Pages: 9