共 50 条
Detecting Fusion Genes in Long-Read Transcriptome Sequencing Data with FUGAREC
被引:2
|作者:
Masuda K.
[1
]
Sota Y.
[2
]
Matsuda H.
[1
]
机构:
[1] Graduate School of Information Science and Technology, Osaka University, Osaka, Suita
[2] Graduate School of Medicine, Osaka University, Osaka, Suita
基金:
日本学术振兴会;
关键词:
fusion gene;
long-read sequencing;
RNA sequencing;
D O I:
10.2197/ipsjtbio.17.1
中图分类号:
学科分类号:
摘要:
Fusion genes are important targets and biomarkers for cancer therapy. Methods of accurately detecting fusion genes are needed in clinical practice. RNA-seq is widely used to detect active fusion genes. Long-read RNA-seq can sequence the full length of mRNA, and long-read RNA-seq is expected to detect fusion genes that cannot be detected by short-read RNA-seq. However, long-read RNA-seq has high basecalling error rates, and gap sequences may occur near the breakpoints of long reads that are not aligned to the genome. When gap sequences occur, it is impossible to identify the correct fusion gene or breakpoint using existing methods. To address these challenges in fusion gene detection, we introduce a novel algorithm, FUGAREC (fusion detection with gap re-alignment and breakpoint clustering). FUGAREC uniquely combines gap sequence re-alignment with breakpoint clustering. This approach not only enhances the detection of previously undetectable fusion genes but also significantly reduces false positives. We demonstrate that FUGAREC has high fusion gene detection performance on both simulated data and sequenced data of a breast cancer cell line. © 2024 Information Processing Society of Japan.
引用
收藏
页码:1 / 9
页数:8
相关论文