AGTAR: A novel approach for transcriptome assembly and abundance estimation using an adapted genetic algorithm from RNA-seq data

Cited by: 1
Authors
Li, Mingyue [1]
Bai, Miao [1]
Wu, Yulun [1]
Shao, Wenjun [1]
Zheng, Lihua [2]
Sun, Luguo [1]
Wang, Shuyue [1]
Yu, Chunlei [2]
Huang, Yanxin [1]
Affiliations
[1] Northeast Normal Univ, Natl Engn Lab Druggable Gene & Prot Screening, Changchun 130024, Peoples R China
[2] Northeast Normal Univ, Minist Educ, Res Ctr Agr & Med Gene Engn, Changchun 130024, Peoples R China
Keywords
RNA-seq; Adapted genetic algorithm; Transcript assembly; Abundance estimation; Quantification
DOI
10.1016/j.compbiomed.2021.104646
Chinese Library Classification number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: Recently, the rapid development of RNA-seq technologies has accelerated transcriptomics research. The accurate identification and quantification of transcripts based on RNA-seq data will facilitate the exploration of various potential biological mechanisms. However, due to the limitations of the current data analysis tools and RNA-seq technologies, full and accurate reconstruction of the transcriptome still faces many challenges.

Results: We developed the adapted genetic algorithm (AGTAR) program, which can reliably assemble transcriptomes and estimate abundance based on RNA-seq data with or without genome annotation files. We defined a new concept, isoform junction abundance, to help enhance the accuracy of isoform identification and quantification. Isoform abundance and isoform junction abundance are estimated by an adapted genetic algorithm. The crossover and mutation probabilities of the algorithm can be adaptively adjusted to effectively prevent premature convergence. Both simulated and real data indicated that AGTAR's comprehensive ability to assemble transcripts is significantly superior to that achievable by the currently widely used tools with similar functions.

Conclusions: AGTAR is a tool for identifying and quantifying transcripts from RNA-seq data. It has the advantages of higher accuracy and ease of use. The AGTAR package is freely available at https://github.com/v4yuezi/AGTAR.git.
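
The abstract states that the crossover and mutation probabilities are adjusted adaptively to prevent premature convergence, but it does not give the rule AGTAR uses. As an illustration only, the sketch below shows one common fitness-based scheme: individuals at or above the population's average fitness receive a probability that shrinks linearly toward zero as their fitness approaches the best fitness, while below-average individuals keep the full rate. The function names, the ceilings `pc_max` and `pm_max`, and the per-individual treatment of crossover are assumptions made for this example, not details taken from AGTAR.

```python
from statistics import mean


def adaptive_probability(fitness, f_avg, f_max, p_max):
    """Hypothetical helper: scale an operator probability by relative fitness."""
    # Below-average individuals, or a population that has fully converged
    # (f_max == f_avg), keep the full rate so the search keeps exploring.
    if fitness < f_avg or f_max == f_avg:
        return p_max
    # At or above average: shrink linearly to 0 as fitness approaches f_max,
    # which protects the best solutions from disruption.
    return p_max * (f_max - fitness) / (f_max - f_avg)


def population_rates(fitnesses, pc_max=0.9, pm_max=0.1):
    """Return (crossover, mutation) probabilities for each individual.

    pc_max and pm_max are illustrative ceilings, not AGTAR's settings.
    """
    f_avg, f_max = mean(fitnesses), max(fitnesses)
    return [
        (
            adaptive_probability(f, f_avg, f_max, pc_max),
            adaptive_probability(f, f_avg, f_max, pm_max),
        )
        for f in fitnesses
    ]


if __name__ == "__main__":
    for f, (pc, pm) in zip([2.0, 4.0, 6.0], population_rates([2.0, 4.0, 6.0])):
        print(f"fitness={f}: crossover={pc}, mutation={pm}")
```

Under this scheme, a population whose fitness values have all collapsed to the same value falls back to the full rates, which is one way such a rule can counteract premature convergence.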
Pages: 10