CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features

被引:1060
作者
Kang, Yu-Jian [1 ]
Yang, De-Chang [1 ]
Kong, Lei [1 ]
Hou, Mei [1 ]
Meng, Yu-Qi [1 ]
Wei, Liping [1 ]
Gao, Ge [1 ]
机构
[1] Peking Univ, Ctr Bioinformat, Sch Life Sci, State Key Lab Prot & Plant Gene Res, Beijing 100871, Peoples R China
关键词
LONG NONCODING RNAS; DATABASE; TOOL; GENERATION; V2.0;
D O I
10.1093/nar/gkx428
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With advances in next-generation sequencing technologies, numerous novel transcripts in a large number of organisms have been identified. With the goal of fast, accurate assessment of the coding ability of RNA transcripts, we upgraded the coding potential calculator CPC1 to CPC2. CPC2 runs similar to 1000 times faster than CPC1 and exhibits superior accuracy compared with CPC1, especially for long non-coding transcripts. Moreover, the model of CPC2 is species-neutral, making it feasible for ever-growing non-model organism transcriptomes. A mobile-friendly web server, as well as a downloadable standalone package, is freely available at http://cpc2.cbi.pku.edu.cn.
引用
收藏
页码:W12 / W16
页数:5
相关论文
共 33 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   microRNAs: Tiny regulators with great potential [J].
Ambros, V .
CELL, 2001, 107 (07) :823-826
[3]   Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus Paracoccidioides brasiliensis Software [J].
Arrial, Roberto T. ;
Togawa, Roberto C. ;
Brigido, Marcelo de M. .
BMC BIOINFORMATICS, 2009, 10
[4]  
Boutet E, 2016, METHODS MOL BIOL, V1374, P23, DOI 10.1007/978-1-4939-3167-5_2
[5]   An Epigenetic Role for Maternally Inherited piRNAs in Transposon Silencing [J].
Brennecke, Julius ;
Malone, Colin D. ;
Aravin, Alexei A. ;
Sachidanandam, Ravi ;
Stark, Alexander ;
Hannon, Gregory J. .
SCIENCE, 2008, 322 (5906) :1387-1392
[6]   Reference-free transcriptome assembly in non-model animals from next-generation sequencing data [J].
Cahais, V. ;
Gayral, P. ;
Tsagkogeorga, G. ;
Melo-Ferreira, J. ;
Ballenghien, M. ;
Weinert, L. ;
Chiari, Y. ;
Belkhir, K. ;
Ranwez, V. ;
Galtier, N. .
MOLECULAR ECOLOGY RESOURCES, 2012, 12 (05) :834-845
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   The Ribosomal Database Project (RDP-II): previewing a new autoaligner that allows regular updates and the new prokaryotic taxonomy [J].
Cole, JR ;
Chai, B ;
Marsh, TL ;
Farris, RJ ;
Wang, Q ;
Kulam, SA ;
Chandra, S ;
McGarrell, DM ;
Schmidt, TM ;
Garrity, GM ;
Tiedje, JM .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :442-443
[9]   Non-coding RNA genes and the modern RNA world [J].
Eddy, SR .
NATURE REVIEWS GENETICS, 2001, 2 (12) :919-929
[10]   Determinants of genetic diversity [J].
Ellegren, Hans ;
Galtier, Nicolas .
NATURE REVIEWS GENETICS, 2016, 17 (07) :422-433