BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches

Cited by: 279
Authors
Liu, Bin [1, 2]
Gao, Xin [3]
Zhang, Hanyu [3]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing, Peoples R China
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
AMINO-ACID-COMPOSITION; PREDICTION; DATABASE; AUTOCORRELATION; CLASSIFICATION; COLLOCATION; RECOGNITION; PROMOTERS; PROFILES; DESIGN;
DOI
10.1093/nar/gkz740
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010; 081704;
Abstract
BioSeq-Analysis was the first web server to analyze various biological sequences at the sequence level based on machine learning approaches, and many powerful predictors in computational biology have been developed with its assistance. However, BioSeq-Analysis can only be applied to sequence-level analysis tasks, which prevents its application to residue-level analysis tasks. An intelligent tool that can automatically generate various predictors for biological sequence analysis at both the residue level and the sequence level is therefore highly desired. To this end, we present an important update of the server, BioSeq-Analysis2.0 (http://bliulab.net/BioSeq-Analysis2.0/), which covers a total of 26 features at the residue level and 90 features at the sequence level. Users only need to upload a benchmark dataset, and BioSeq-Analysis2.0 generates predictors for both residue-level and sequence-level analysis tasks. A corresponding stand-alone tool is also provided and can be downloaded from http://bliulab.net/BioSeq-Analysis2.0/download/. To the best of our knowledge, BioSeq-Analysis2.0 is the first tool for generating predictors for biological sequence analysis tasks at the residue level. The experimental results indicate that the predictors developed by BioSeq-Analysis2.0 can achieve comparable or even better performance than the existing state-of-the-art predictors.
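The distinction the abstract draws between sequence-level and residue-level analysis can be illustrated with a minimal sketch. This is not BioSeq-Analysis2.0 code and does not use its API. The function names `kmer_composition` and `residue_windows`, the DNA alphabet, and the padding character are all assumptions chosen for illustration. A sequence-level feature yields one fixed-length vector per sequence, whereas a residue-level feature yields one vector per position.

```python
from collections import Counter
from itertools import product


def kmer_composition(seq, k=2, alphabet="ACGT"):
    """Sequence-level feature (hypothetical helper): one vector per sequence.

    Returns the relative frequency of every possible k-mer over `alphabet`.
    """
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    n_windows = max(len(seq) - k + 1, 1)  # avoid division by zero
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts[km] / n_windows for km in kmers]


def residue_windows(seq, half_width=1, alphabet="ACGT", pad="-"):
    """Residue-level feature (hypothetical helper): one vector per residue.

    Each residue is described by a one-hot encoding of the window centred
    on it; positions beyond the sequence ends are padded and encode as zeros.
    """
    padded = pad * half_width + seq + pad * half_width
    vectors = []
    for i in range(len(seq)):
        window = padded[i:i + 2 * half_width + 1]
        vec = []
        for ch in window:
            vec.extend(1 if ch == a else 0 for a in alphabet)
        vectors.append(vec)
    return vectors


if __name__ == "__main__":
    dna = "ACGT"
    print(len(kmer_composition(dna)))   # one vector of 16 dinucleotide frequencies
    print(len(residue_windows(dna)))    # four vectors, one per residue
```

A sequence-level predictor would train a classifier on one such vector per sequence with one label per sequence. A residue-level predictor would train on one vector per position with one label per position.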
Pages: E127 / E127
Number of pages: 12