PseAAC-General: Fast Building Various Modes of General Form of Chou's Pseudo-Amino Acid Composition for Large-Scale Protein Datasets

被引:243
作者
Du, Pufeng [1 ,2 ,3 ]
Gu, Shuwang [1 ,2 ]
Jiao, Yasen [1 ,2 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Tianjin Key Lab Cognit Comp & Applicat, Tianjin 300072, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
来源
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES | 2014年 / 15卷 / 03期
基金
美国国家科学基金会;
关键词
general form; large-scale datasets; pseudo-amino acid composition; PREDICTING SUBCELLULAR-LOCALIZATION; COUPLED RECEPTOR CLASSES; OUTER-MEMBRANE PROTEINS; SUPPORT VECTOR MACHINE; STRUCTURAL CLASS; WEB-SERVER; PHYSICOCHEMICAL PROPERTIES; SUBMITOCHONDRIA LOCATIONS; EVOLUTIONARY INFORMATION; APPROXIMATE ENTROPY;
D O I
10.3390/ijms15033495
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The general form pseudo-amino acid composition (PseAAC) has been widely used to represent protein sequences in predicting protein structural and functional attributes. We developed the program PseAAC-General to generate various different modes of Chou's general PseAAC, such as the gene ontology mode, the functional domain mode, and the sequential evolution mode. This program allows the users to define their own desired modes. In every mode, 544 physicochemical properties of the amino acids are available for choosing. The computing efficiency is at least 100 times that of existing programs, which makes it able to facilitate the extensive studies on proteins and peptides. The PseAAC-General is freely available via SourceForge. It runs on both Linux and Windows.
引用
收藏
页码:3495 / 3506
页数:12
相关论文
共 91 条