Identification of Antioxidant Proteins With Deep Learning From Sequence Information

Times cited: 15
Authors
Shao, Lifen [1]
Gao, Hui [1]
Liu, Zhen [1]
Feng, Juan [2]
Tang, Lixia [2]
Lin, Hao [2]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Ctr Informat Biol, Chengdu, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Ctr Informat Biol, Key Lab Neuroinformat, Minist Educ, Chengdu, Sichuan, Peoples R China
Keywords
antioxidant proteins; deep learning; g-gap dipeptide; feature selection; webserver; MECHANISM; RESOURCE; NETWORK;
DOI
10.3389/fphar.2018.01036
Chinese Library Classification (CLC) number
R9 [Pharmacy];
Discipline classification code
1007;
Abstract
Antioxidant proteins are closely linked to disease control because of their ability to eliminate excess free radicals. Because of their medicinal value, interest in identifying antioxidant proteins is growing. Many machine-learning classifiers have performed poorly on this task owing to the nonlinear and imbalanced nature of biological data. Recently, deep learning techniques have shown advantages over many state-of-the-art machine learning methods in various fields. In this study, a deep learning-based classifier was proposed to identify antioxidant proteins from a mixed g-gap dipeptide composition feature vector. The classifier employed a deep autoencoder to extract a nonlinear representation from the raw input. t-Distributed Stochastic Neighbor Embedding (t-SNE) was used for dimensionality reduction, and a support vector machine performed the final classification. The classifier achieved an F1 score of 0.8842 and an MCC of 0.7409 in 10-fold cross-validation. Experimental results show that the proposed method outperformed traditional machine learning methods and could be a promising tool for antioxidant protein identification. To support other researchers, we have developed a user-friendly web server called IDAod for antioxidant protein identification, which is freely accessible at http://bigroup.uestc.edu.cn/IDAod/.
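The mixed g-gap dipeptide composition named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the gap values `(0, 1, 2)`, and the choice to normalize by the number of valid residue pairs are assumptions made for illustration only. For each gap g, the sketch counts ordered residue pairs separated by g positions and concatenates the resulting 400-dimensional frequency vectors.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered pairs of the 20 standard residues, in a fixed order.
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def g_gap_dipeptide_composition(sequence, g):
    """Frequencies of residue pairs separated by g intervening positions.

    Assumption: pairs containing non-standard residues are skipped and the
    counts are normalized by the number of valid pairs.
    """
    counts = dict.fromkeys(DIPEPTIDES, 0)
    seq = sequence.upper()
    step = g + 1  # distance between the two residues of a pair
    total = 0
    for i in range(len(seq) - step):
        pair = seq[i] + seq[i + step]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        # Sequence too short (or no valid pairs) for this gap.
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


def mixed_g_gap_features(sequence, gaps=(0, 1, 2)):
    """Concatenate g-gap compositions for several gaps (hypothetical gap set)."""
    features = []
    for g in gaps:
        features.extend(g_gap_dipeptide_composition(sequence, g))
    return features
```

A vector built this way would then feed the later stages the abstract lists (autoencoder, t-SNE, support vector machine); those stages are not sketched here because the record gives no architecture or parameter details.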
Pages: 8