An Efficient Approach for Prediction of Nuclear Receptor and Their Subfamilies Based on Fuzzy k-Nearest Neighbor with Maximum Relevance Minimum Redundancy

被引:3
作者
Tiwari, Arvind Kumar [1 ]
Srivastava, Rajeev [1 ]
机构
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, Uttar Pradesh, India
关键词
Nuclear receptor; Fuzzy k-nearest neighbor; Minimum redundancy maximum relevance; Sequence derived properties; Matthew's correlation coefficient; Cross validation; AMINO-ACID-COMPOSITION; PROTEIN SUBCELLULAR-LOCALIZATION; SUPPORT VECTOR MACHINE; COUPLED RECEPTORS; WEB SERVER; CLASSIFICATION; INFORMATION; SEQUENCE; FEATURES;
D O I
10.1007/s40010-016-0325-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The efficient classification of nuclear receptors and their subfamilies plays an important role in the detection of various diseases such as diabetes, cancer, and inflammatory diseases and their related drug design and discovery. As of now, few methods have been reported in literature for the same but the performance and efficacy of these methods are not up to the desired level. To address the issue of efficient classification of nuclear receptor and their subfamilies, here in this paper we propose to use a fuzzy k-nearest neighbor classifier with minimum redundancy maximum relevance for the classification of nuclear receptor and their eight subfamilies. The minimum redundancy maximum relevance algorithm is used to select the optimal feature subset and observed that highest accuracy and Matthew's correlation coefficient is obtained with 150 features among 753 features through fuzzy kNN classifier. The performance of fuzzy kNN classifier depends on two parameter number of nearest neighbor (k) and fuzzy coefficient (m) and it is observed that the highest accuracy and MCC is obtained at k = 7 and m = 1.25. The overall accuracies of tenfold cross validation with optimal number of features, k and m are 100 and 91.7% and the MCC values of 1.00 and 0.89 for the prediction of nuclear receptor families and subfamilies respectively. From the obtained results and analysis it is observed that the performance of the proposed approach for the classification of nuclear receptor and their eight subfamilies is very competitive with some other standard methods available in literature.
引用
收藏
页码:129 / 136
页数:8
相关论文
共 35 条
[1]   GPCRsclass: a web tool for the classification of amine type of G-protein-coupled receptors [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W143-W147
[2]   Classification of nuclear receptors based on amino acid composition and dipeptide composition [J].
Bhasin, M ;
Raghava, GPS .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (22) :23262-23266
[3]  
Cai J, 2005, LECT NOTES COMPUT SC, V3498, P680
[4]   Using pseudo-amino acid composition and support vector machine to predict protein structural class [J].
Chen, Chao ;
Tian, Yuan-Xin ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang ;
Mo, Jin-Yuan .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) :444-448
[5]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[6]   Predicting DNA-binding proteins: approached from Chou's pseudo amino acid composition and other specific sequence features [J].
Fang, Y. ;
Guo, Y. ;
Feng, Y. ;
Li, M. .
AMINO ACIDS, 2008, 34 (01) :103-109
[7]   Classification of G-protein coupled receptors at four levels [J].
Gao, Qing-Bin ;
Wang, Zheng-Zhi .
PROTEIN ENGINEERING DESIGN & SELECTION, 2006, 19 (11) :511-516
[8]   Prediction of nuclear receptors with optimal pseudo amino acid composition [J].
Gao, Qing-Bin ;
Jin, Zhi-Chao ;
Ye, Xiao-Fei ;
Wu, Cheng ;
He, Jia .
ANALYTICAL BIOCHEMISTRY, 2009, 387 (01) :54-59
[9]   Support vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order, and similarity search [J].
Garg, A ;
Bhasin, M ;
Raghava, GPS .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (15) :14427-14432
[10]   Prediction of G-Protein-Coupled Receptor Classes in Low Homology Using Chou's Pseudo Amino Acid Composition with Approximate Entropy and Hydrophobicity Patterns [J].
Gu, Quan ;
Ding, Yong-Sheng ;
Zhang, Tong-Liang .
PROTEIN AND PEPTIDE LETTERS, 2010, 17 (05) :559-567