Improved Identification of Cytokines Using Feature Selection Techniques

被引:6
作者
Jiang, Limin [1 ,4 ]
Liao, Zhijun [2 ]
Su, Ran [3 ]
Wei, Leyi [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Fujian Med Univ, Sch Basic Med Sci, Dept Biochem & Mol Biol, Fuzhou, Fujian, Peoples R China
[3] Tianjin Univ, Sch Software, Tianjin, Peoples R China
[4] Hebei Univ Engn, Sch Informat & Elect Engn, Handan, Peoples R China
关键词
Biological activities; cytokines; human diseases; max-relevance-max-distance (MRMD); feature selection techniques; principal components analysis (PCA); PREDICTION METHOD; WEB SERVER; PROTEIN; CLASSIFICATION; MACHINE; CANCER; DNA;
D O I
10.2174/1570178614666170227143434
中图分类号
O62 [有机化学];
学科分类号
070303 ; 081704 ;
摘要
Background: Cytokines, as small signaling proteins, play critical roles in biological functions and are closely related with human diseases. Accurate identification of cytokines is the first step to provide insights into the relevance of cytokines and human diseases. In recent years, many research efforts have been done for the development of computational methods, especially for machine learning based methods, to fast and accurately identify cytokines. Currently, a major challenge lying in existing machine learning based methods is to improve the performance of cytokine identification. Method: In this study, we attempt to enhance the performance of cytokine identification methods from the two following factors: (1) feature representation and (2) classifier selection. For feature extraction, we fuse multiple types of features showing good performance to classify cytokines from non-cytokines, and employ two feature selection techniques, Max-Relevance-Max-Distance (MRMD) and Principal Components Analysis (PCA), to yield the optimal feature representations. For classifier selection, various powerful classifiers are performed, and the one with the highest performance is determined to build the classification model for our method. Results: Based on the analysis, we learned that our feature sets stably maintain high performance with any of the classifier we used. And, the overall performances of the combinations were in the following order from best to worst: 473D+LIBSVM, MRMD+LIBD3C, and PCA+LIBSVM. Conclusion: Comparative studies demonstrate that our proposed strategy is effective for the improved performance in identification of cytokines.
引用
收藏
页码:632 / 641
页数:10
相关论文
共 26 条
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]  
Boutet E, 2016, METHODS MOL BIOL, V1374, P23, DOI 10.1007/978-1-4939-3167-5_2
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Inflammatory Cytokines and Lung Cancer Risk in 3 Prospective Studies [J].
Brenner, Darren R. ;
Fanidi, Anouar ;
Grankvist, Kjell ;
Muller, David C. ;
Brennan, Paul ;
Manjer, Jonas ;
Byrnes, Graham ;
Hodge, Allison ;
Severi, Gianluca ;
Giles, Graham G. ;
Johansson, Mikael ;
Johansson, Mattias .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2017, 185 (02) :86-95
[5]   SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, X ;
Chen, YZ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3692-3697
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Interleukins 1 and 6 as main mediators of inflammation and cancer [J].
Dmitrieva, O. S. ;
Shilovskiy, I. P. ;
Khaitov, M. R. ;
Grivennikov, S. I. .
BIOCHEMISTRY-MOSCOW, 2016, 81 (02) :80-90
[8]   Extreme learning machine: Theory and applications [J].
Huang, Guang-Bin ;
Zhu, Qin-Yu ;
Siew, Chee-Kheong .
NEUROCOMPUTING, 2006, 70 (1-3) :489-501
[9]   CD-HIT Suite: a web server for clustering and comparing biological sequences [J].
Huang, Ying ;
Niu, Beifang ;
Gao, Ying ;
Fu, Limin ;
Li, Weizhong .
BIOINFORMATICS, 2010, 26 (05) :680-682
[10]   BP Neural Network Could Help Improve Pre-miRNA Identification in Various Species [J].
Jiang, Limin ;
Zhang, Jingjun ;
Xuan, Ping ;
Zou, Quan .
BIOMED RESEARCH INTERNATIONAL, 2016, 2016