Implementation and comparison of kernel-based learning methods to predict metabolic networks

被引:5
作者
Roche-Lima A. [1 ]
机构
[1] Collaboration Center for Research in Health Disparities, Medical Science Campus, University of Puerto Rico., PO Box 365067, San Juan, 00936-5067, PR
基金
美国国家卫生研究院; 加拿大自然科学与工程研究理事会;
关键词
Kernel methods; Machine learning; Metabolic pathways; Network prediction;
D O I
10.1007/s13721-016-0134-5
中图分类号
学科分类号
摘要
Metabolic pathways can be conceptualized as the biological equivalent of a data pipeline. In living cells, series of chemical reactions are carried out by different proteins called enzymes in a stepwise manner. However, many pathways remain incompletely characterized, and in some of them, not all enzyme components have been identified. Kernel methods are useful in many difficult problem areas, such as document classification and bioinformatics. Specifically, kernel methods have been used recently to predict biological networks, such as protein–protein interaction networks and metabolic networks. In this paper, we implement and compare different methods and types of data to predict metabolic networks. The methods are Penalized Kernel Matrix Regression (PKMR) and pairwise Support Vector Machine (pSVM). We develop several experiments using these methods with sequence, non-sequence, and combined data. We obtain better accuracy when the sequence data are used in both methods. Whereas when the methods are compared using the same type of data, the pSVM approach shows better accuracy. The best results are obtained with pSVM using all combined kernels. © 2016, The Author(s).
引用
收藏
相关论文
共 37 条
[1]  
Allauzen C., Mohri M., Talwalkar A., Sequence kernels for predicting protein essentiality, Proceedings of the 25th international conference on machine learning ICML ‘08, pp. 9-16, (2008)
[2]  
Allauzen C., Cortes C., Mohri M., A dual coordinate descent algorithm for SVMs combined with rational kernels, Int J Found Comput Sci, 22, pp. 1761-1779, (2011)
[3]  
Basilico J., Hofmann T., Brodley C., Unifying collaborative and content based filtering, Proceedings of the twenty-first international conference on machine learning ACM, (2004)
[4]  
Ben-Hur A., Brutlag D., Remote homology detection: a motif based approach, Bioinformatics, 19, pp. i26-i33, (2003)
[5]  
Ben-Hur A., Noble W.S., Kernel methods for predicting protein–protein interactions, Bioinformatics, 21, pp. i38-i46, (2005)
[6]  
Ben-Hur A., Ong C.S., Sonnenburg S., Scholkopf B., Ratsch G., Support vector machines and kernels for computational biology, PLoS Comput Biol, 4, (2008)
[7]  
Brunner C., Fischer A., Luig K., Thies T., Pairwise support vector machines and their application to large scale problems, J Mach Learn Res, 13, pp. 2279-2292, (2012)
[8]  
Chang C.C., Lin C.J., LibSVM: a library for support vector machines, ACM Trans Intell Syst Technol (TIST), 2, (2011)
[9]  
Cortes C., Mohri M., Confidence intervals for the area under the ROC curve. Advances in neural information processing systems, (2005)
[10]  
Eisen M.B., Spellman P.T., Brown P.O., Botstein D., Cluster analysis and display of genome-wide expression patterns, Proc Natl Acad Sci, 95, pp. 14863-14868, (1998)