Clustering binary codes to express the biochemical properties of amino acids

Cited by: 0
Authors
Fu, HG [1]
Nguifo, EM [1]
Affiliations
[1] Univ Artois, CNRS, CRIL, FRE2499, F-62307 Lens, France
Source
INTELLIGENT INFORMATION PROCESSING II | 2005 / Vol. 163
Keywords
bioinformatics and AI; amino acids; classification; clustering;
DOI
10.1007/0-387-23152-8_36
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We study four kinds of binary codes for amino acids (AA). Two of the codes are based on biochemical properties; the other two are generated with artificial intelligence (AI) methods and are based, respectively, on protein structures and alignment, and on the Dayhoff matrix. To assess the global significance of each binary code, we use a hierarchical clustering method to generate clusters of amino acids from each code. Each cluster is examined against biochemical properties to explain the similarity between the amino acids it contains. To validate this examination, a decision-tree-based machine learning system is used to characterize the AA clusters obtained with each binary code. The experiments show that one of the AI-based codes yields clusters with significant biochemical properties. Consequently, it appears that even if the attributes of binary codes generated with AI methods do not individually correspond to a biochemical property, they can be significant when taken as a whole. Conversely, binary codes based on biochemical properties can be insignificant when taken as a whole.
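The clustering step described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption, not taken from the paper: the 4-bit codes and their bit meanings are invented for illustration, Hamming distance and average linkage are one plausible choice among several, and `shared_bits` is only a simple stand-in for examining a cluster, not the decision-tree system the authors use.

```python
from itertools import combinations

# Hypothetical 4-bit codes, invented for illustration only; they are NOT the
# codes studied in the paper. Assumed bit meanings:
# (hydrophobic, polar, charged, aromatic)
CODES = {
    "A": (1, 0, 0, 0), "V": (1, 0, 0, 0), "L": (1, 0, 0, 0),
    "F": (1, 0, 0, 1), "W": (1, 0, 0, 1),
    "S": (0, 1, 0, 0), "T": (0, 1, 0, 0),
    "D": (0, 1, 1, 0), "K": (0, 1, 1, 0),
}


def hamming(a, b):
    """Number of bit positions in which two codes differ."""
    return sum(x != y for x, y in zip(a, b))


def average_linkage(c1, c2, codes):
    """Mean Hamming distance over all cross-cluster pairs (assumed linkage)."""
    total = sum(hamming(codes[a], codes[b]) for a in c1 for b in c2)
    return total / (len(c1) * len(c2))


def agglomerate(codes, n_clusters):
    """Bottom-up hierarchical clustering, stopped at n_clusters clusters."""
    clusters = [frozenset([aa]) for aa in sorted(codes)]
    while len(clusters) > n_clusters:
        # Merge the pair of clusters with the smallest average distance.
        c1, c2 = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(pair[0], pair[1], codes),
        )
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return clusters


def shared_bits(cluster, codes):
    """Return {bit index: value} for the bits on which all members agree."""
    members = [codes[aa] for aa in cluster]
    return {
        i: vals[0]
        for i, vals in enumerate(zip(*members))
        if len(set(vals)) == 1
    }


if __name__ == "__main__":
    for cluster in agglomerate(CODES, 2):
        print(sorted(cluster), shared_bits(cluster, CODES))
```

With these toy codes, stopping at two clusters separates the residues coded as hydrophobic from those coded as polar, and `shared_bits` reports which attributes are constant within each cluster. Whether a real code behaves this way is exactly the question the paper investigates.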
Pages: 279-282
Number of pages: 4