Protein sequence;
graphic representation;
fractal interpolation;
principal component analysis;
PHYSICOCHEMICAL PROPERTIES;
DISTANCE;
DIMENSION;
ALIGNMENT;
CURVE;
D O I:
10.1109/TCBB.2015.2511731
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
A new graphical representation of protein sequences is introduced in this paper. Nine main physicochemical properties of amino acids were used to obtain a 2D discrete point set for protein sequences by applying principal component analysis. The fractal method was then employed to interpolate discrete points in constructing a graphical representation of protein sequences. Fractal dimension of the protein curve was used to analyze the similarity of protein sequences by comparing the distance of vectors representing segments of protein sequences. The Jeffrey's and Matusita distance was modified in the similarity comparison of protein sequences with different lengths. Nine different species from Nicotinamide adenine dinucleotide (NADH) dehydrogenase 5 (ND5) protein sequences were tested as an example to demonstrate our method. Finally, a linear correlation and significance analysis was used to compare our results with other graphical representations referring to the ClustalW result. To confirm the validity of our method, eight species in NADH dehydrogenase 6 (ND6) protein families and twenty-seven species in beta-globin protein families were also analyzed. Experimental results show that the proposed method is effective for the similarity analysis of proteins.
机构:
Shanghai Univ, Inst Syst Biol, Shanghai 200244, Peoples R China
Shaoyang Univ, Dept Math, Shaoyang 422000, Hunan, Peoples R China
Hunan First Normal Coll, Changsha 410002, Hunan, Peoples R ChinaShanghai Univ, Inst Syst Biol, Shanghai 200244, Peoples R China
Huang, Guohua
Hu, Jerry
论文数: 0引用数: 0
h-index: 0
机构:
Univ Houston Victoria, Sch Arts & Sci, Dept Math & Comp Sci, Sugar Land, TX 77479 USAShanghai Univ, Inst Syst Biol, Shanghai 200244, Peoples R China
机构:
Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R China
Yao, Yuhua
Yan, Shoujiang
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R China
Yan, Shoujiang
Xu, Huimin
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R China
Xu, Huimin
Han, Jianning
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R China
Han, Jianning
Nan, Xuying
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R China
Nan, Xuying
He, Ping-an
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R China
He, Ping-an
Dai, Qi
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Sci Tech Univ, Coll Life Sci, Hangzhou, Zhejiang, Peoples R China
机构:
Shandong Univ, Sch Math, Jinan 250100, Peoples R China
Shandong Univ Weihai, Sch Math & Stat, Weihai 264209, Peoples R ChinaShandong Univ, Sch Math, Jinan 250100, Peoples R China
Mu, Zengchao
Li, Guojun
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ, Sch Math, Jinan 250100, Peoples R ChinaShandong Univ, Sch Math, Jinan 250100, Peoples R China
Li, Guojun
Wu, Haiyan
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ Weihai, Sch Math & Stat, Weihai 264209, Peoples R ChinaShandong Univ, Sch Math, Jinan 250100, Peoples R China
Wu, Haiyan
Qi, Xingqin
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ Weihai, Sch Math & Stat, Weihai 264209, Peoples R ChinaShandong Univ, Sch Math, Jinan 250100, Peoples R China