Feature extraction method for proteins based on Markov tripeptide by compressive sensing
Cited by: 2
Authors:
Gao, C. F. [1,2]
Wu, X. Y. [1]
Affiliations:
[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Peoples R China
[2] Wuxi Engn Res Ctr Biocomp, Wuxi 214122, Peoples R China
Background: To capture the essential structural information of a protein, its symbol (residue) sequence was transformed into a Markov frequency matrix built from every run of three consecutive residues along the chain. This yielded a sparse three-dimensional matrix of size 20 x 20 x 20, which was flattened into a one-dimensional vector. An appropriate measurement matrix was then selected and applied to the vector, so that random projection produced a compressed feature set. This procedure is the proposed compressive sensing feature extraction method.
Results: Several indexes were evaluated on a dataset of cell membrane, cytoplasm, and nucleus proteins to assess how well the features discriminate between these classes. In comparison with the traditional scale wavelet energy and amino acid composition features, the experimental results suggested that the features from the new method are advantageous and accurate.
Conclusions: The features extracted by this model can preserve as much as possible of the information contained in the sequence and reflect essential properties of the protein. The method is therefore a suitable and promising way to collect and process protein sequences when the sample size is large and the dimension is high.
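The pipeline described in the abstract (tripeptide counting, flattening to a vector of length 8000, random projection) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the Markov frequency matrix or name the measurement matrix, so the sketch assumes plain normalized tripeptide frequencies and a seeded Gaussian measurement matrix. The names `tripeptide_vector`, `measurement_matrix`, and `compress`, the residue ordering, and the compressed length `m` are all hypothetical.

```python
import random

# Assumption: the 20 standard amino acids in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def tripeptide_vector(sequence):
    """Return the flattened 20 x 20 x 20 tripeptide frequency tensor.

    Every window of three consecutive residues adds one count; windows
    containing a non-standard symbol are skipped. Counts are normalized
    to sum to 1 (an assumed stand-in for the paper's Markov frequencies).
    """
    counts = [0.0] * (20 ** 3)
    total = 0
    for a, b, c in zip(sequence, sequence[1:], sequence[2:]):
        if a in INDEX and b in INDEX and c in INDEX:
            # Row-major position of entry (a, b, c) in the flattened tensor.
            counts[(INDEX[a] * 20 + INDEX[b]) * 20 + INDEX[c]] += 1.0
            total += 1
    if total:
        counts = [x / total for x in counts]
    return counts


def measurement_matrix(m, n=20 ** 3, seed=0):
    """Build an m x n Gaussian random matrix scaled by 1/sqrt(m).

    A Gaussian matrix is one common choice in compressive sensing; the
    abstract does not say which matrix the authors selected.
    """
    rng = random.Random(seed)
    scale = m ** -0.5
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(n)] for _ in range(m)]


def compress(vector, phi):
    """Project the sparse vector onto the rows of phi (y = phi * x)."""
    # Only the nonzero entries contribute, so skip the rest.
    nonzero = [(j, x) for j, x in enumerate(vector) if x]
    return [sum(row[j] * x for j, x in nonzero) for row in phi]


if __name__ == "__main__":
    x = tripeptide_vector("MKTAYIAKQRQISFVKSHFSRQ")  # made-up example sequence
    y = compress(x, measurement_matrix(m=64))
    print(len(x), len(y))
```

Because the same seeded matrix must be applied to every protein, the compressed vectors stay comparable across a dataset. The compressed length `m` trades feature size against how much of the sparse tripeptide signal is retained.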