Multi-Label Learning With Fuzzy Hypergraph Regularization for Protein Subcellular Location Prediction

被引:12
作者
Chen, Jing [1 ,2 ]
Tang, Yuan Yan [1 ,2 ]
Chen, C. L. Philip [1 ]
Fang, Bin [3 ]
Lin, Yuewei [4 ]
Shang, Zhaowei [3 ]
机构
[1] Univ Macau, Fac Sci & Technol, Taipa, Macau, Peoples R China
[2] Chongqing Univ, Chongqing 400030, Peoples R China
[3] Chongqing Univ, Coll Comp Sci, Chongqing 400030, Peoples R China
[4] Univ S Carolina, Columbia, SC 29208 USA
基金
国家自然科学基金重大项目;
关键词
Dictionary learning; hypergraph regularization; multi-label learning; protein subcellular localization; AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; POSITIVE BACTERIAL PROTEINS; AVERAGE CHEMICAL-SHIFT; GRAM-NEGATIVE-BACTERIA; GENERAL-FORM; EVOLUTIONARY INFORMATION; ENSEMBLE CLASSIFIER; CHOUS PSEAAC; LOCALIZATION PREDICTION;
D O I
10.1109/TNB.2014.2341111
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein subcellular location prediction aims to predict the location where a protein resides within a cell using computational methods. Considering the main limitations of the existing methods, we propose a hierarchical multi-label learning model FHML for both single-location proteins and multi-location proteins. The latent concepts are extracted through feature space decomposition and label space decomposition under the nonnegative data factorization framework. The extracted latent concepts are used as the codebook to indirectly connect the protein features to their annotations. We construct dual fuzzy hypergraphs to capture the intrinsic high-order relations embedded in not only feature space, but also label space. Finally, the subcellular location annotation information is propagated from the labeled proteins to the unlabeled proteins by performing dual fuzzy hypergraph Laplacian regularization. The experimental results on the six protein benchmark datasets demonstrate the superiority of our proposed method by comparing it with the state-of-the-art methods, and illustrate the benefit of exploiting both feature correlations and label correlations.
引用
收藏
页码:438 / 447
页数:10
相关论文
共 89 条
[21]   A New Method for Predicting the Subcellular Localization of Eukaryotic Proteins with Both Single and Multiple Sites: Euk-mPLoc 2.0 [J].
Chou, Kuo-Chen ;
Shen, Hong-Bin .
PLOS ONE, 2010, 5 (03)
[22]   Using Chou's pseudo amino acid composition to predict subcellular localization of apoptosis proteins: An approach with immune genetic algorithm-based ensemble classifier [J].
Ding, Yong-Sheng ;
Zhang, Tong-Liang .
PATTERN RECOGNITION LETTERS, 2008, 29 (13) :1887-1892
[23]   Wanted: subcellular localization of proteins based on sequence [J].
Eisenhaber, F ;
Bork, P .
TRENDS IN CELL BIOLOGY, 1998, 8 (04) :169-170
[24]   Predicting subcellular localization of proteins based on their N-terminal amino acid sequence [J].
Emanuelsson, O ;
Nielsen, H ;
Brunak, S ;
von Heijne, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (04) :1005-1016
[25]   Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses [J].
Esmaeili, Maryam ;
Mohabatkar, Hassan ;
Mohsenzadeh, Sasan .
JOURNAL OF THEORETICAL BIOLOGY, 2010, 263 (02) :203-209
[26]   Discriminating bioluminescent proteins by incorporating average chemical shift and evolutionary information into the general form of Chou's pseudo amino acid composition [J].
Fan, Guo-Liang ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2013, 334 :45-51
[27]   Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou's pseudo amino acid composition [J].
Fan, Guo-Liang ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2012, 304 :88-95
[28]   iNR-Drug: Predicting the Interaction of Drugs with Nuclear Receptors in Cellular Networking [J].
Fan, Yue-Nong ;
Xiao, Xuan ;
Min, Jian-Liang ;
Chou, Kuo-Chen .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (03) :4915-4937
[29]   Automated subcellular location determination and high-throughput microscopy [J].
Glory, Estelle ;
Murphy, Robert F. .
DEVELOPMENTAL CELL, 2007, 12 (01) :7-16
[30]   RETRACTED: Predicting Protein Folding Rates Using the Concept of Chou's Pseudo Amino Acid Composition (Retracted article. See vol. 33, pg. 2614, 2012) [J].
Guo, Jianxiu ;
Rao, Nini ;
Liu, Guangxiong ;
Yang, Yong ;
Wang, Gang .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2011, 32 (08) :1612-1617