RF-PSSM: A Combination of Rotation Forest Algorithm and Position-Specific Scoring Matrix for Improved Prediction of Protein-Protein Interactions Between Hepatitis C Virus and Human

Cited by: 5
Authors
Liu, Xin [1]
Lu, Yaping [2]
Wang, Liang [3]
Geng, Wei [1]
Shi, Xinyi [4]
Zhang, Xiao [1]
Affiliations
[1] Xuzhou Med Univ, Sch Med Informat & Engn, Xuzhou 221000, Peoples R China
[2] China Univ Min & Technol, Coll Comp Sci & Technol, Xuzhou 221116, Peoples R China
[3] Guangdong Acad Med Sci, Guangdong Prov Peoples Hosp, Lab Med, Guangzhou 510080, Peoples R China
[4] Univ Chinese Acad Sci, Hangzhou Inst Adv Study, Hangzhou 310005, Peoples R China
Keywords
Proteins; Support vector machines; Computer viruses; Liver diseases; Software algorithms; Feature extraction; Prediction algorithms; protein-protein interactions; hepatitis C virus; position specific scoring matrix; two-dimensional principal component analysis; rotation forest; PSI-BLAST; VECTOR; WEB; PCA;
DOI
10.26599/BDMA.2022.9020031
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The identification of interactions between hepatitis C virus (HCV) and human proteins will not only help us understand the molecular mechanisms of related diseases but also be conducive to discovering new drug targets. An increasing number of clinically and experimentally validated interactions between HCV and human proteins have been documented in public databases, facilitating studies based on computational methods. In this study, we proposed a new computational approach, rotation forest position-specific scoring matrix (RF-PSSM), to predict the interactions between HCV and human proteins. In particular, a PSSM was used to characterize each protein, and two-dimensional principal component analysis (2DPCA) was then adopted to extract features from the PSSM. Finally, rotation forest (RF) was used to perform classification. The results of various ablation experiments show that, on independent datasets, the accuracy and area under the curve (AUC) of RF-PSSM reach 93.74% and 94.29%, respectively, outperforming almost all cutting-edge methods. In addition, we used RF-PSSM to predict 9 human proteins that may interact with HCV protein E1, which can provide theoretical guidance for future experimental studies.
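The 2DPCA feature-extraction step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `fit_2dpca` and `transform_2dpca`, the fixed matrix size (50 rows by 20 columns), the number of retained axes, and the random toy data standing in for real PSSMs are all assumptions made for illustration. The sketch follows the standard 2DPCA formulation, in which projection axes are the leading eigenvectors of the averaged matrix G = (1/M) Σ (A_i − Ā)ᵀ(A_i − Ā).

```python
import numpy as np

def fit_2dpca(matrices, n_components):
    """Return 2DPCA projection axes, shape (n_cols, n_components).

    Assumes every input matrix has the same shape (hypothetical
    simplification; real PSSMs vary in length with the protein).
    """
    stack = np.asarray(matrices, dtype=float)        # (M, rows, cols)
    centered = stack - stack.mean(axis=0)            # subtract mean matrix
    # G = (1/M) * sum_i (A_i - mean)^T (A_i - mean), shape (cols, cols)
    scatter = np.einsum("mij,mik->jk", centered, centered) / stack.shape[0]
    eigvals, eigvecs = np.linalg.eigh(scatter)       # eigenvalues ascending
    order = np.argsort(eigvals)[::-1][:n_components] # keep largest ones
    return eigvecs[:, order]

def transform_2dpca(matrix, axes):
    """Project one matrix onto the axes and flatten to a feature vector."""
    return (np.asarray(matrix, dtype=float) @ axes).ravel()

# Toy data: 30 random matrices standing in for PSSMs (not real profiles).
rng = np.random.default_rng(0)
toy_pssms = rng.normal(size=(30, 50, 20))

axes = fit_2dpca(toy_pssms, n_components=5)
features = np.array([transform_2dpca(p, axes) for p in toy_pssms])
```

Each row of `features` could then be passed to a classifier. Rotation forest is not part of scikit-learn, so the classification step is left out of this sketch.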
Pages: 21-31
Page count: 11