PSRTTCA: A new approach for improving the prediction and characterization of tumor T cell antigens using propensity score representation learning

被引:12
作者
Charoenkwan, Phasit [1 ]
Pipattanaboon, Chonlatip [2 ]
Nantasenamat, Chanin [3 ]
Hasan, Md Mehedi [4 ]
Moni, Mohammad Ali [5 ]
Lio, Pietro [6 ]
Shoombuatong, Watshara [3 ]
机构
[1] Chiang Mai Univ, Coll Arts Media & Technol, Modern Management & Informat Technol, Chiang Mai 50200, Thailand
[2] Khon Kaen Univ, Fac Med, Dept Microbiol, Khon Kaen 40002, Thailand
[3] Mahidol Univ, Fac Med Technol, Ctr Data Min & Biomed Informat, Bangkok 10700, Thailand
[4] Tulane Univ, Tulane Ctr Biomed Informat & Genom, Sch Med, John W Deming Dept Med, New Orleans, LA 70112 USA
[5] Univ Queensland, Fac Hlth & Behav Sci, Sch Hlth & Rehabil Sci, Artificial Intelligence & Digital Hlth Data Sci, St Lucia, Qld 4072, Australia
[6] Univ Cambridge, Dept Comp Sci & Technol, Cambridge CB3 0FD, England
关键词
T -cell antigen; Propensity score representation learning; Scoring card method; Propensity score; Machine learning; Bioinformatics; IDENTIFICATION; CANCER; EPITOPES; PEPTIDE; HYDROPHOBICITY; IMMUNOTHERAPY; IMMUNITY; IMMUNIZATION; RESPONSES; PROTEINS;
D O I
10.1016/j.compbiomed.2022.106368
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite the arsenal of existing cancer therapies, the ongoing recurrence and new cases of cancer pose a serious health concern that necessitates the development of new and effective treatments. Cancer immunotherapy, which uses the body's immune system to combat cancer, is a promising treatment option. As a result, in silico methods for identifying and characterizing tumor T cell antigens (TTCAs) would be useful for better understanding their functional mechanisms. Although few computational methods for TTCA identification have been developed, their lack of model interpretability is a major drawback. Thus, developing computational methods for the effective identification and characterization of TTCAs is a critical endeavor. PSRTTCA, a new machine learning (ML)-based approach for improving the identification and characterization of TTCAs based on their primary sequences, is proposed in this study. Specifically, we introduce a new propensity score representation learning algorithm that allows one to generate various sets of propensity scores of amino acids, dipeptides, and ggap dipeptides to be TTCAs. To enhance the predictive performance, optimal sets of variant propensity scores were determined and fed into the final meta-predictor (PSRTTCA). Benchmarking results revealed that PSRTTCA was a more precise and promising tool for the identification and characterization of TTCAs than conventional ML classifiers and existing methods. Furthermore, PSR-derived propensities of amino acids in becoming TTCAs are used to reveal the relationship between TTCAs and their informative physicochemical properties in order to provide insights into TTCA characteristics. Finally, a user-friendly online computational platform of PSRTTCA is publicly available at http://pmlabstack.pythonanywhere.com/PSRTTCA. The PSRTTCA predictor is anticipated to facilitate community-wide efforts in accelerating the discovery of novel TTCAs for cancer immunotherapy and other clinical applications.
引用
收藏
页数:11
相关论文
共 73 条
[1]   MHC-II neoantigens shape tumour immunity and response to immunotherapy [J].
Alspach, Elise ;
Lussier, Danielle M. ;
Miceli, Alexander P. ;
Kizhvatov, Ilya ;
DuPage, Michel ;
Luoma, Adrienne M. ;
Meng, Wei ;
Lichti, Cheryl F. ;
Esaulova, Ekaterina ;
Vomund, Anthony N. ;
Runci, Daniele ;
Ward, Jeffrey P. ;
Gubin, Matthew M. ;
Medrano, Ruan F. V. ;
Arthur, Cora D. ;
White, J. Michael ;
Sheehan, Kathleen C. F. ;
Chen, Alex ;
Wucherpfennig, Kai W. ;
Jacks, Tyler ;
Unanue, Emil R. ;
Artyomov, Maxim N. ;
Schreiber, Robert D. .
NATURE, 2019, 574 (7780) :696-+
[2]   CAR T-cell Therapy: A New Era in Cancer Immunotherapy [J].
Androulla, Miliotou N. ;
Lefkothea, Papadopoulou C. .
CURRENT PHARMACEUTICAL BIOTECHNOLOGY, 2018, 19 (01) :5-18
[3]   Issues in bioinformatics benchmarking: the case study of multiple sequence alignment [J].
Aniba, Mohamed Radhouene ;
Poch, Olivier ;
Thompson, Julie D. .
NUCLEIC ACIDS RESEARCH, 2010, 38 (21) :7353-7363
[4]   Estimating confidence intervals for information transfer analysis of confusion matrices [J].
Azadpour, Mahan ;
McKay, Colette M. ;
Smith, Robert L. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (03) :EL140-EL146
[5]   TTAgP 1.0: A computational tool for the specific prediction of tumor T cell antigens [J].
Beltran Lissabet, Jorge Felix ;
Herrera Belen, Lisandra ;
Farias, Jorge G. .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 83
[6]   DEVELOPMENT OF HYDROPHOBICITY PARAMETERS TO ANALYZE PROTEINS WHICH BEAR POSTTRANSLATIONAL OR COTRANSLATIONAL MODIFICATIONS [J].
BLACK, SD ;
MOULD, DR .
ANALYTICAL BIOCHEMISTRY, 1991, 193 (01) :72-82
[7]   mACPpred: A Support Vector Machine-Based Meta-Predictor for Identification of Anticancer Peptides [J].
Boopathi, Vinothini ;
Subramaniyam, Sathiyamoorthy ;
Malik, Adeel ;
Lee, Gwang ;
Manavalan, Balachandran ;
Yang, Deok-Chun .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (08)
[8]  
Breckpot Karine, 2009, Endocrine Metabolic & Immune Disorders-Drug Targets, V9, P328
[9]  
Breiman L, 1996, MACH LEARN, V24, P49
[10]   Properties of MHC Class I Presented Peptides That Enhance Immunogenicity [J].
Calis, Jorg J. A. ;
Maybeno, Matt ;
Greenbaum, Jason A. ;
Weiskopf, Daniela ;
De Silva, Aruna D. ;
Sette, Alessandro ;
Kesmir, Can ;
Peters, Bjoern .
PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (10)