In Silico Pipeline to Identify Tumor-Specific Antigens for Cancer Immunotherapy Using Exome Sequencing Data

被引:18
作者
Morazan-Fernandez, Diego [1 ]
Mora, Javier [2 ,3 ]
Molina-Mora, Jose Arturo [2 ,3 ]
机构
[1] Caja Costarricense Seguro Social, San Jose 10104, Costa Rica
[2] Univ Costa Rica, Ctr Invest Enfermedades Trop, Ctr Invest Cirugia & Canc, San Jose 2060, Costa Rica
[3] Univ Costa Rica, Fac Microbiol, San Jose 2060, Costa Rica
来源
PHENOMICS | 2023年 / 3卷 / 02期
关键词
Neopeptide; Colorectal cancer; Cancer vaccine; Neoantigens; Human leukocyte antigen; Costa Rica; COLORECTAL-CANCER; GENERATION; FREQUENCY;
D O I
10.1007/s43657-022-00084-9
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Tumor-specific antigens or neoantigens are peptides that are expressed only in cancer cells and not in healthy cells. Some of these molecules can induce an immune response, and therefore, their use in immunotherapeutic strategies based on cancer vaccines has been extensively explored. Studies based on these approaches have been triggered by the current high-throughput DNA sequencing technologies. However, there is no universal nor straightforward bioinformatic protocol to discover neoantigens using DNA sequencing data. Thus, we propose a bioinformatic protocol to detect tumor-specific antigens associated with single nucleotide variants (SNVs) or "mutations" in tumoral tissues. For this purpose, we used publicly available data to build our model, including exome sequencing data from colorectal cancer and healthy cells obtained from a single case, as well as frequent human leukocyte antigen (HLA) class I alleles in a specific population. HLA data from Costa Rican Central Valley population was selected as an example. The strategy included three main steps: (1) pre-processing of sequencing data; (2) variant calling analysis to detect tumor-specific SNVs in comparison with healthy tissue; and (3) prediction and characterization of peptides (protein fragments, the tumor-specific antigens) derived from the variants, in the context of their affinity with frequent alleles of the selected population. In our model data, we found 28 non-silent SNVs, present in 17 genes in chromosome one. The protocol yielded 23 strong binders peptides derived from the SNVs for frequent HLA class I alleles for the Costa Rican population. Although the analyses were performed as an example to implement the pipeline, to our knowledge, this is the first study of an in silico cancer vaccine using DNA sequencing data in the context of the HLA alleles. It is concluded that the standardized protocol was not only able to identify neoantigens in a specific but also provides a complete pipeline for the eventual design of cancer vaccines using the best bioinformatic practices.
引用
收藏
页码:130 / 137
页数:8
相关论文
共 53 条
[1]  
Almawi WY., 2022, BMC Genom, V23, P1, DOI [10.1186/S12864-022-08682-7/TABLES/4, DOI 10.1186/S12864-022-08682-7/TABLES/4]
[2]   NBPF1, a tumor suppressor candidate in neuroblastoma, exerts growth inhibitory effects by inducing a G1 cell cycle arrest [J].
Andries, Vanessa ;
Vandepoele, Karl ;
Staes, Katrien ;
Berx, Geert ;
Bogaert, Pieter ;
Van Isterdael, Gert ;
Ginneberge, Daisy ;
Parthoens, Eef ;
Vandenbussche, Jonathan ;
Gevaert, Kris ;
van Roy, Frans .
BMC CANCER, 2015, 15
[3]   Frequency of Class I and II HLA alleles in patients with lung cancer according to chemotherapy response and 5-year survival [J].
Araz, Omer ;
Ucar, Elif Yilmazel ;
Meral, Mehmet ;
Yalcin, Aslihan ;
Acemoglu, Hamit ;
Dogan, Hasan ;
Karaman, Adem ;
Aydin, Yener ;
Gorguner, Metin ;
Akgun, Metin .
CLINICAL RESPIRATORY JOURNAL, 2015, 9 (03) :297-304
[4]   High-resolution HLA allele and haplotype frequencies in majority and minority populations of Costa Rica and Nicaragua: Differential admixture proportions in neighboring countries [J].
Arrieta-Bolanos, E. ;
Madrigal-Sanchez, J. J. ;
Stein, J. E. ;
Orlich-Perez, P. ;
Moreira-Espinoza, M. J. ;
Paredes-Carias, E. ;
Vanegas-Padilla, Y. ;
Salazar-Sanchez, L. ;
Madrigal, J. A. ;
Marsh, S. G. E. ;
Shaw, B. E. .
HLA, 2018, 91 (06) :514-529
[5]   HLA-A, -B, -C,-DQB1, and-DRB1,3,4,5 allele and haplotype frequencies in the Costa Rica Central Valley Population and its relationship to worldwide populations [J].
Arrieta-Bolanos, Esteban ;
Maldonado-Torres, Hazael ;
Dimitriu, Oana ;
Hoddinott, Michael A. ;
Fowles, Finnuala ;
Shah, Anila ;
Orlich-Perez, Priscilla ;
McWhinnie, Alasdair J. ;
Alfaro-Bourrouet, Wilbert ;
Bujan-Boza, Willem ;
Little, Ann-Margaret ;
Salazar-Sanchez, Lizbeth ;
Madrigal, J. Alejandro .
HUMAN IMMUNOLOGY, 2011, 72 (01) :80-86
[6]   Insights into the mutation T1117I in the spike and the lineage B.1.1.389 of SARS-CoV-2 circulating in Costa Rica [J].
Arturo Molina-Mora, Jose .
GENE REPORTS, 2022, 27
[7]   Molecular Subtypes of Colorectal Cancer and Their Clinicopathologic Features, With an Emphasis on the Serrated Neoplasia Pathway [J].
Bae, Jeong Mo ;
Kim, Jung Ho ;
Kang, Gyeong Hoon .
ARCHIVES OF PATHOLOGY & LABORATORY MEDICINE, 2016, 140 (05) :406-412
[8]   Binding affinities of 438 HLA proteins to complete proteomes of seven pandemic viruses and distributions of strongest and weakest HLA peptide binders in populations worldwide [J].
Barquera, Rodrigo ;
Collen, Evelyn ;
Di, Da ;
Buhler, Stephane ;
Teixeira, Joao ;
Llamas, Bastien ;
Nunes, Jose M. ;
Sanchez-Mazas, Alicia .
HLA, 2020, 96 (03) :277-298
[9]   MuPeXI: prediction of neo-epitopes from tumor sequencing data [J].
Bjerregaard, Anne-Mette ;
Nielsen, Morten ;
Hadrup, Sine Reker ;
Szallasi, Zoltan ;
Eklund, Aron Charles .
CANCER IMMUNOLOGY IMMUNOTHERAPY, 2017, 66 (09) :1123-1130
[10]   Manipulation of FASTQ data with Galaxy [J].
Blankenberg, Daniel ;
Gordon, Assaf ;
Von Kuster, Gregory ;
Coraor, Nathan ;
Taylor, James ;
Nekrutenko, Anton .
BIOINFORMATICS, 2010, 26 (14) :1783-1785