A novel statistical method for decontaminating T-cell receptor sequencing data

被引:1
|
作者
Li, Ruoxing [1 ]
Altan, Mehmet [2 ]
Reuben, Alexandre [2 ]
Lin, Ruitao [8 ]
Heymach, John, V [3 ]
Tran, Hai [2 ]
Chen, Runzhe [4 ]
Little, Latasha [5 ]
Hubert, Shawna [6 ]
Zhang, Jianjun [7 ,9 ,10 ,11 ]
Li, Ziyi [8 ,12 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Dept Biostat & Data Sci, Houston, TX USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Thorac Head & Neck Med Oncol, Houston, TX USA
[3] Univ Texas MD Anderson Canc Ctr, Chair Thorac Head & Neck Med Oncol, Houston, TX USA
[4] Univ Texas MD Anderson Canc Ctr, Houston, TX USA
[5] Univ Texas MD Anderson Canc Ctr, Dept Thorac Head & Neck Med Oncol, Houston, TX USA
[6] Univ Texas MD Anderson Canc Ctr, Chair Thorac Head & Neck Med Oncol, Houston, TX USA
[7] Univ Texas MD Anderson Canc Ctr, Dept Thorac Head & Neck Med Oncol, Houston, TX USA
[8] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX USA
[9] Univ Texas MD Anderson Canc Ctr, Lung Canc Genom Program, Houston, TX 77030 USA
[10] Univ Texas MD Anderson Canc Ctr, Lung Canc Intercept Program, Houston, TX 77030 USA
[11] Univ Texas MD Anderson Canc Ctr, Dept Thorac Head & Neck Med Oncol, Houston, TX 77030 USA
[12] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
关键词
Bayesian model; Contamination detection; TCR sequencing; TCR REPERTOIRE;
D O I
10.1093/bib/bbad230
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The T-cell receptor (TCR) repertoire is highly diverse among the population and plays an essential role in initiating multiple immune processes. TCR sequencing (TCR-seq) has been developed to profile the T cell repertoire. Similar to other high-throughput experiments, contamination can happen during several steps of TCR-seq, including sample collection, preparation and sequencing. Such contamination creates artifacts in the data, leading to inaccurate or even biased results. Most existing methods assume 'clean' TCR-seq data as the starting point with no ability to handle data contamination. Here, we develop a novel statistical model to systematically detect and remove contamination in TCR-seq data. We summarize the observed contamination into two sources, pairwise and cross-cohort. For both sources, we provide visualizations and summary statistics to help users assess the severity of the contamination. Incorporating prior information from 14 existing TCR-seq datasets with minimum contamination, we develop a straightforward Bayesian model to statistically identify contaminated samples. We further provide strategies for removing the impacted sequences to allow for downstream analysis, thus avoiding any need to repeat experiments. Our proposed model shows robustness in contamination detection compared with a few off-the-shelf detection methods in simulation studies. We illustrate the use of our proposed method on two TCR-seq datasets generated locally.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] T-Cell Receptor Repertoire Sequencing in the Era of Cancer Immunotherapy
    Frank, Meredith L.
    Lu, Kaylene
    Erdogan, Can
    Han, Yi
    Hu, Jian
    Wang, Tao
    Heymach, John, V
    Zhang, Jianjun
    Reuben, Alexandre
    CLINICAL CANCER RESEARCH, 2023, 29 (06) : 994 - 1008
  • [2] T-Cell Receptor Repertoire Sequencing and Its Applications: Focus on Infectious Diseases and Cancer
    Mazzotti, Lucia
    Gaimari, Anna
    Bravaccini, Sara
    Maltoni, Roberta
    Cerchione, Claudio
    Juan, Manel
    Navarro, Europa Azucena-Gonzalez
    Pasetto, Anna
    Nascimento Silva, Daniela
    Ancarani, Valentina
    Sambri, Vittorio
    Calabro, Luana
    Martinelli, Giovanni
    Mazza, Massimiliano
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (15)
  • [3] Pathologic T-cell response in ischaemic failing hearts elucidated by T-cell receptor sequencing and phenotypic characterization
    Tang, Ting-Ting
    Zhu, Yi-Cheng
    Dong, Nian-Guo
    Zhang, Si
    Cai, Jie
    Zhang, Ling-Xue
    Han, Yue
    Xia, Ni
    Nie, Shao-Fang
    Zhang, Min
    Lv, Bing-Jie
    Jiao, Jiao
    Yang, Xiang-Ping
    Hu, Yu
    Liao, Yu-Hua
    Cheng, Xiang
    EUROPEAN HEART JOURNAL, 2019, 40 (48) : 3924 - +
  • [4] T-cell receptor sequencing in interrogating antigen-specific T-cell responses to foreign and self-antigens
    Johansson, Alexandra M.
    Kwok, William W.
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2024, 153 (06) : 1540 - 1542
  • [5] Genesis of the T-cell receptor
    Dupic, Thomas
    Marcou, Quentin
    Walczak, Aleksandra M.
    Mora, Thierry
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (03)
  • [6] High throughput sequencing of T-cell receptor repertoire using dry blood spots
    Shang-Gin Wu
    Wenjing Pan
    Hongna Liu
    Miranda L. Byrne-Steele
    Brittany Brown
    Mollye Depinet
    Xiaohong Hou
    Jian Han
    Song Li
    Journal of Translational Medicine, 17
  • [7] Rigorous benchmarking of T-cell receptor repertoire profiling methods for cancer RNA sequencing
    Peng, Kerui
    Nowicki, Theodore S.
    Campbell, Katie
    Vahed, Mohammad
    Peng, Dandan
    Meng, Yiting
    Nagareddy, Anish
    Huang, Yu-Ning
    Karlsberg, Aaron
    Miller, Zachary
    Brito, Jaqueline
    Nadel, Brian
    Pak, Victoria M.
    Abedalthagafi, Malak S.
    Burkhardt, Amanda M.
    Alachkar, Houda
    Ribas, Antoni
    Mangul, Serghei
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [8] High throughput sequencing of T-cell receptor repertoire using dry blood spots
    Wu, Shang-Gin
    Pan, Wenjing
    Liu, Hongna
    Byrne-Steele, Miranda L.
    Brown, Brittany
    Depinet, Mollye
    Hou, Xiaohong
    Han, Jian
    Li, Song
    JOURNAL OF TRANSLATIONAL MEDICINE, 2019, 17 (1)
  • [9] Diagnosing Viral Infections Through T-Cell Receptor Sequencing of Activated CD8+ T Cells
    Vujkovic, Alexandra
    Ha, My
    de Block, Tessa
    van Petersen, Lida
    Brosius, Isabel
    Theunissen, Caroline
    van Ierssel, Sabrina H.
    Bartholomeus, Esther
    Adriaensen, Wim
    Vanham, Guido
    Elias, George
    Van Damme, Pierre
    Van Tendeloo, Viggo
    Beutels, Philippe
    van Frankenhuijsen, Maartje
    Vlieghe, Erika
    Ogunjimi, Benson
    Laukens, Kris
    Meysman, Pieter
    Vercauteren, Koen
    JOURNAL OF INFECTIOUS DISEASES, 2024, 229 (02) : 507 - 516
  • [10] Bulk T-cell receptor sequencing confirms clonality in obstetric antiphospholipid syndrome and may as a potential biomarker
    Liu, Qi
    Yang, Shuo
    Tan, Yuan
    Feng, Weimin
    Wang, Qingchen
    Qiao, Jiao
    Yang, Boxing
    Wang, Chong
    Tao, Jingjin
    Wang, He
    Cui, Liyan
    AUTOIMMUNITY, 2024, 57 (01)