Modern Hopfield Networks and Attention for Immune Repertoire Classification

被引:0
作者
Widrich, Michael [1 ,2 ]
Schafl, Bernhard [1 ,2 ]
Pavlovic, Milena [3 ,4 ]
Ramsauer, Hubert [2 ]
Gruber, Lukas [1 ,2 ]
Holzleitner, Markus [1 ,2 ]
Brandstetter, Johannes [1 ,2 ]
Sandve, Geir Kjetil [4 ]
Greiff, Victor [3 ]
Hochreiter, Sepp [1 ,2 ,5 ]
Klambauer, Gunter
机构
[1] Johannes Kepler Univ Linz, ELLIS Unit Linz, Linz, Austria
[2] Johannes Kepler Univ Linz, Inst Machine Learning, LIT Lab, Linz, Austria
[3] Univ Oslo, Dept Immunol, Oslo, Norway
[4] Univ Oslo, Dept Informat, Oslo, Norway
[5] Inst Adv Res Artificial Intelligence IARAI, Vienna, Austria
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷
基金
欧盟地平线“2020”;
关键词
T-CELL; FEATURES; STORAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A central mechanism in machine learning is to identify, store, and recognize patterns. How to learn, access, and retrieve such patterns is crucial in Hopfield networks and the more recent transformer architectures. We show that the attention mechanism of transformer architectures is actually the update rule of modern Hopfield networks that can store exponentially many patterns. We exploit this high storage capacity of modern Hopfield networks to solve a challenging multiple instance learning (MIL) problem in computational biology: immune repertoire classification. In immune repertoire classification, a vast number of immune receptors are used to predict the immune status of an individual. This constitutes a MIL problem with an unprecedentedly massive number of instances, two orders of magnitude larger than currently considered problems, and with an extremely low witness rate. Accurate and interpretable machine learning methods solving this problem could pave the way towards new vaccines and therapies, which is currently a very relevant research topic intensified by the COVID-19 crisis. In this work, we present our novel method DeepRC that integrates transformer-like attention, or equivalently modern Hopfield networks, into deep learning architectures for massive MIL such as immune repertoire classification. We demonstrate that DeepRC outperforms all other methods with respect to predictive performance on large-scale experiments including simulated and real-world virus infection data and enables the extraction of sequence motifs that are connected to a given disease class. Source code and datasets: https://github.com/ml-jku/DeepRC
引用
收藏
页数:14
相关论文
共 70 条
  • [1] Akbar R., 2019, BIORXIV
  • [2] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [3] [Anonymous], 2019, INT C MACH LEARN
  • [4] Arras L., 2019, LNCS (LNAI), V11700, P211, DOI [DOI 10.1007/978-3, 10.1007/978-3]
  • [5] Augmenting adaptive immunity: progress and challenges in the quantitative engineering and analysis of adaptive immune receptor repertoires
    Brown, Alex J.
    Snapkov, Igor
    Akbar, Rahmad
    Pavlovic, Milena
    Miho, Enkelejda
    Sandve, Geir K.
    Greiff, Victor
    [J]. MOLECULAR SYSTEMS DESIGN & ENGINEERING, 2019, 4 (04) : 701 - 736
  • [6] Multiple instance learning: A survey of problem characteristics and applications
    Carbonneau, Marc-Andre
    Cheplygina, Veronika
    Granger, Eric
    Gagnon, Ghyslain
    [J]. PATTERN RECOGNITION, 2018, 77 : 329 - 353
  • [7] VDJS']JServer: A Cloud-Based Analysis Portal and Data Commons for Immune Repertoire Sequences and Rearrangements
    Christley, Scott
    Scarborough, Walter
    Salinas, Eddie
    Rounds, William H.
    Toby, Inimary T.
    Fonner, John M.
    Levin, Mikhail K.
    Kim, Min
    Mock, Stephen A.
    Jordan, Christopher
    Ostmeyer, Jared
    Buntzman, Adam
    Rubelt, Florian
    Davila, Marco L.
    Monson, Nancy L.
    Scheuermann, Richard H.
    Cowell, Lindsay G.
    [J]. FRONTIERS IN IMMUNOLOGY, 2018, 9
  • [8] Tetramer-visualized gluten-specific CD4+T cells in blood as a potential diagnostic marker for coeliac disease without oral gluten challenge
    Christophersen, Asbjorn
    Raki, Melinda
    Bergseng, Elin
    Lundin, Knut E. A.
    Jahnsen, Jorgen
    Sollid, Ludvig M.
    Qiao, Shuo-Wang
    [J]. UNITED EUROPEAN GASTROENTEROLOGY JOURNAL, 2014, 2 (04) : 268 - 278
  • [9] iReceptor: A platform for querying and analyzing antibody/B-cell and T-cell receptor repertoire data across federated repositories
    Corrie, Brian D.
    Marthandan, Nishanth
    Zimonja, Bojan
    Jaglale, Jerome
    Zhou, Yang
    Barr, Emily
    Knoetze, Nicole
    Breden, Frances M. W.
    Christley, Scott
    Scott, Jamie K.
    Cowell, Lindsay G.
    Breden, Felix
    [J]. IMMUNOLOGICAL REVIEWS, 2018, 284 (01) : 24 - 41
  • [10] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554