A Relational Model for One-Shot Classification of Images and Pen Strokes

被引:1
作者
Polis, Arturs [1 ]
Ilin, Alexander [1 ]
机构
[1] Aalto Univ, Espoo, Finland
基金
芬兰科学院;
关键词
One-shot learning; Relational learning; Transformer; Graph network; Omniglot;
D O I
10.1016/j.neucom.2022.06.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show that a deep learning model with built-in relational inductive bias can bring benefits to sample-efficient learning, without relying on extensive data augmentation. Our study shows that excellent results can be achieved with a model in which the relational inductive bias is applied to images, while building an efficient one-shot classifier on top of raw strokes is more challenging. The proposed one-shot classification model performs relational matching of a pair of inputs in the form of local and pairwise attention. Our approach solves with almost perfect accuracy the one-shot image classification Omniglot challenge when combined with a Hungarian matching algorithm and attains competitive results on the same task on characters represented as rotation-augmented strokes.(c) 2022 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http:// creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:1 / 13
页数:13
相关论文
共 36 条
  • [1] Ba Jimmy Lei, 2016, LAYER NORMALIZATION, DOI 10.48550/arXiv.1607.06450
  • [2] Battaglia P.W., 2018, Relational inductive biases, deep learning, and graph networks
  • [3] Emerging Properties in Self-Supervised Vision Transformers
    Caron, Mathilde
    Touvron, Hugo
    Misra, Ishan
    Jegou, Herve
    Mairal, Julien
    Bojanowski, Piotr
    Joulin, Armand
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9630 - 9640
  • [4] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [5] Dosovitskiy A., 2020, INT C LEARNING REPRE
  • [6] Finn C, 2017, PR MACH LEARN RES, V70
  • [7] Geirhos R., 2018, ARXIV
  • [8] Gilmer J, 2017, PR MACH LEARN RES, V70
  • [9] Ha D., 2018, INT C LEARN REPR
  • [10] Ioffe S., 2015, P 32 INT C MACHINE L, P448