Attention-Net: An Ensemble Sketch Recognition Approach Using Vector Images

被引:3
作者
Jain, Gaurav [1 ]
Chopra, Shivang [1 ]
Chopra, Suransh [1 ]
Parihar, Anil Singh [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Machine Learning Res Lab, New Delhi 110042, India
关键词
Feature extraction; Convolution; Image recognition; Task analysis; Support vector machines; Recurrent neural networks; Visualization; Attention mechanism and development; sketch recognition; temporal convolution network (TCN); transformers; visual cognition;
D O I
10.1109/TCDS.2020.3023055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For the past few decades, machines have replaced humans in several disciplines. However, machine cognition still lags behind the human capabilities. We address the machines' ability to recognize human drawn sketches in this work. Visual representations, such as sketches have long been a medium of communication for humans. For artificially intelligent systems to effectively immerse in interactive environments, it is required that machines understand such notations. The abstract nature and varied artistic styling of these sketches make automatic recognition of drawings more challenging than other areas of image classification. In this article, we use sketches represented as a sequence of strokes, i.e., as vector images, to effectively capture the long-term temporal dependencies in hand-drawn sketches. The proposed approach combines the self-attention capabilities of Transformers while effectively utilizing the long-term temporal dependencies through temporal convolution networks (TCNs) for sketch recognition. The confidence scores obtained from the two techniques are combined using triangular-norm (T-norm). Attention heat maps are plotted to isolate the discriminating parts of a sketch that contribute to sketch classification. The extensive quantitative and qualitative evaluation confirms that the proposed network performs favorably against state-of-the-art techniques.
引用
收藏
页码:136 / 145
页数:10
相关论文
共 38 条
  • [1] Bai S., 2018, ARXIV180301271
  • [2] Shoot less and Sketch more: An Efficient Sketch Classification Via Joining Graph Neural Networks and Few-shot Learning
    Bensalah, Asma
    Riba, Pau
    Fornes, Alicia
    Llados, Josep
    [J]. 2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 13TH IAPR INTERNATIONAL WORKSHOP ON GRAPHICS RECOGNITION (GREC 2019), VOL 1, 2019, : 80 - 85
  • [3] Dehghani M, 2018, UNIVERSAL TRANSFORME
  • [4] How Do Humans Sketch Objects?
    Eitz, Mathias
    Hays, James
    Alexa, Marc
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
  • [5] Sketch-Based Shape Retrieval
    Eitz, Mathias
    Richter, Ronald
    Boubekeur, Tamy
    Hildebrand, Kristian
    Alexa, Marc
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
  • [6] Ha D., 2018, A neural representation of sketch drawings
  • [7] Score level fusion of multimodal biometrics using triangular norms
    Hanmandlu, Madasu
    Grover, Jyotsana
    Gureja, Ankit
    Gupta, H. M.
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (14) : 1843 - 1850
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Huang Z, 2014, ACM T GRAPHIC, V33, DOI [10.1145/2661228.2661280, 10.1145/2661229.2661280]
  • [10] Keyrouz F, 2018, 2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), P40