SketchFormer: transformer-based approach for sketch recognition using vector images

被引:3
作者
Parihar, Anil Singh [1 ]
Jain, Gaurav [1 ]
Chopra, Shivang [1 ]
Chopra, Suransh [1 ]
机构
[1] Delhi Technol Univ, Machine Learning Res Lab, Dept Comp Sci & Engn, New Delhi 110042, India
关键词
Sketch recognition; Transformers; Vector images; Deep learning; ALGORITHM;
D O I
10.1007/s11042-020-09837-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketches have been employed since the ancient era of cave paintings for simple illustrations to represent real-world entities and communication. The abstract nature and varied artistic styling make automatic recognition of these drawings more challenging than other areas of image classification. Moreover, the representation of sketches as a sequence of strokes instead of raster images introduces them at the correct abstract level. However, dealing with images as a sequence of small information makes it challenging. In this paper, we propose a Transformer-based network, dubbed as AttentiveNet, for sketch recognition. This architecture incorporates ordinal information to perform the classification task in real-time through vector images. We employ the proposed model to isolate the discriminating strokes of each doodle using the attention mechanism of Transformers and perform an in-depth qualitative analysis of the isolated strokes for classification of the sketch. Experimental evaluation validates that the proposed network performs favorably against state-of-the-art techniques.
引用
收藏
页码:9075 / 9091
页数:17
相关论文
共 50 条
  • [21] A transformer-based approach to irony and sarcasm detection
    Rolandos Alexandros Potamias
    Georgios Siolas
    Andreas - Georgios Stafylopatis
    Neural Computing and Applications, 2020, 32 : 17309 - 17320
  • [22] Automated efficient traffic gesture recognition using swin transformer-based multi-input deep network with radar images
    Firat, Huseyin
    Uzen, Huseyin
    Atila, Orhan
    Sengur, Abdulkadir
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [23] A transformer-based approach to irony and sarcasm detection
    Potamias, Rolandos Alexandros
    Siolas, Georgios
    Stafylopatis, Andreas-Georgios
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (23) : 17309 - 17320
  • [24] Transformer-Based Seismic Image Enhancement: A Novel Approach for Improved Resolution
    Park, Jin-Yeong
    Saad, Omar M.
    Oh, Ju-Won
    Alkhalifah, Tariq
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [25] Transformer-Based Model for Monocular Visual Odometry: A Video Understanding Approach
    Francani, Andre O.
    Maximo, Marcos R. O. A.
    IEEE ACCESS, 2025, 13 : 13959 - 13971
  • [26] A transformer-based method for the registration of terahertz security images with visible light images
    Shen, Liujia
    Zhou, Deliang
    Bai, Yechao
    PROCEEDINGS OF THE 2024 6TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING SYSTEMS, SSPS 2024, 2024, : 48 - 55
  • [27] Transformer-based Approaches for Personality Detection using the MBTI Model
    Lazo Vasquez, Ricardo
    Ochoa-Luna, Jose
    2021 XLVII LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2021), 2021,
  • [28] Lung Cancer Prediction Using Electronic Claims Records: A Transformer-Based Approach
    Chen, Huan-Yu
    Wang, Hui-Min
    Lin, Ching-Heng
    Yang, Rob
    Lee, Chi-Chun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (12) : 6062 - 6073
  • [29] ERTNet: an interpretable transformer-based framework for EEG emotion recognition
    Liu, Ruixiang
    Chao, Yihu
    Ma, Xuerui
    Sha, Xianzheng
    Sun, Limin
    Li, Shuo
    Chang, Shijie
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [30] Psychological disorder detection: A multimodal approach using a transformer-based hybrid model
    Ghosh, Debadrita
    Karande, Hema
    Gite, Shilpa
    Pradhan, Biswajeet
    METHODSX, 2024, 13