SketchFormer: transformer-based approach for sketch recognition using vector images

Cited by: 3
Authors
Parihar, Anil Singh [1]
Jain, Gaurav [1]
Chopra, Shivang [1]
Chopra, Suransh [1]
Affiliations
[1] Delhi Technol Univ, Machine Learning Res Lab, Dept Comp Sci & Engn, New Delhi 110042, India
Keywords
Sketch recognition; Transformers; Vector images; Deep learning; Algorithm
DOI
10.1007/s11042-020-09837-y
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sketches have been used since the era of cave paintings as simple illustrations for representing real-world entities and for communication. Their abstract nature and varied artistic styles make automatic recognition of these drawings more challenging than other image classification tasks. Representing a sketch as a sequence of strokes rather than as a raster image captures it at the appropriate level of abstraction. However, processing an image as a sequence of small units of information is itself challenging. In this paper, we propose a Transformer-based network, dubbed AttentiveNet, for sketch recognition. The architecture incorporates ordinal information to perform classification in real time from vector images. We use the proposed model to isolate the discriminating strokes of each doodle through the Transformer attention mechanism and perform an in-depth qualitative analysis of the isolated strokes for sketch classification. Experimental evaluation shows that the proposed network performs favorably against state-of-the-art techniques.
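The idea in the abstract of attending over an ordered stroke sequence can be illustrated with a small sketch. This is not the authors' AttentiveNet: it assumes a stroke-3 style encoding (dx, dy, pen_lifted), uses a single attention head with no learned weights, and all function names are hypothetical. It only shows how positional (ordinal) information can be added to each point and how attention weights can be aggregated per stroke.

```python
import math

# Hypothetical vector sketch: each point is (dx, dy, pen_lifted).
# pen_lifted == 1 marks the last point of a stroke (an assumption of this sketch).
sketch = [
    (0.0, 0.0, 0), (5.0, 0.0, 0), (0.0, 5.0, 1),   # stroke 0
    (-5.0, 0.0, 0), (0.0, -5.0, 1),                # stroke 1
]

def positional_encoding(pos, dim):
    """Sinusoidal positional encoding: sin on even indices, cos on odd ones."""
    return [
        math.sin(pos / 10000 ** (i / dim)) if i % 2 == 0
        else math.cos(pos / 10000 ** ((i - 1) / dim))
        for i in range(dim)
    ]

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention_weights(points):
    """Scaled dot-product self-attention weights with Q = K = input + position.

    A trained model would apply learned query/key projections; they are
    omitted here to keep the example minimal.
    """
    dim = len(points[0])
    x = [
        [f + p for f, p in zip(pt, positional_encoding(i, dim))]
        for i, pt in enumerate(points)
    ]
    scale = math.sqrt(dim)
    return [
        softmax([sum(a * b for a, b in zip(q, k)) / scale for k in x])
        for q in x
    ]

def stroke_attention(points):
    """Share of total attention received by each stroke (sums to 1)."""
    weights = self_attention_weights(points)
    n = len(points)
    stroke_ids, current = [], 0
    for _, _, pen_lifted in points:
        stroke_ids.append(current)
        if pen_lifted:
            current += 1
    totals = [0.0] * (max(stroke_ids) + 1)
    for row in weights:
        for j, w in enumerate(row):
            totals[stroke_ids[j]] += w / n
    return totals

if __name__ == "__main__":
    for idx, share in enumerate(stroke_attention(sketch)):
        print(f"stroke {idx}: attention share {share:.3f}")
```

With trained projections, strokes that receive a large share of attention would be candidates for the "discriminating strokes" the abstract refers to; with the untrained weights used here the shares carry no semantic meaning.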
Pages: 9075-9091
Page count: 17
Source: Multimedia Tools and Applications, 2021, 80: 9075-9091