SketchFormer: transformer-based approach for sketch recognition using vector images

被引:3
作者
Parihar, Anil Singh [1 ]
Jain, Gaurav [1 ]
Chopra, Shivang [1 ]
Chopra, Suransh [1 ]
机构
[1] Delhi Technol Univ, Machine Learning Res Lab, Dept Comp Sci & Engn, New Delhi 110042, India
关键词
Sketch recognition; Transformers; Vector images; Deep learning; ALGORITHM;
D O I
10.1007/s11042-020-09837-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketches have been employed since the ancient era of cave paintings for simple illustrations to represent real-world entities and communication. The abstract nature and varied artistic styling make automatic recognition of these drawings more challenging than other areas of image classification. Moreover, the representation of sketches as a sequence of strokes instead of raster images introduces them at the correct abstract level. However, dealing with images as a sequence of small information makes it challenging. In this paper, we propose a Transformer-based network, dubbed as AttentiveNet, for sketch recognition. This architecture incorporates ordinal information to perform the classification task in real-time through vector images. We employ the proposed model to isolate the discriminating strokes of each doodle using the attention mechanism of Transformers and perform an in-depth qualitative analysis of the isolated strokes for classification of the sketch. Experimental evaluation validates that the proposed network performs favorably against state-of-the-art techniques.
引用
收藏
页码:9075 / 9091
页数:17
相关论文
共 50 条
[1]   A novel hybrid antlion optimization algorithm for multi-objective task scheduling problems in cloud computing environments [J].
Abualigah, Laith ;
Diabat, Ali .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (01) :205-223
[2]  
Abualigah L, 2020, NEURAL COMPUT APPL, V32, P12381, DOI [10.1007/s00521-020-04839-1, 10.1007/s00521-020-05107-y]
[3]   Hybrid clustering analysis using improved krill herd algorithm [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
APPLIED INTELLIGENCE, 2018, 48 (11) :4047-4071
[4]   A combination of objective functions and hybrid Krill herd algorithm for text document clustering analysis [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 73 :111-125
[5]   A new feature selection method to improve the document clustering using particle swarm optimization algorithm [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 25 :456-466
[6]   Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin .
JOURNAL OF SUPERCOMPUTING, 2017, 73 (11) :4773-4795
[7]  
Abualigah LMQ, 2019, FEATURE SELECTION EN, DOI [DOI 10.1007/978-3-030-10674-4, 10.1007/978-3-030-10674-4]
[8]  
[Anonymous], 2013, Generating sequences with recurrent neural networks
[9]   Sketch recognition by fusion of temporal and image-based features [J].
Arandjelovic, Relja ;
Sezgin, Tevfik Metin .
PATTERN RECOGNITION, 2011, 44 (06) :1225-1234
[10]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473