Context awareness based Sketch-DeepNet architecture for hand-drawn sketches classification and recognition in AIoT

被引：0

作者：

Ali S. ^{[1
]}

Aslam N. ^{[1
]}

Kim D. ^{[2
]}

Abbas A. ^{[3
]}

Tufail S. ^{[1
]}

Azhar B. ^{[1
]}

机构：

[1] Department of Software Engineering, University of Lahore, Punjab, Lahore

[2] Department of Computer Engineering, Jeju National University, Jeju, Jeju

[3] Department of Computer Science, University of Central Punjab, Punjab, Lahore

来源：

PeerJ Computer Science | 2023年 / 9卷

关键词：

Convolutional neural networks (CNNs); Deep neural networks (DNNs); Sketch recognition; TU-Berlin;

D O I：

10.7717/PEERJ-CS.1186

中图分类号：

学科分类号：

摘要：

A sketch is a black-and-white, 2-D graphical representation of an object and contains fewer visual details as compared to a colored image. Despite fewer details, humans can recognize a sketch and its context very efficiently and consistently across languages, cultures, and age groups, but it is a difficult task for computers to recognize such low-detail sketches and get context out of them. With the tremendous increase in popularity of IoT devices such as smartphones and smart cameras, etc., it has become more critical to recognize free hand-drawn sketches in computer vision and human-computer interaction in order to build a successful artificial intelligence of things (AIoT) system that can first recognize the sketches and then understand the context of multiple drawings. Earlier models which addressed this problem are scaleinvariant feature transform (SIFT) and bag-of-words (BoW). Both SIFT and BoW used hand-crafted features and scale-invariant algorithms to address this issue. But these models are complex and time-consuming due to the manual process of features setup. The deep neural networks (DNNs) performed well with object recognition on many large-scale datasets such as ImageNet and CIFAR-10. However, the DDN approach cannot be carried out for hand-drawn sketches problems. The reason is that the data source is images, and all sketches in the images are, for example, ‘birds’ instead of their specific category (e.g., ‘sparrow’). Some deep learning approaches for sketch recognition problems exist in the literature, but the results are not promising because there is still room for improvement. This article proposed a convolutional neural network (CNN) architecture called Sketch-DeepNet for the sketch recognition task. The proposed Sketch-DeepNet architecture used the TU-Berlin dataset for classification. The experimental results show that the proposed method beats the performance of the state-of-the-art sketch classification methods. The proposed model achieved 95.05% accuracy as compared to existing models DeformNet (62.6%), Sketch-DNN (72.2%), Sketch-a-Net (77.95%), SketchNet (80.42%), Thinning-DNN (74.3%), CNN-PCA-SVM (72.5%), Hybrid-CNN (84.42%), and human recognition accuracy of 73% on the TU-Berlin dataset © Copyright 2023 Ali et al

引用

共 40 条

[1]

Ahn P, Shin DH, Kim J., Thinning deep neural networks for sketch recognition, 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), pp. 1-4, (2016)

[2]

Bui T, Ribeiro L, Ponti M, Collomosse J., Sketching out the details: sketch-based image retrieval using convolutional neural networks with multi-stage regression, Computers & Graphics, 71, 7, pp. 77-87, (2018)

[3]

Dahl GE, Sainath TN, Hinton GE., Improving deep neural networks for LVCSR using rectified linear units and dropout, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 8609-8613, (2013)

[4]

Dai J, Qi H, Xiong Y, Li Y, Zhang G, Hu H, Wei Y., Deformable convolutional networks, Proceedings of the IEEE International Conference on Computer Vision, (2017)

[5]

Dalal N, Triggs B., Histograms of oriented gradients for human detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), pp. 886-893, (2005)

[6]

Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L., ImageNet: a large-scale hierarchical image database, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248-255, (2009)

[7]

Eitz M, Hays J, Alexa M., How do humans sketch objects?, ACM Transactions on Graphics, 31, 4, pp. 1-10, (2012)

[8]

Eitz M, Hildebrand K, Boubekeur T, Alexa M., Sketch-based image retrieval: benchmark and bag-of-features descriptors, IEEE Transactions on Visualization and Computer Graphics, 17, 11, pp. 1624-1636, (2010)

[9]

Glorot X, Bordes A, Bengio Y., Deep sparse rectifier neural networks, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings, pp. 315-323, (2011)

[10]

The quick, draw! Dataset, (2017)

← 1 2 3 4 →