A Critical Evaluation of Recent Deep Generative Sketch Models from a Human-Centered Perspective

Cited by: 0
Authors
Sabuncuoglu, Alpay [1]
Sezgin, T. Metin [1]
Affiliations
[1] Koc Univ, Istanbul, Turkey
Source
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022
Keywords
deep generative sketch models; human-centered design; field studies
DOI
10.1109/SIU55565.2022.9864823
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Drawing a sketch is a uniquely personal process that depends on previous knowledge, experiences, and current mood. Hence, the success of deep generative sketch models depends on user expectations. Yet the training of these models for unconditional generation does not consider human-centered metrics. To enable such a training process, we first need to understand the factors behind human perception of successful generated examples. To determine these factors, we designed a user study with twenty-one participants from different disciplines. In this study, participants ordered the output sketches of four recent generative models (Autoencoder, DCGAN, SketchRNN, and Sketchformer) from most to least recognizable. The results suggest that success in representing the distinct feature of a category is more important than other attributes such as spatial proportions or stroke counts. We share our code, interactive notebooks, and field study results to accelerate further analysis in the area.
Pages: 4