Icon similarity model based on cognition and deep learning☆

被引：1

作者：

Wang, Linlin ^{[1
]}

Zou, Yixuan ^{[1
]}

Wang, Haiyan ^{[1
]}

Xue, Chengqi ^{[1
]}

机构：

[1] Southeast Univ, Dept Mech Engn, 2 Southeast Univ Rd, Nanjing 211102, Jiangsu, Peoples R China

来源：

DISPLAYS | 2024年 / 85卷

基金：

中国国家自然科学基金;

关键词：

Icon similarity; Cognition; Deep learning; Human-computer interface; Semantic; FEATURE-INTEGRATION-THEORY; AVAILABILITY; COMPONENTS; JUDGMENT; CONTOURS; OBJECT;

D O I：

10.1016/j.displa.2024.102864

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Human-computer cooperation guided by natural interaction, intelligent interaction, and human-computer integration is gradually becoming a new trend in human-computer interfaces. An icon is an indispensable pictographic symbol in an interface that can convey pivotal semantics between humans and computers. Research on similar icons' cognition in humans and the discrimination of computers can reduce misunderstandings and facilitate transparent cooperation. Therefore, this research focuses on images of icons, extracted contours, and four features, including the curvature, proportion, orientation, and line of the contour, step by step. By manipulating the feature value change to obtain 360 similar icons, a cognitive experiment was conducted with 25 participants to explore the boundary values of the feature dimensions that cause different levels of similarity. Its boundary values were applied to deep learning to train a discrimination algorithm model that included 1500 similar icons. This dataset was used to train a Siamese neural network using a 16-layer network branch of a visual geometry group. The training process used stochastic gradient descent. This method of combining human cognition and deep learning technology is meaningful for establishing a consensus on icon semantics, including content and emotions, by outputting similarity levels and values. Taking icon similarity discrimination as an example, this study explored the analysis and simulation methods of computer vision for human visual cognition. The accuracy evaluated is 90.82%. The precision was evaluated as 90% for high, 80.65% for medium, and 97.30% for low. Recall was evaluated as 100% for high, 89.29% for medium, and 83.72% for low. It has been verified that it can compensate for fuzzy cognition in humans and enable computers to cooperate efficiently.

引用

页数：15

共 79 条

[51] Quantifying Heuristic Bias: Anchoring, Availability, and Representativeness [J].

Richie, Megan ;

Josephson, S. Andrew .

TEACHING AND LEARNING IN MEDICINE, 2018, 30 (01) :67-75

[52]

Robey Alexander, 2021, Advances in Neural Information Processing Systems, V34

[53]

Sahu G, 2019, Arxiv, DOI arXiv:1904.06022

[54] The Situation Awareness Framework for Explainable AI (SAFE-AI) and Human Factors Considerations for XAI Systems [J].

Sanneman, Lindsay ;

Shah, Julie A. .

INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2022, 38 (18-20) :1772-1788

[55] What line drawings reveal about the visual brain [J].

Sayim, Bilge ;

Cavanagh, Patrick .

FRONTIERS IN HUMAN NEUROSCIENCE, 2011, 5

[56]

Shahmirzadi Omid, 2019, 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), P659, DOI 10.1109/ICMLA.2019.00120

[57] The effects of representation of industrial icons on visual search performance [J].

Shao, Jiang ;

Zhan, Yuhan ;

Zhu, Hui ;

Zhang, Mingming ;

Qin, Lang ;

Tian, Shangxin ;

Qi, Hongwei .

DISPLAYS, 2024, 82

[58]

Soujanya P., 2018, A multimodal multi-party dataset for emotion recognition in conversations

[59] Comparative analysis of deep learning image detection algorithms [J].

Srivastava, Shrey ;

Divekar, Amit Vishvas ;

Anilkumar, Chandu ;

Naik, Ishika ;

Kulkarni, Ved ;

Pattabiraman, V. .

JOURNAL OF BIG DATA, 2021, 8 (01)

[60] Deep learning in spiking neural networks [J].

Tavanaei, Amirhossein ;

Ghodrati, Masoud ;

Kheradpisheh, Saeed Reza ;

Masquelier, Timothee ;

Maida, Anthony .

NEURAL NETWORKS, 2019, 111 :47-63

← 1 2 3 4 5 6 7 8 →