Icon similarity model based on cognition and deep learning

Cited by: 1

Authors:

Wang, Linlin [1]

Zou, Yixuan [1]

Wang, Haiyan [1]

Xue, Chengqi [1]

Affiliations:

[1] Southeast Univ, Dept Mech Engn, 2 Southeast Univ Rd, Nanjing 211102, Jiangsu, Peoples R China

Source:

DISPLAYS | 2024 / Vol. 85

Funding:

National Natural Science Foundation of China;

Keywords:

Icon similarity; Cognition; Deep learning; Human-computer interface; Semantic; FEATURE-INTEGRATION-THEORY; AVAILABILITY; COMPONENTS; JUDGMENT; CONTOURS; OBJECT;

DOI:

10.1016/j.displa.2024.102864

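Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract: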

Human-computer cooperation guided by natural interaction, intelligent interaction, and human-computer integration is becoming a new trend in human-computer interfaces. An icon is an indispensable pictographic symbol in an interface that conveys pivotal semantics between humans and computers. Research on how humans perceive similar icons and how computers discriminate between them can reduce misunderstandings and facilitate transparent cooperation. This research therefore proceeds step by step from icon images to their extracted contours and then to four contour features: curvature, proportion, orientation, and line. The feature values were manipulated to obtain 360 similar icons, and a cognitive experiment with 25 participants explored the boundary values of the feature dimensions that produce different levels of similarity. These boundary values were then applied to deep learning to build a dataset of 1500 similar icons for training a discrimination model. The dataset was used to train a Siamese neural network whose branches are 16-layer Visual Geometry Group (VGG16) networks, with stochastic gradient descent as the training procedure. By outputting similarity levels and values, this combination of human cognition and deep learning is meaningful for establishing a consensus on icon semantics, including content and emotions. Taking icon similarity discrimination as an example, this study explored how computer vision can analyze and simulate human visual cognition. The evaluated accuracy was 90.82%. Precision was 90% for the high similarity level, 80.65% for medium, and 97.30% for low; recall was 100% for high, 89.29% for medium, and 83.72% for low. The results verify that the model can compensate for fuzzy cognition in humans and enable computers to cooperate efficiently.
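The abstract reports overall accuracy together with per-level precision and recall for three similarity levels (high, medium, low). As a minimal sketch of how such figures are conventionally derived from a confusion matrix, the Python below uses a toy 3x3 matrix. The counts, the variable names, and the helper functions `per_class_metrics` and `accuracy` are assumptions made for illustration only; they are not the paper's data or code.

```python
# Rows = true similarity level, columns = predicted similarity level.
# These counts are invented for illustration and are NOT the paper's results.
LEVELS = ["high", "medium", "low"]
confusion = [
    [8, 2, 0],  # true high
    [1, 7, 2],  # true medium
    [0, 1, 9],  # true low
]


def per_class_metrics(matrix, labels):
    """Return {label: (precision, recall)} for a square confusion matrix."""
    metrics = {}
    for i, label in enumerate(labels):
        true_pos = matrix[i][i]
        predicted = sum(row[i] for row in matrix)  # column sum: predicted as this level
        actual = sum(matrix[i])                    # row sum: truly this level
        precision = true_pos / predicted if predicted else 0.0
        recall = true_pos / actual if actual else 0.0
        metrics[label] = (precision, recall)
    return metrics


def accuracy(matrix):
    """Fraction of all samples that lie on the diagonal."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total if total else 0.0


metrics = per_class_metrics(confusion, LEVELS)
overall = accuracy(confusion)

for level in LEVELS:
    precision, recall = metrics[level]
    print(f"{level}: precision={precision:.4f}, recall={recall:.4f}")
print(f"accuracy={overall:.4f}")
```

Precision for a level divides the correct predictions by everything predicted as that level, while recall divides them by everything that truly belongs to it, which is why a level can pair perfect recall with lower precision, as the high level does in the abstract.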

References

Pages: 15

79 entries in total

[1] A model of symbiomemesis: machine education and communication as pillars for human-autonomy symbiosis [J].

Abbass, Hussein ;

Petraki, Eleni ;

Hussein, Aya ;

McCall, Finlay ;

Elsawah, Sondoss .

PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2021, 379 (2207)

[2] Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J].

Alzubaidi, Laith ;

Zhang, Jinglan ;

Humaidi, Amjad J. ;

Al-Dujaili, Ayad ;

Duan, Ye ;

Al-Shamma, Omran ;

Santamaria, J. ;

Fadhel, Mohammed A. ;

Al-Amidie, Muthana ;

Farhan, Laith .

JOURNAL OF BIG DATA, 2021, 8 (01)

[3]

Anderson P., 2017, Bottom-up and top-down attention for image captioning and visual question answering

[4]

[Anonymous], 2015, Evaluating Machine Learning Models

[5] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING [J].

BIEDERMAN, I .

PSYCHOLOGICAL REVIEW, 1987, 94 (02) :115-147

[6] Evaluation of Deep Learning Strategies for Nucleus Segmentation in Fluorescence Images [J].

Caicedo, Juan C. ;

Roth, Jonathan ;

Goodman, Allen ;

Becker, Tim ;

Karhohs, Kyle W. ;

Broisin, Matthieu ;

Molnar, Csaba ;

McQuin, Claire ;

Singh, Shantanu ;

Theis, Fabian J. ;

Carpenter, Anne E. .

CYTOMETRY PART A, 2019, 95 (09) :952-965

[7]

Chauhan R, 2018, 2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), P278, DOI 10.1109/ICSCCC.2018.8703316

[8] An effective emotion tendency perception model in empathic dialogue [J].

Chen, Jiancu ;

Yang, Siyuan ;

Xiong, Jiang ;

Xiong, Yiping .

PLOS ONE, 2023, 18 (03)

[9] Numerical Proportion Representation: A Neurocomputational Account [J].

Chen, Qi ;

Verguts, Tom .

FRONTIERS IN HUMAN NEUROSCIENCE, 2017, 11

[10] Skeuomorphic or flat icons for an efficient visual search by younger and older adults? [J].

Chen, Ruoyu ;

Huang, Jincheng ;

Zhou, Jia .

APPLIED ERGONOMICS, 2020, 85
