Multiple interaction modes, such as gestures and physical objects, coexist in the process of AR (Augmented Reality) embodied cognition. On the basis of a unified representation of AR embodied cognition and the complementary advantages of different modal information, multimodal interaction information can be semantically aligned and organically fused to achieve efficient and reliable transmission of uncertain or ambiguous interaction information between humans and AR systems. However, existing research offers limited insight into how multimodal fusion natural interactions affect higher-level interaction intention understanding, higher-quality cognitive behaviour utility, and lower interaction cognitive load in dynamic and complex AR embodied cognitive scenes. This article provides an in-depth analysis of the intervention and regulatory mechanisms of multimodal fusion natural interactions in AR embodied cognition. Through the rational organization and redesign of AR multimodal fusion interaction methods, the AR-EMFNI (AR Embodied Multimodal Fusion Natural Interactions) method is proposed, comprising five stages: deep interaction intention knowledge base construction, interaction mode enhancement fusion, real interaction intention reasoning, trust evaluation and optimization, and interaction task assistance guidance. A total of 109 participants majoring in Rail Transit Signal and Control were recruited, and five AR multimodal interactive situation systems and 3D-printed embodied cognitive interactive behaviour systems were developed around ZD6 switch machine assembly tasks. The experiments collected four types of data: knowledge acquisition, interactive intention reasoning, embodied interactive behaviour, and questionnaires. ANOVA was conducted with the AR interaction mode as the moderating variable. The experimental results indicate that AR gesture interaction enhances learners' natural closeness and direct presence in AR interaction behaviour, but requires robust recognition of learners' dynamic and complex hand movements. AR physical interaction improves the coverage and systematicity of knowledge transfer, but limits the selective construction and interactive presentation of AR embodied cognitive content. AR touch interaction reduces learners' cognitive load on AR devices, but requires careful configuration of implicit interaction parameters during the AR interaction process. The AR-SMFI (AR Simple Multimodal Fusion Interaction) condition shows that AR multimodal human-computer interaction is not simply the superposition and aggregation of input information, but rather the mutual supplementation and organic integration of information across different interaction modalities. Compared with the other four AR interaction methods, AR-EMFNI improved cognitive test performance by 15%, achieved 81% accuracy in intention reasoning, and reduced cognitive load by 36%. It effectively addresses the concurrent transmission of uncertain and ambiguous multimodal interaction information in AR embodied cognition, and promotes higher-level knowledge transfer, higher-quality embodied interaction behaviour, and a stronger flow experience.
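
The abstract does not detail the fusion algorithm itself. As a minimal sketch of one plausible reading of the interaction mode enhancement fusion, real interaction intention reasoning, trust evaluation, and assistance guidance stages, the following Python code performs a reliability-weighted late fusion over per-modality intent posteriors; all intent labels, trust weights, probabilities, and thresholds here are hypothetical illustrations, not the authors' implementation.

```python
import numpy as np

# Hypothetical intent labels for a ZD6 switch machine assembly task.
INTENTS = ["pick_part", "align_part", "fasten_bolt", "inspect", "request_help"]

def fuse_intent_posteriors(posteriors_by_modality, reliability):
    """Reliability-weighted late fusion of per-modality intent posteriors.

    posteriors_by_modality: modality name -> probability vector over INTENTS
    reliability: modality name -> trust weight in [0, 1] (trust evaluation stage)
    Returns the fused posterior and the inferred intent.
    """
    fused = np.zeros(len(INTENTS))
    total_weight = 0.0
    for modality, p in posteriors_by_modality.items():
        w = reliability.get(modality, 0.0)
        fused += w * np.asarray(p)   # modalities supplement, not just stack
        total_weight += w
    fused /= max(total_weight, 1e-9)  # renormalize over contributing modalities
    return fused, INTENTS[int(np.argmax(fused))]

# Example: the gesture channel is ambiguous between picking and aligning;
# physical-object and touch cues disambiguate it toward "align_part".
posteriors = {
    "gesture":  [0.40, 0.35, 0.10, 0.10, 0.05],
    "physical": [0.10, 0.60, 0.15, 0.10, 0.05],
    "touch":    [0.05, 0.55, 0.20, 0.15, 0.05],
}
trust = {"gesture": 0.6, "physical": 0.9, "touch": 0.8}

fused, intent = fuse_intent_posteriors(posteriors, trust)
if fused.max() < 0.5:  # low confidence -> fall back to task assistance guidance
    print("Low fusion confidence; trigger interaction task assistance guidance")
else:
    print(f"Inferred intent: {intent} (p={fused.max():.2f})")
```

The point of the weighting is that an unreliable channel (here, gesture recognition under dynamic, complex hand movements) is down-weighted rather than discarded, which matches the abstract's claim that fusion is mutual supplementation rather than simple superposition.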
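For the statistical analysis, a one-way ANOVA with AR interaction mode as the between-subjects factor can be run as sketched below; the group sizes total the reported 109 participants, but the score arrays are simulated placeholders, not the study's data.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Simulated cognitive-test scores per AR interaction mode (placeholder data).
scores = {
    "gesture":  rng.normal(70, 8, 22),
    "physical": rng.normal(72, 8, 22),
    "touch":    rng.normal(71, 8, 22),
    "AR-SMFI":  rng.normal(74, 8, 21),
    "AR-EMFNI": rng.normal(82, 8, 22),
}

# One-way ANOVA: does interaction mode affect cognitive test performance?
f_stat, p_value = stats.f_oneway(*scores.values())
print(f"F = {f_stat:.2f}, p = {p_value:.4f}")
```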