Human-computer interfaces and multimodal interaction are increasingly used in everyday life. Environments equipped with sensors can acquire and interpret a wide range of information, thus assisting humans in several application areas, such as behaviour understanding, event detection, and action recognition. In these areas, the suitable processing of this information is a key factor in properly structuring multimodal data. In particular, heterogeneous devices and different acquisition times can be exploited to improve recognition results. On the basis of these assumptions, this paper proposes a multimodal system based on Allen's temporal logic combined with a prediction method. The main aim of the system is to correlate users' events with the system's reactions. After post-processing the data coming from different acquisition devices (e.g., RGB images, depth maps, sounds, proximity sensors), the system manages the correlations between recognition/detection results and events in real time, thus creating an interactive environment for users. To increase recognition reliability, a predictive model is also associated with the method. The modularity of the system allows fully dynamic development and upgrading with customized modules. Finally, comparisons with other similar systems are presented, underlining the high flexibility and robustness of the proposed event management method.
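To make the role of Allen's temporal logic concrete, the sketch below classifies the relation between two time intervals (e.g., a detected user event and a system reaction) according to Allen's thirteen interval relations. This is a minimal illustrative implementation, not the paper's actual system; the `Interval` type and function name are assumptions introduced here for clarity.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """A closed time interval, e.g. the span of a detected event."""
    start: float
    end: float


def allen_relation(a: Interval, b: Interval) -> str:
    """Return the Allen relation holding between intervals a and b.

    Covers all 13 relations of Allen's interval algebra:
    before/after, meets/met-by, overlaps/overlapped-by,
    starts/started-by, during/contains, finishes/finished-by, equals.
    """
    if a.end < b.start:
        return "before"
    if b.end < a.start:
        return "after"
    if a.end == b.start:
        return "meets"
    if b.end == a.start:
        return "met-by"
    if a.start == b.start and a.end == b.end:
        return "equals"
    if a.start == b.start:
        return "starts" if a.end < b.end else "started-by"
    if a.end == b.end:
        return "finishes" if a.start > b.start else "finished-by"
    if b.start < a.start and a.end < b.end:
        return "during"
    if a.start < b.start and b.end < a.end:
        return "contains"
    # Remaining cases: proper overlap with no shared endpoints.
    return "overlaps" if a.start < b.start else "overlapped-by"
```

For instance, a "meets" relation between a detected gesture and a triggered reaction could indicate that the reaction began exactly when the gesture ended, which is the kind of temporal correlation the proposed system manages.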