MIFTel: a multimodal interactive framework based on temporal logic rules

被引:5
作者
Avola, Danilo [1 ]
Cinque, Luigi [1 ]
Del Bimbo, Alberto [2 ]
Marini, Marco Raoul [1 ]
机构
[1] Sapienza Univ, Dept Comp Sci, Via Salaria 113, I-00198 Rome, Italy
[2] Univ Florence, Dept Informat Engn, Via Santa Marta 3, I-50139 Florence, Italy
关键词
Event management; Human-computer interaction; Multimodal interaction; Natural interaction; Temporal logic; RECOGNITION;
D O I
10.1007/s11042-019-08590-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human-computer interfaces and multimodal interaction are increasingly used in everyday life. Environments equipped with sensors are able to acquire and interpret a wide range of information, thus assisting humans in several application areas, such as behaviour understanding, event detection, action recognition, and many others. In these areas, the suitable processing of this information is a key factor to properly structure multimodal data. In particular, heterogeneous devices and different acquisition times can be exploited to improve recognition results. On the basis of these assumptions, in this paper, a multimodal system based on Allen's temporal logic combined with a prevision method is proposed. The main target of the system is to correlate user's events with system's reactions. After the post-processing data coming from different acquisition devices (e.g., RGB images, depth maps, sounds, proximity sensors), the system manages the correlations between recognition/detection results and events, in real-time, thus creating an interactive environment for users. To increase the recognition reliability, a predictive model is also associated with the method. Modularity of the system grants a full dynamic development and upgrade with customized modules. Finally, comparisons with other similar systems are shown, thus underlining the high flexibility and robustness of the proposed event management method.
引用
收藏
页码:13533 / 13558
页数:26
相关论文
共 23 条
  • [21] Stratified pooling based deep convolutional neural networks for human action recognition
    Yu, Sheng
    Cheng, Yun
    Su, Songzhi
    Cai, Guorong
    Li, Shaozi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (11) : 13367 - 13382
  • [22] Hand gesture recognition using Leap Motion via deterministic learning
    Zeng, Wei
    Wang, Cong
    Wang, Qinghui
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (21) : 28185 - 28206
  • [23] Event Detection in Complex Scenes Using Interval Temporal Constraints
    Zhang, Yifan
    Ji, Qiang
    Lu, Hanqing
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3184 - 3191