MIFTel: a multimodal interactive framework based on temporal logic rules

被引:5
作者
Avola, Danilo [1 ]
Cinque, Luigi [1 ]
Del Bimbo, Alberto [2 ]
Marini, Marco Raoul [1 ]
机构
[1] Sapienza Univ, Dept Comp Sci, Via Salaria 113, I-00198 Rome, Italy
[2] Univ Florence, Dept Informat Engn, Via Santa Marta 3, I-50139 Florence, Italy
关键词
Event management; Human-computer interaction; Multimodal interaction; Natural interaction; Temporal logic; RECOGNITION;
D O I
10.1007/s11042-019-08590-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human-computer interfaces and multimodal interaction are increasingly used in everyday life. Environments equipped with sensors are able to acquire and interpret a wide range of information, thus assisting humans in several application areas, such as behaviour understanding, event detection, action recognition, and many others. In these areas, the suitable processing of this information is a key factor to properly structure multimodal data. In particular, heterogeneous devices and different acquisition times can be exploited to improve recognition results. On the basis of these assumptions, in this paper, a multimodal system based on Allen's temporal logic combined with a prevision method is proposed. The main target of the system is to correlate user's events with system's reactions. After the post-processing data coming from different acquisition devices (e.g., RGB images, depth maps, sounds, proximity sensors), the system manages the correlations between recognition/detection results and events, in real-time, thus creating an interactive environment for users. To increase the recognition reliability, a predictive model is also associated with the method. Modularity of the system grants a full dynamic development and upgrade with customized modules. Finally, comparisons with other similar systems are shown, thus underlining the high flexibility and robustness of the proposed event management method.
引用
收藏
页码:13533 / 13558
页数:26
相关论文
共 23 条
  • [1] MAINTAINING KNOWLEDGE ABOUT TEMPORAL INTERVALS
    ALLEN, JF
    [J]. COMMUNICATIONS OF THE ACM, 1983, 26 (11) : 832 - 843
  • [2] Fusing depth and colour information for human action recognition
    Avola, Danilo
    Bernardi, Marco
    Foresti, Gian Luca
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (05) : 5919 - 5939
  • [3] An interactive and low-cost full body rehabilitation framework based on 3D immersive serious games
    Avola, Danilo
    Cinque, Luigi
    Foresti, Gian Luca
    Marini, Marco Raoul
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 89 : 81 - 100
  • [4] Exploiting Recurrent Neural Networks and Leap Motion Controller for the Recognition of Sign Language and Semaphoric Hand Gestures
    Avola, Danilo
    Bernardi, Marco
    Cinque, Luigi
    Foresti, Gian Luca
    Massaroni, Cristiano
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (01) : 234 - 245
  • [5] VRheab: a fully immersive motor rehabilitation system based on recurrent neural network
    Avola, Danilo
    Cinque, Luigi
    Foresti, Gian Luca
    Marini, Marco Raoul
    Pannone, Daniele
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (19) : 24955 - 24982
  • [6] A keypoint-based method for background modeling and foreground detection using a PTZ camera
    Avola, Danilo
    Cinque, Luigi
    Foresti, Gian Luca
    Massaroni, Cristiano
    Pannone, Daniele
    [J]. PATTERN RECOGNITION LETTERS, 2017, 96 : 96 - 105
  • [7] Multi-dimensional modal logic as a framework for spatio-temporal reasoning
    Bennett, B
    Cohn, AG
    Wolter, F
    Zakharyaschev, M
    [J]. APPLIED INTELLIGENCE, 2002, 17 (03) : 239 - 251
  • [8] Semantic Event Fusion of Different Visual Modality Concepts for Activity Recognition
    Crispim-Junior, Carlos F.
    Buso, Vincent
    Avgerinakis, Konstantinos
    Meditskos, Georgios
    Briassouli, Alexia
    Benois-Pineau, Jenny
    Kompatsiaris, Ioannis
    Bremond, Francois
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (08) : 1598 - 1611
  • [9] Unsupervised Person Re-identification: Clustering and Fine-tuning
    Fan, Hehe
    Zheng, Liang
    Yan, Chenggang
    Yang, Yi
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (04)
  • [10] Gang Chen, 2014, 2014 International Conference on Reconfigurable Computing and FPGAs (ReConFig14), P1, DOI 10.1109/ReConFig.2014.7032502