Multimodal Transformer for Nursing Activity Recognition

Cited by: 0
Authors
Ijaz, Momal [1]
Diaz, Renato [1]
Chen, Chen [1, 2]
Affiliations
[1] Department of Computer Science, University of Central Florida, United States
[2] Center for Research in Computer Vision, University of Central Florida, United States
Source
arXiv | 2022
Keywords
Benchmarking - Classification (of information) - Hospitals - Machine learning - Pattern recognition
DOI
Not available
Related papers
23 in total
  • [1] Multimodal Graph Transformer for Multimodal Question Answering
    He, Xuehai
    Wang, Xin Eric
    EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, 2023 : 189 - 200
  • [2] Multimodal Distillation for Egocentric Action Recognition
    Radevski, Gorjan
    Grujicic, Dusan
    Blaschko, Matthew
    Moens, Marie-Francine
    Tuytelaars, Tinne
    Proceedings of the IEEE International Conference on Computer Vision, 2023 : 5190 - 5201
  • [3] Context-Aware Pedestrian Trajectory Prediction with Multimodal Transformer
    Damirchi, Haleh
    Greenspan, Michael
    Etemad, Ali
    Proceedings - International Conference on Image Processing, ICIP, 2023 : 2535 - 2539
  • [4] Multimodal speech recognition for unmanned aerial vehicles
    Oneață, Dan
    Cucu, Horia
    Computers and Electrical Engineering, 2021, 90
  • [5] Transformer-based Multimodal Information Fusion for Facial Expression Analysis
    Zhang, Wei
    Qiu, Feng
    Wang, Suzhen
    Zeng, Hao
    Zhang, Zhimeng
    An, Rudong
    Ma, Bowen
    Ding, Yu
    IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2022, 2022-June : 2427 - 2436
  • [6] Palm Vein Recognition Network Combining Transformer and CNN
    Wu, Kai
    Shen, Wenzhong
    Jia, Dingding
    Liang, Juan
    Computer Engineering and Applications, 2023, 59 (24) : 98 - 109
  • [7] Review on Human Action Recognition Methods Based on Multimodal Data
    Wang, Cailing
    Yan, Jingjing
    Zhang, Zhidong
    Computer Engineering and Applications, 60 (09) : 1 - 18
  • [8] MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis
    Zheng, Jianbin
    Liu, Daqing
    Wang, Chaoyue
    Hu, Minghui
    Yang, Zuopeng
    Ding, Changxing
    Tao, Dacheng
    arXiv, 2023
  • [9] Multimodal Fake News Detection on Fakeddit Dataset Using Transformer-Based Architectures
    Kalra, Sakshi
    Kumar, Chitneedi Hemanth Sai
    Sharma, Yashvardhan
    Chauhan, Gajendra Singh
    Communications in Computer and Information Science, 2022, 1763 CCIS : 281 - 292
  • [10] Group-Attention Transformer for Fine-Grained Image Recognition
    Yan, Bo
    Wang, Siwei
    Zhu, En
    Liu, Xinwang
    Chen, Wei
    Communications in Computer and Information Science, 2022, 1587 CCIS : 40 - 54