Efficient Inference Offloading for Mixture-of-Experts Large Language Models in Internet of Medical Things

Cited by: 1
Authors
Yuan, Xiaoming [1, 2]
Kong, Weixuan [1]
Luo, Zhenyu [1]
Xu, Minrui [3]
Affiliations
[1] Northeastern Univ Qinhuangdao, Hebei Key Lab Marine Percept Network & Data Proc, Qinhuangdao 066004, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
Funding
National Natural Science Foundation of China;
Keywords
large language models; efficient inference offloading; mixture-of-experts; Internet of Medical Things;
DOI
10.3390/electronics13112077
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Despite significant recent advances in large language models (LLMs) for medical services, the difficulty of deploying LLMs in e-healthcare hinders complex medical applications in the Internet of Medical Things (IoMT). People are increasingly concerned about e-healthcare risks and privacy protection. Existing LLMs struggle to provide accurate medical questions and answers (Q&As) and to meet the resource demands of deployment in the IoMT. To address these challenges, we propose MedMixtral 8x7B, a new medical LLM based on the mixture-of-experts (MoE) architecture with an offloading strategy, which enables deployment on the IoMT and improves privacy protection for users. Additionally, we find that the significant factors affecting latency include the method of device interconnection, the location of the offloading servers, and the speed of the disk.
Pages: 17
Related papers
50 in total
  • [31] Embracing Large Language Models for Medical Applications: Opportunities and Challenges
    Karabacak, Mert
    Margetis, Konstantinos
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (05)
  • [32] Large language models for generating medical examinations: systematic review
    Artsi, Yaara
    Sorin, Vera
    Konen, Eli
    Glicksberg, Benjamin S.
    Nadkarni, Girish
    Klang, Eyal
    BMC MEDICAL EDUCATION, 2024, 24 (01)
  • [33] The rise of large language models in the medical field: A bibliometric analysis
    Qi, Wenhao
    Cao, Shihua
    Wang, Bin
    Zhu, Xiaohong
    Dong, Chaoqun
    He, Danni
    Chen, Yanfei
    Shi, Yankai
    Wang, BingSheng
    PROCEEDINGS 2024 IEEE INTERNATIONAL WORKSHOP ON FOUNDATION MODELS FOR CYBER-PHYSICAL SYSTEMS & INTERNET OF THINGS, FMSYS 2024, 2024, : 56 - 62
  • [34] The Role of Large Language Models in Medical Education: Applications and Implications
    Safranek, Conrad W.
    Sidamon-Eristoff, Anne Elizabeth
    Gilson, Aidan
    Chartash, David
    JMIR MEDICAL EDUCATION, 2023, 9
  • [35] Evaluating interactions of patients with large language models for medical information
    Carl, Nicolas
    Haggenmueller, Sarah
    Wies, Christoph
    Nguyen, Lisa
    Winterstein, Jana Theres
    Hetz, Martin Joachim
    Mangold, Maurin Helen
    Hartung, Friedrich Otto
    Gruene, Britta
    Holland-Letz, Tim
    Michel, Maurice Stephan
    Brinker, Titus Josef
    Wessels, Frederik
    BJU INTERNATIONAL, 2025, : 1010 - 1017
  • [36] Can large language models reason about medical questions?
    Lievin, Valentin
    Hother, Christoffer Egeberg
    Motzfeldt, Andreas Geert
    Winther, Ole
    PATTERNS, 2024, 5 (03)
  • [37] Impact of Large Language Models on Medical Education and Teaching Adaptations
    Li, Zhui
    Yhap, Nina
    Liu, Liping
    Wang, Zhengjie
    Xiong, Zhonghao
    Yuan, Xiaoshu
    Cui, Hong
    Liu, Xuexiu
    Ren, Wei
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [38] A secure and efficient framework for internet of medical things through blockchain driven customized federated learning
    Mazid, Abdul
    Kirmani, Sheeraz
    Abid, Manaullah
    Pawar, Vijayant
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (04)
  • [39] Large language models and medical education: a paradigm shift in educator roles
    Li, Zhui
    Li, Fenghe
    Fu, Qining
    Wang, Xuehu
    Liu, Hong
    Zhao, Yu
    Ren, Wei
    SMART LEARNING ENVIRONMENTS, 2024, 11 (01)
  • [40] MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering
    Alonso, Inigo
    Oronoz, Maite
    Agerri, Rodrigo
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 155