Diffusion-Based Reinforcement Learning for Edge-Enabled AI-Generated Content Services

被引:17
作者
Du, Hongyang [1 ]
Li, Zonghang [2 ]
Niyato, Dusit [1 ]
Kang, Jiawen [3 ]
Xiong, Zehui [4 ]
Huang, Huawei [5 ]
Mao, Shiwen [6 ]
机构
[1] Nanyang Technol Univ, Energy Res Inst NTU, Sch Comp Sci & Engn, Interdisciplinary Grad Program, Singapore 639798, Singapore
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 610054, Peoples R China
[3] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[4] Singapore Univ Technol & Design, Pillar Informat Syst Technol & Design, Singapore 487372, Singapore
[5] Sun Yat Sen Univ, Sch Software Engn, Zhuhai 519082, Peoples R China
[6] Auburn Univ, Dept Elect & Comp Engn, Auburn, AL 36849 USA
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Computational modeling; Metaverse; Task analysis; Servers; Biological system modeling; Mathematical models; Performance evaluation; AI-generated content; and deep reinforcement learning; diffusion models; wireless networks; generative AI;
D O I
10.1109/TMC.2024.3356178
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Metaverse emerges as the next-generation Internet paradigm, the ability to efficiently generate content is paramount. AI-Generated Content (AIGC) emerges as a key solution, yet the resource-intensive nature of large Generative AI (GAI) models presents challenges. To address this issue, we introduce an AIGC-as-a-Service (AaaS) architecture, which deploys AIGC models in wireless edge networks to ensure broad AIGC services accessibility for Metaverse users. Nonetheless, an important aspect of providing personalized user experiences requires carefully selecting AIGC Service Providers (ASPs) capable of effectively executing user tasks, which is complicated by environmental uncertainty and variability. Addressing this gap in current research, we introduce the AI-Generated Optimal Decision (AGOD) algorithm, a diffusion model-based approach for generating the optimal ASP selection decisions. Integrating AGOD with Deep Reinforcement Learning (DRL), we develop the Deep Diffusion Soft Actor-Critic (D2SAC) algorithm, enhancing the efficiency and effectiveness of ASP selection. Our comprehensive experiments demonstrate that D2SAC outperforms seven leading DRL algorithms. Furthermore, the proposed AGOD algorithm has the potential for extension to various optimization problems in wireless networks, positioning it as a promising approach for future research on AIGC-driven services.
引用
收藏
页码:8902 / 8918
页数:17
相关论文
共 48 条
  • [1] AI S., 2024, Stable diffusion
  • [2] Aljanabi M., 2023, Iraqi J. Comp. Sci. Mathem, V4, P62, DOI [10.52866/20ijcsm.2023.01.01.0018, DOI 10.52866/20IJCSM.2023.01.01.0018, DOI 10.52866/IJCSM.2023.01.01.0018]
  • [3] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [4] Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey
    Chen, Wuhui
    Qiu, Xiaoyu
    Cai, Ting
    Dai, Hong-Ning
    Zheng, Zibin
    Zhang, Yan
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03): : 1659 - 1692
  • [5] Reinforcement learning-based QoS/QoE-aware service function chaining in software-driven 5G slices
    Chen, Xi
    Li, Zonghang
    Zhang, Yupeng
    Long, Ruiming
    Yu, Hongfang
    Du, Xiaojiang
    Guizani, Mohsen
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2018, 29 (11):
  • [6] Performance analysis and overflowed traffic characterization in multiservice hierarchical wireless networks
    Chung, SP
    Lee, JC
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2005, 4 (03) : 904 - 918
  • [7] Cobbe K, 2019, 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019)
  • [8] PSACCF: Prioritized Online Slice Admission Control Considering Fairness in 5G/B5G Networks
    Dai, Miao
    Luo, Long
    Ren, Jing
    Yu, Hongfang
    Sun, Gang
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (06): : 4101 - 4114
  • [9] Desale S., 2015, Int. J. Comput. Eng. Res. Trends, V351, P2349
  • [10] Du H., 2023, IEEE Wireless Commun.