Diffusion-Based Reinforcement Learning for Edge-Enabled AI-Generated Content Services

被引:49
作者
Du, Hongyang [1 ]
Li, Zonghang [2 ]
Niyato, Dusit [1 ]
Kang, Jiawen [3 ]
Xiong, Zehui [4 ]
Huang, Huawei [5 ]
Mao, Shiwen [6 ]
机构
[1] Nanyang Technol Univ, Energy Res Inst NTU, Sch Comp Sci & Engn, Interdisciplinary Grad Program, Singapore 639798, Singapore
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 610054, Peoples R China
[3] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[4] Singapore Univ Technol & Design, Pillar Informat Syst Technol & Design, Singapore 487372, Singapore
[5] Sun Yat Sen Univ, Sch Software Engn, Zhuhai 519082, Peoples R China
[6] Auburn Univ, Dept Elect & Comp Engn, Auburn, AL 36849 USA
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Computational modeling; Metaverse; Task analysis; Servers; Biological system modeling; Mathematical models; Performance evaluation; AI-generated content; and deep reinforcement learning; diffusion models; wireless networks; generative AI;
D O I
10.1109/TMC.2024.3356178
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Metaverse emerges as the next-generation Internet paradigm, the ability to efficiently generate content is paramount. AI-Generated Content (AIGC) emerges as a key solution, yet the resource-intensive nature of large Generative AI (GAI) models presents challenges. To address this issue, we introduce an AIGC-as-a-Service (AaaS) architecture, which deploys AIGC models in wireless edge networks to ensure broad AIGC services accessibility for Metaverse users. Nonetheless, an important aspect of providing personalized user experiences requires carefully selecting AIGC Service Providers (ASPs) capable of effectively executing user tasks, which is complicated by environmental uncertainty and variability. Addressing this gap in current research, we introduce the AI-Generated Optimal Decision (AGOD) algorithm, a diffusion model-based approach for generating the optimal ASP selection decisions. Integrating AGOD with Deep Reinforcement Learning (DRL), we develop the Deep Diffusion Soft Actor-Critic (D2SAC) algorithm, enhancing the efficiency and effectiveness of ASP selection. Our comprehensive experiments demonstrate that D2SAC outperforms seven leading DRL algorithms. Furthermore, the proposed AGOD algorithm has the potential for extension to various optimization problems in wireless networks, positioning it as a promising approach for future research on AIGC-driven services.
引用
收藏
页码:8902 / 8918
页数:17
相关论文
共 48 条
[1]  
AI S., 2024, Stable diffusion
[2]  
Aljanabi M., 2023, Iraqi Journal For Computer Science and Mathematics, V4, P62, DOI [DOI 10.52866/20IJCSM.2023.01.01.0018, 10.52866/20ijcsm.2023.01.01.0018, 10.52866/ijcsm.2023.01.01.0018]
[3]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[4]   Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey [J].
Chen, Wuhui ;
Qiu, Xiaoyu ;
Cai, Ting ;
Dai, Hong-Ning ;
Zheng, Zibin ;
Zhang, Yan .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03) :1659-1692
[5]   Reinforcement learning-based QoS/QoE-aware service function chaining in software-driven 5G slices [J].
Chen, Xi ;
Li, Zonghang ;
Zhang, Yupeng ;
Long, Ruiming ;
Yu, Hongfang ;
Du, Xiaojiang ;
Guizani, Mohsen .
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2018, 29 (11)
[6]   Performance analysis and overflowed traffic characterization in multiservice hierarchical wireless networks [J].
Chung, SP ;
Lee, JC .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2005, 4 (03) :904-918
[7]  
Cobbe K, 2019, 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019)
[8]   PSACCF: Prioritized Online Slice Admission Control Considering Fairness in 5G/B5G Networks [J].
Dai, Miao ;
Luo, Long ;
Ren, Jing ;
Yu, Hongfang ;
Sun, Gang .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (06) :4101-4114
[9]  
Desale S., 2015, International Journal of Computer Engineering in Research Trends, V351, P2349
[10]  
Du H., 2023, arXiv