On-Line Learning and Optimization for Wireless Video Transmission

被引:19
作者
Zhang, Yu [1 ]
Fu, Fangwen [1 ]
van der Schaar, Mihaela [1 ]
机构
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
关键词
Layered Markov decision process; layered real-time dynamic programming; on-line learning; wireless video transmission; RESOURCE-ALLOCATION; NETWORKS;
D O I
10.1109/TSP.2010.2046040
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we address the problem of how to optimize the cross-layer transmission policy for delay-sensitive video streaming over slow-varying flat-fading wireless channels on-line, at transmission time, when the environment dynamics are unknown. We first formulate the cross-layer optimization using a systematic layered Markov decision process (MDP) framework, which complies with the layered architecture of the OSI stack. Subsequently, considering the unknown dynamics of the video sources and underlying wireless channels, we propose a layered real-time dynamic programming (LRTDP) algorithm, which requires no a priori knowledge about the source and network dynamics. LRTDP allows each layer to learn the dynamics on-the-fly, and adjusts its policy autonomously, based on their experienced dynamics as well as limited message exchanges with other layers. Unlike existing cross-layer methods, LRTDP optimizes the cross-layer policy in a layered and on-line fashion, exhibits a low computational complexity, requires limited message exchanges among layers, and is capable to adapt on-the-fly to the experienced environment dynamics. Finally, we prove that LRTDP converges to the optimal cross-layer policy asymptotically. Our numerical experiments show that LRTDP provides comparable performance to the idealized optimal cross-layer solutions based on complete knowledge.
引用
收藏
页码:3108 / 3124
页数:17
相关论文
共 39 条
[1]  
Albanese A, 1996, HIGH-SPEED NETWORKING FOR MULTIMEDIA APPLICATIONS, P247
[2]  
[Anonymous], MACHINE LEARNING
[3]  
[Anonymous], 1999, 80211 IEEE
[4]  
[Anonymous], ARTIFICIAL INTELLIGE
[5]  
Bertsekas D.P., 1989, PARALLEL DISTRIBUTED
[6]  
Boyd S., 2004, CONVEX OPTIMIZATION, VFirst, DOI DOI 10.1017/CBO9780511804441
[7]  
Breiman L., 1992, SOC IND APPL MATH
[8]   Cross-layer QoS analysis of opportunistic OFDM-TDMA and OFDMA networks [J].
Chang, Yu-Jung ;
Chien, Feng-Tsun ;
Kuo, C. -C. Jay .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2007, 25 (04) :657-666
[9]  
CHEN W, 2008, WIRELESS NETWORKS
[10]   Layering as optimization decomposition: A mathematical theory of network architectures [J].
Chiang, Mung ;
Low, Steven H. ;
Calderbank, A. Robert ;
Doyle, John C. .
PROCEEDINGS OF THE IEEE, 2007, 95 (01) :255-312