Parallel failure recovery techniques in cluster-based media servers

被引：0

作者：

Lee, Joahyung ^{[1
]}

Jung, Inbum ^{[1
]}

机构：

[1] Kangwon Natl Univ, Dept Comp Sci & Engn, Chunchon, South Korea

来源：

JOURNAL OF SUPERCOMPUTING | 2010年 / 51卷 / 01期

关键词：

Streaming media; Parallel failure recovery; Cluster servers; QoS stream; STRATEGIES;

D O I：

10.1007/s11227-009-0305-6

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

For large-scale video-on-demand (VOD) service, cluster servers are highlighted due to their high performance and low cost. A cluster server consists of a front-end node and multiple backend nodes. Though the increase in backend nodes provides more quality of service (QoS) streams, the possibility of backend node failure is proportionally increased. The failure causes not only the cessation of streaming services but also the loss of current playing positions. In this paper, when a backend node fails, recovery mechanisms are studied to support the streaming service continuously. Without considering the characteristics of cluster-based servers and MPEG media, the basic redundant array of independent disks (RAID) techniques cause a network bottleneck in the internal network path and demonstrate inefficient CPU usage in backend nodes. To address these problems, a new failure recovery mechanism is proposed based on the pipeline computing concept. The proposed method not only distributes the internal network traffic generated from the recovery operations but also utilizes the CPU time available in the backend nodes. In the experiments, even if a backend node fails, the proposed method provides continuous streaming media services within a short MTTR value as well as more QoS streams than the existing method.

引用

页码：20 / 39

页数：20

共 16 条

[1]

BOLOSKY WJ, 1997, P 16 ACM S OP SYST P, P212

[2]

CHANG T, 1998, IEEE MASS STORAGE C, P181

[3] A multicast delivery scheme for VCR operations in a large VOD system [J].

Choi, JM ;

Lee, SW ;

Chung, KD .

PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, :555-561

[4] Approaches to recovery-oriented computing [J].

Fox, A ;

Patterson, D .

IEEE INTERNET COMPUTING, 2005, 9 (02) :14-16

[5] Data striping and reliability aspects in distributed video servers [J].

Jamel Gafsi ;

Ernst W. Biersack .

Cluster Computing, 1999, 2 (1) :75-91

[6] Modeling and performance comparison of reliability strategies for distributed video servers [J].

Gafsi, J ;

Biersack, EW .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2000, 11 (04) :412-430

[7] ARCHITECTURES AND ALGORITHMS FOR ONLINE FAILURE RECOVERY IN REDUNDANT DISK ARRAYS [J].

HOLLAND, M ;

GIBSON, GA ;

SIEWIOREK, DP .

DISTRIBUTED AND PARALLEL DATABASES, 1994, 2 (03) :295-335

[8] Modeling the Caching Effect in Continuous Media Servers [J].

Sooyong Kang ;

Heon Y. Yeom .

Multimedia Tools and Applications, 2003, 21 (3) :203-224

[9] ANALYTIC MODELING AND COMPARISONS OF STRIPING STRATEGIES FOR REPLICATED DISK ARRAYS [J].

MERCHANT, A ;

YU, PS .

IEEE TRANSACTIONS ON COMPUTERS, 1995, 44 (03) :419-433

[10]

PATTERSON DA, 1998, COMPUTER ORG DESIGN, P392

← 1 2 →