Falcon: Fair and Efficient Online File Transfer Optimization

被引:3
作者
Arifuzzaman, Md [1 ]
Bockelman, Brian [2 ]
Basney, James [3 ]
Arslan, Engin [1 ]
机构
[1] Univ Nevada, Reno, NV 89557 USA
[2] Morgridge Inst Res, Madison, WI 53715 USA
[3] Univ Illinois, Natl Ctr Supercomp Applicat, Reno, NV 89557 USA
关键词
Online transfer optimization; throughput optimization in research networks; file transfer tuning; high-speed networks; GLOBAL OPTIMIZATION; MODEL;
D O I
10.1109/TPDS.2023.3282872
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Research networks provide high-speed wide-area network connectivity between research and education institutions to facilitate large-scale data transfers. However, scalability issues of legacy transfer applications such as scp and FTP hinder the effective utilization of these networks. Although researchers extended the legacy transfer applications to increase their performance by exploiting I/O and network parallelism, these solutions necessitate users to fine-tune parallelism level, a task that is challenging even for experienced users due to the dynamic nature of networks. In this article, we propose an online optimization algorithm, Falcon, to tune the degree of parallelism for file transfers to maximize transfer throughput while keeping system overhead at a minimum. As research networks are shared infrastructures, we introduce a game theory-inspired novel utility function to evaluate the performance of various parallelism levels such that competing transfers are guaranteed to converge to a fair and stable solution. We assessed the performance of Falcon in isolated and production high-speed networks and found that it can discover optimal transfer parallelism in as little as 20 seconds and outperform the state-of-the-art solutions by more than 2x. Moreover, Falcon is guaranteed to converge to Nash Equilibrium when multiple transfers compete for the same resources with the help of its game theory-inspired utility function. Finally, we demonstrate that Falcon can also be used as a central transfer scheduler to speed up convergence time, increase stability, and enforce system/user-level resource limitations in shared networks.
引用
收藏
页码:2265 / 2278
页数:14
相关论文
共 36 条
[21]  
Dong M, 2018, PROCEEDINGS OF THE 15TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI'18), P343
[22]  
Freed N., 2000, Smtp service extension for command pipelining
[23]  
Hacker TJ, 2005, 14TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, P265
[24]   Efficient global optimization of expensive black-box functions [J].
Jones, DR ;
Schonlau, M ;
Welch, WJ .
JOURNAL OF GLOBAL OPTIMIZATION, 1998, 13 (04) :455-492
[25]  
Leith D., 2004, P PFLDNET, V2004
[26]   Experimental evaluation of TCP protocols for high-speed networks [J].
Li, Yee-Ting ;
Leith, Douglas ;
Shorten, Robert N. .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2007, 15 (05) :1109-1122
[27]   Cross-Geography Scientific Data Transferring Trends and Behavior [J].
Liu, Zhengchun ;
Kettimuthu, Rajkumar ;
Foster, Ian ;
Rao, Nageswara S. V. .
HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2018, :267-278
[28]   A Two-Phase Dynamic Throughput Optimization Model for Big Data Transfers [J].
Nine, S. Q. Zulkar ;
Kosar, Tevfik .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (02) :269-280
[29]   Taking the Human Out of the Loop: A Review of Bayesian Optimization [J].
Shahriari, Bobak ;
Swersky, Kevin ;
Wang, Ziyu ;
Adams, Ryan P. ;
de Freitas, Nando .
PROCEEDINGS OF THE IEEE, 2016, 104 (01) :148-175
[30]  
Thaker P., OPTIMIZATION, V24, P11