Wide-Area Analytics with Multiple Resources

被引:50
作者
Hung, Chien-Chun [1 ]
Ananthanarayanan, Ganesh [2 ]
Golubchik, Leana [1 ]
Yu, Minlan [3 ]
Zhang, Mingyang [1 ]
机构
[1] USC, Los Angeles, CA 90089 USA
[2] Microsoft, Redmond, WA USA
[3] Harvard, Cambridge, MA USA
来源
EUROSYS '18: PROCEEDINGS OF THE THIRTEENTH EUROSYS CONFERENCE | 2018年
关键词
D O I
10.1145/3190508.3190528
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Running data-parallel jobs across geo-distributed sites has emerged as a promising direction due to the growing need for geo-distributed cluster deployment. A key difference between geo-distributed and intra-cluster jobs is the heterogeneous (and often constrained) nature of compute and network resources across the sites. We propose Tetrium, a system for multi-resource allocation in geodistributed clusters, that jointly considers both compute and network resources for task placement and job scheduling. Tetrium significantly reduces job response time, while incorporating several other performance goals with simple control knobs. Our EC2 deployment and trace-driven simulations suggest that Tetrium improves the average job response time by up to 78% compared to existing data-locality-based solutions, and up to 55% compared to Iridium, the recently proposed geo-distributed analytics system.
引用
收藏
页数:16
相关论文
共 46 条
  • [1] Ananthanarayanan G., 2010, P USENIX C OP SYST D
  • [2] Ananthanarayanan G., 2011, P 6 C COMP SYST EURO
  • [3] Ananthanarayanan G., 2013, P USENIX C NETWORKED
  • [4] Ananthanarayanan G., 2014, P USENIX C NETW SYST
  • [5] Ananthanarayanan G., 2012, P USENIX C NETW SYST
  • [6] [Anonymous], 2005, Algorithm Design
  • [7] [Anonymous], 2010, EDBT, DOI [DOI 10.1145/1739041.1739056, 10.1145/1739041.1739056]
  • [8] [Anonymous], 2011, P USENIX C NETW SYST
  • [9] Calder Matt, 2013, IMC
  • [10] Chekuri C., 2001, J ALGORITHMS