Modeling Traffic of Big Data Platform for Large Scale Datacenter Networks

被引:0
作者
Xie, Zhen [1 ,2 ]
Cao, Zheng [1 ]
Wang, Zhan [1 ]
Zang, Dawei [1 ]
Shao, En [1 ]
Sun, Ninghui [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) | 2016年
基金
中国国家自然科学基金;
关键词
modeling traffic; big data platform; datacenter; hadoop; storm;
D O I
10.1109/ICPADS.2016.36
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Prior to deployment, network designers often use simulators to pre-evaluate the performance of designed network with artificial network traffic. The traditional way of separating network design from real applications will not only result in over-designed network configurations, wasting money and energy, but also miss the real network demands of applications, degrading system performance. In this paper, we provide a method to model the network traffic of current popular big data platforms, which can observably improve the matching between network design and applications. The new method extracts communication behavior from the popular big data applications and replays the behavior instead of the packet traces. Experiments show that the traffic generated by the model is almost match the real traffic and the model can easily scale to thousands of nodes.
引用
收藏
页码:224 / 231
页数:8
相关论文
共 9 条
  • [1] Delimitrou Christina, IISWC 2012, P14
  • [2] Metric and workload effects on computer systems evaluation
    Feitelson, DG
    [J]. COMPUTER, 2003, 36 (09) : 18 - +
  • [3] Huang S., 2010, P ICDE WORKSH
  • [4] IBM Research Dublin, IBM SOFTW GROUP EUR
  • [5] Joo Y., 2001, P SIGCOMM
  • [6] Li H., 2010, IEEE TPDS, V21
  • [7] Owezarski Philippe, 2004, COMM IEEE INT C IEEE, V4
  • [8] Roy Arjun, 2015, P SIGCOMM
  • [9] Sheldon M X V M J, 2007, P 3 ANN WORKSH MOD B