Efficient Buffering and Scheduling for a Single-Chip Crosspoint-Queued Switch

被引:4
作者
Cao, Zizhong [1 ]
Panwar, Shivendra S. [1 ]
机构
[1] NYU, Polytech Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA
关键词
Single-chip; crossbar; scheduling; load balancing; deflection routing; buffer pooling; INPUT; ALGORITHM; TCP;
D O I
10.1109/TCOMM.2014.2318695
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The single-chip crosspoint-queued (CQ) switch is a compact switching architecture that has all its buffers placed at the crosspoints of input and output lines. Scheduling is also performed inside the switching core and does not rely on latency-limited communications with input or output line-cards. Compared with other legacy switching architectures, the CQ switch has the advantages of high throughput, minimal delay, low scheduling complexity, and no speedup requirement. However, the crosspoint buffers are small and segregated; thus, how to efficiently use the buffers and avoid packet drops remains a major problem that needs to be addressed. In this paper, we consider load balancing, deflection routing, and buffer pooling for efficient buffer sharing in the CQ switch. We also design scheduling algorithms to maintain the correct packet order even while employing multi-path switching and resolve contentions caused by multiplexing. All these techniques require modest hardware modifications and memory speedup in the switching core but can greatly boost the buffer utilizations by up to 10 times and reduce the packet drop rates by one to three orders of magnitude. Extensive simulations and analyses have been done to demonstrate the advantages of the proposed buffering and scheduling techniques. By pushing the on-chip memory to the limit of current ASIC technology, we show that a cell drop rate of 10(-8), which is low enough for practical uses, can be achieved under real Internet traffic traces corresponding to a load of 0.9.
引用
收藏
页码:2034 / 2050
页数:17
相关论文
共 40 条
[1]   Sizing router buffers [J].
Appenzeller, G ;
Keslassy, I ;
McKeown, N .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2004, 34 (04) :281-292
[2]  
Arpaci Mutlu, 2000, IEEE Communications Surveys & Tutorials, V3, P2, DOI 10.1109/COMST.2000.5340716
[3]  
Cao Z., 2014, TECHNICAL REPORT EFF
[4]  
Cao ZZ, 2012, PROCEEDINGS OF THE EIGHTH ACM/IEEE SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS'12), P111
[5]   A Dynamic Frame Sizing Algorithm for CICQ Switches with 100% Throughput [J].
Chang, Cheng-Shang ;
Hsu, Yu-Hao ;
Cheng, Jay ;
Lee, Duan-Shin .
IEEE INFOCOM 2009 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-5, 2009, :747-755
[6]  
Chang CS, 2004, IEEE INFOCOM SER, P1995
[7]   Load balanced Birkhoff-von Neumann switches, part II: multi-stage buffering [J].
Chang, CS ;
Lee, DS ;
Lien, CM .
COMPUTER COMMUNICATIONS, 2002, 25 (06) :623-634
[8]   Matching output queueing with a combined input/output-queued switch [J].
Chuang, ST ;
Goel, A ;
McKeown, N ;
Prabhakar, B .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1999, 17 (06) :1030-1039
[9]  
Chuang ST, 2005, IEEE INFOCOM SER, P981
[10]  
Claffy K., 2011, CAIDA ANONYMIZED 201