Scalable I/O and analytics

被引:14
作者
Choudhary, Alok [1 ]
Liao, Wei-keng [1 ]
Gao, Kui [1 ]
Nisar, Arifa [1 ]
Ross, Robert [2 ]
Thakur, Rajeev [2 ]
Latham, Robert [2 ]
机构
[1] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[2] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
来源
SCIDAC 2009: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING | 2009年 / 180卷
基金
美国国家科学基金会;
关键词
D O I
10.1088/1742-6596/180/1/012048
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High-performance computing systems have already approached peta-scale with hundreds of thousands of processors/cores in many deployments. These systems promise a new level of predictive and knowledge discovery ability as researchers gain the capability to model dependencies between phenomena at scales not seen earlier. These applications are highly I/O and data intensive, leading scientists to observe that performing I/O and subsequent analyses are major bottlenecks in effectively utilizing peta-scale systems and a major hurdle in accelerating discoveries. Although significant progress has been made in performance, interfaces, and middleware runtime systems for I/O in the recent past, significantly more research and development needs to be carried out to scale the performance to the desired levels for systems containing tens to hundreds of thousands of cores. In this work we outline our recent achievements and current research for designing scalable I/O software and enabling data analytics in storage systems. We also enumerate key challenges for the I/O systems and discuss ongoing efforts that address these challenges.
引用
收藏
页数:10
相关论文
共 30 条
  • [1] Active disks: Programming model, algorithms and evaluation
    Acharya, A
    Uysal, M
    Saltz, J
    [J]. ACM SIGPLAN NOTICES, 1998, 33 (11) : 81 - 91
  • [2] ALMASI G, 2004, EUR C PAR PROC
  • [3] [Anonymous], HIER DAT FORM VERS 5
  • [4] [Anonymous], 1997, ANLMCSTM234
  • [5] CARNS P, 2000, 3 ANN LIN SHOWC C OC, P317
  • [6] CHING A, 2003, IEEE ACM INT S CLUST
  • [7] COLOMA K, 2004, INT PAR DISTR PROC S
  • [8] COLOMA K, 2005, 20 INT SUP C
  • [9] COLOMA K, 2006, IEEE C CLUST COMP
  • [10] DELROSARIO J, 1993, WORKSH I O PAR COMP, P56