Peta-Scale Data Warehousing at Yahoo!

Cited by: 0
Authors
Ahuja, Mona [1]
Chen, Cheng Che [1]
Gottapu, Ravi [1]
Hallmann, Joerg [1]
Hasan, Waqar [1]
Johnson, Richard [1]
Kozyrczak, Maciek [1]
Pabbati, Ramesh [1]
Pandit, Neeta [1]
Pokuri, Sreenivasulu [1]
Uppala, Krishna [1]
Affiliation
[1] Yahoo Inc, Sunnyvale, CA 94089 USA
Source
ACM SIGMOD/PODS 2009 Conference | 2009
Keywords
Column Database; MPP Database; Vector Query Processing; Column Storage; Data Warehousing; Analytics; Business Intelligence;
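DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract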
Insights based on detailed data on consumer behavior, product performance, and marketplace behavior are driving innovation and competition in the internet space. We introduce Everest, a SQL-compliant data warehousing engine based on a column architecture, which we have built and deployed at Yahoo!. In contrast to commercially available engines, this massively parallel engine, built on commodity hardware, offers scale, flexibility, specialized analytic operations, and lower administrative and hardware costs. In this paper, we describe the business motivation and the software and deployment architecture of Everest. The engine has been in production at Yahoo! since 2007 and currently manages over six petabytes of data.
Pages: 855-861
Page count: 7