Peta-Scale Data Warehousing at Yahoo!

被引:0
|
作者
Ahuja, Mona [1 ]
Chen, Cheng Che [1 ]
Gottapu, Ravi [1 ]
Hallmann, Joerg [1 ]
Hasan, Waqar [1 ]
Johnson, Richard [1 ]
Kozyrczak, Maciek [1 ]
Pabbati, Ramesh [1 ]
Pandit, Neeta [1 ]
Pokuri, Sreenivasulu [1 ]
Uppala, Krishna [1 ]
机构
[1] Yahoo Inc, Sunnyvale, CA 94089 USA
来源
ACM SIGMOD/PODS 2009 CONFERENCE | 2009年
关键词
Column Database; MPP Database; Vector Query Processing; Column Storage; Data Warehousing; Analytics; Business Intelligence;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Insights based on detailed data on consumer behavior, product performance and marketplace behavior are driving innovation and competition in the internet space. We introduce Everest, a SQL-compliant data warehousing engine, based on a column architecture that we have built and deployed at Yahoo!. In contrast to commercially available engines, this massively parallel engine, based on commodity hardware, offers scale, flexibility, specialized analytic operations, and lower administrative & hardware costs. In this paper, we describe the business motivation and the software and deployment architecture of Everest. The engine is in production at Yahoo! since 2007 and currently manages over six petabytes of data.
引用
收藏
页码:855 / 861
页数:7
相关论文
共 50 条
  • [31] Towards a framework for evaluating investments in data warehousing
    Counihan, A
    Finnegan, P
    Sammon, D
    INFORMATION SYSTEMS JOURNAL, 2002, 12 (04) : 321 - 338
  • [32] Data Warehousing in the Context of a Bologna Undergraduate Degree
    Victor Ramos, Jose
    Oliveira, Rui
    2013 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2013, : 540 - 544
  • [33] Clickstream Data Warehousing for Web Crawlers Profiling
    Lourenco, Analia
    Belo, Orlando
    WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL I, 2011, : 343 - 348
  • [34] A Survey of Spatio-Temporal Data Warehousing
    Gomez, Leticia
    Kuijpers, Bart
    Moelans, Bart
    Vaisman, Alejandro
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2009, 5 (03) : 28 - 55
  • [35] Simplification of OWL ontology sources for Data Warehousing
    Laadidi, Yassine
    Bahaj, Mohamed
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION MANAGEMENT (ICSIM 2018) / WORKSHOP 2018 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (ICBDSC 2018), 2018, : 77 - 81
  • [36] Enterprise modeling and Data Warehousing in TELECOM ITALIA
    Calvanese, D
    Dragone, L
    Nardi, D
    Rosati, R
    Trisolini, SM
    INFORMATION SYSTEMS, 2006, 31 (01) : 1 - 32
  • [37] The epiphany of data warehousing technologies in the pharmaceutical industry
    Barrett, JS
    Koprowski, SP
    INTERNATIONAL JOURNAL OF CLINICAL PHARMACOLOGY AND THERAPEUTICS, 2002, 40 (03) : S3 - S13
  • [38] XSX: Lightweight Encryption for Data Warehousing Environments
    Santos, Ricardo Jorge
    Vieira, Marco
    Bernardino, Jorge
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2016, 2016, 9829 : 281 - 295
  • [39] Data-Warehousing Applications in Manufacturing Industry - Applicable Solutions and Challenges Faced
    Ramesh, Goparaju V.
    Rao, Sattiraju N.
    Shashi, Mogalla
    ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL, 2011, 125 : 70 - +
  • [40] A Dynamic Data Warehousing Platform for Creating and Accessing Biomedical Data Lakes
    Kathiravelu, Pradeeban
    Sharma, Ashish
    DATA MANAGEMENT AND ANALYTICS FOR MEDICINE AND HEALTHCARE, 2017, 10186 : 101 - 120