Peta-Scale Data Warehousing at Yahoo!

被引:0
|
作者
Ahuja, Mona [1 ]
Chen, Cheng Che [1 ]
Gottapu, Ravi [1 ]
Hallmann, Joerg [1 ]
Hasan, Waqar [1 ]
Johnson, Richard [1 ]
Kozyrczak, Maciek [1 ]
Pabbati, Ramesh [1 ]
Pandit, Neeta [1 ]
Pokuri, Sreenivasulu [1 ]
Uppala, Krishna [1 ]
机构
[1] Yahoo Inc, Sunnyvale, CA 94089 USA
来源
ACM SIGMOD/PODS 2009 CONFERENCE | 2009年
关键词
Column Database; MPP Database; Vector Query Processing; Column Storage; Data Warehousing; Analytics; Business Intelligence;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Insights based on detailed data on consumer behavior, product performance and marketplace behavior are driving innovation and competition in the internet space. We introduce Everest, a SQL-compliant data warehousing engine, based on a column architecture that we have built and deployed at Yahoo!. In contrast to commercially available engines, this massively parallel engine, based on commodity hardware, offers scale, flexibility, specialized analytic operations, and lower administrative & hardware costs. In this paper, we describe the business motivation and the software and deployment architecture of Everest. The engine is in production at Yahoo! since 2007 and currently manages over six petabytes of data.
引用
收藏
页码:855 / 861
页数:7
相关论文
共 50 条
  • [2] Integrating Artificial Intelligence into Data Warehousing and Data Mining
    Madonsela, Nelson Sizwe.
    Mbecke, Paulin.
    Mbohwa, Charles
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2015, VOL II, 2015, : 819 - 823
  • [3] Data integration in data warehousing
    Calvanese, D
    De Giacomo, G
    Lenzerini, M
    Nardi, D
    Rosati, R
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2001, 10 (03) : 237 - 271
  • [4] Web warehousing: Web technology meets data warehousing
    Tan, Xin
    Yen, David C.
    Fang, Xiang
    Technology in Society, 2003, 25 (01) : 131 - 148
  • [5] Current trends in data warehousing methods and technologies
    Ivanova, Vera
    ICEIS 2006: Proceedings of the Eighth International Conference on Enterprise Informational Systems: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2006, : 297 - 301
  • [6] A Casestudy of Data Models in Data Warehousing
    Mishra, Deepti
    Yazici, Ali
    Basaran, Beril Pinar
    2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 321 - 326
  • [7] A Framework for Data Quality in Data Warehousing
    Nemani, Rao R.
    Konda, Ramesh
    INFORMATION SYSTEMS: MODELING, DEVELOPMENT, AND INTEGRATION: THIRD INTERNATIONAL UNITED INFORMATION SYSTEMS CONFERENCE, UNISCON 2009, 2009, 20 : 292 - +
  • [8] A Survey on Temporal Data Warehousing
    Golfarelli, Matteo
    Rizzi, Stefano
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2009, 5 (01) : 1 - 17
  • [9] RAIN: Always on data warehousing
    Vieira, Jorge
    Vieira, Marco
    Costa, Marco
    Madeira, Henrique
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 678 - +
  • [10] Data Warehousing of Vehicle Trajectory
    Tang, Baicheng
    Shen, Guicheng
    Zhang, Cailin
    PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 935 - 938