欢迎访问行业研究报告数据库

行业分类

当前位置:首页 > 报告详细信息

找到报告 1 篇 当前为第 1 页 共 1

交互式查询处理大数据系统:一个十字架MapReduce工作负载的行业研究

Interactive Query Processing in Big Data Systems: A Cross Industry Study of MapReduce Workloads

作者:Yanpei Chen Sara Alspaugh Randy H. Katz 加工时间:2013-12-30 信息来源:EECS 索取原文[13 页]
关键词:数据处理;工作负载;查询处理;昼夜模式
摘 要:Within the past few years, organizations in diverse indus- tries have adopted MapReduce-based systems for large-scale data processing. Along with these new users, important new workloads have emerged which feature many small, short,and increasingly interactive jobs in addition to the large, long-running batch jobs for which MapReduce was originally designed. As interactive, large-scale query processing (e.g.,OLAP) is a strength of the RDBMS community, it is impor- tant that lessons from that eld be carried over and applied where possible in this new domain. However, these new workloads have not yet been described in the literature. We ll this gap with an empirical analysis of MapReduce traces from six separate business-critical deployments inside Face-book and at Cloudera customers in e-commerce, telecommu-nications, media, and retail. Our key contribution is a char- acterization of new MapReduce workloads which are driven in part by interactive analysis, and which make heavy use of SQL-like programming frameworks on top of MapReduce.These workloads display diverse behaviors which invalidate prior assumptions about MapReduce such as uniform data access, regular diurnal patterns, and prevalence of large jobs.A secondary contribution is a rst step towards creating a TPC-like data processing benchmark for MapReduce.
© 2016 武汉世讯达文化传播有限责任公司 版权所有 技术支持:武汉中网维优
客服中心

QQ咨询


点击这里给我发消息 客服员


电话咨询


027-87841330


微信公众号




展开客服