行业报告详情 - 行业报告数据库

行业分类

找到报告 1 篇当前为第 1 页共 1 页

数据密集型集群中并行工作性能的优化技术

Optimizing Parallel Job Performance in Data-Intensive Clusters

作者：Ganesh Ananthanarayanan 作者单位：EECS Department, University of California, Berkeley 加工时间：2015-06-07 信息来源：EECS

关键词：集群技术；并行工作；数据密集型
摘要：A simple but key aspect of parallel jobs is the all-or-nothing property: unless all tasks of a job are provided equal improvement, there is no speedup in the completion of the job. The all-or-nothing property is critical for the promise of efficient and fault-tolerant parallel computations on large clusters. Meeting this promise in clusters of these scales is challenging and a key departure from prior work on distributed systems. This talk will look at the execution of a job from first principles and propose techniques spanning the software stack of data analytics systems such that its tasks achieve homogeneous performance while overcoming the various heterogeneities. To that end, we will propose techniques for (i) caching and cache replacement for parallel jobs, which outperforms even Belady's MIN (that uses an oracle), (ii) data locality, and (iii) straggler mitigation. Our analyses and evaluation are performed using workloads from Facebook and Bing production datacenters Along the way, we will also describe how we broke the myth of disk-locality's importance in datacenter computing.

行业分类

友情链接

联系我们

QQ咨询

电话咨询

微信公众号

感谢访问