Dense LU Factorization on Multicore Supercomputer Nodes

Authors: A. Arya, J. Lifflander, L. Kale, P. Miller, R. Venkataraman, T. Jones. Processed: 2014-09-26. Source: Science and Technology Reports (DE)

Keywords: benchmarks; communication; computer architecture; supercomputers
Abstract: Dense LU factorization is a prominent benchmark used to rank the performance of supercomputers. Many implementations use block-cyclic distributions of matrix blocks onto a two-dimensional process grid. The process grid dimensions drive a trade-off between communication and computation and are architecture- and implementation-sensitive. The critical panel factorization steps can be made less communication-bound by overlapping asynchronous collectives for pivoting with the computation of rank-k updates. By shifting the computation-communication trade-off, a modified block-cyclic distribution can beneficially exploit more available parallelism on the critical path and reduce the panel factorization's memory hierarchy contention on now-ubiquitous multicore architectures.
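
The block-cyclic mapping mentioned in the abstract can be illustrated with a minimal sketch. It shows the standard two-dimensional block-cyclic rule, in which block (bi, bj) goes to process (bi mod P, bj mod Q); it is not the modified distribution the report proposes, which the abstract does not specify. The function names and the parameters `grid_rows` and `grid_cols` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def block_owner(bi, bj, grid_rows, grid_cols):
    """Return the (process row, process column) that owns block (bi, bj)
    under a standard 2D block-cyclic distribution."""
    return (bi % grid_rows, bj % grid_cols)


def panel_owners(bj, num_block_rows, grid_rows, grid_cols, start_row=0):
    """Return the sorted set of processes holding blocks of block column bj,
    from block row start_row downward (the blocks a panel step touches)."""
    return sorted(
        {block_owner(bi, bj, grid_rows, grid_cols)
         for bi in range(start_row, num_block_rows)}
    )


def blocks_per_process(num_block_rows, num_block_cols, grid_rows, grid_cols):
    """Count how many blocks each process owns, to check load balance."""
    return Counter(
        block_owner(bi, bj, grid_rows, grid_cols)
        for bi in range(num_block_rows)
        for bj in range(num_block_cols)
    )


if __name__ == "__main__":
    # Same 8 processes, two grid shapes: 2 x 4 versus 4 x 2.
    for rows, cols in [(2, 4), (4, 2)]:
        owners = panel_owners(0, num_block_rows=8, grid_rows=rows, grid_cols=cols)
        print(f"{rows} x {cols} grid: first panel spans {len(owners)} processes")
```

Under this rule a block column is spread over as many processes as the grid has rows, so a taller grid puts more processes on each panel factorization while also widening the group that must cooperate on pivoting. This is one way to read the trade-off the abstract attributes to the process grid dimensions.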
