Dense LU Factorization on Multicore Supercomputer Nodes
Authors: A. Arya, J. Lifflander, L. Kale, P. Miller, R. Venkataraman, T. Jones
Processed: 2014-09-26
Source: Technical report (DE)
Length: 11 pages
Keywords: benchmarks; communication; computer architecture; supercomputers
Abstract: Dense LU factorization is a prominent benchmark used to rank the performance of supercomputers. Many implementations use block-cyclic distributions of matrix blocks onto a two-dimensional process grid. The process grid dimensions drive a trade-off between communication and computation and are architecture- and implementation-sensitive. The critical panel factorization steps can be made less communication-bound by overlapping asynchronous collectives for pivoting with the computation of rank-k updates. By shifting the computation-communication trade-off, a modified block-cyclic distribution can beneficially exploit more available parallelism on the critical path, and reduce the panel factorization's memory hierarchy contention on now-ubiquitous multicore architectures.
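
To make the role of the process grid dimensions concrete, the following is a minimal sketch of the standard two-dimensional block-cyclic mapping, in which block (i, j) is assigned to process (i mod P, j mod Q). It is not the modified distribution proposed in the report, whose details the abstract does not give. The function names and the grid sizes used here are hypothetical and chosen only for illustration.

```python
from collections import Counter


def block_owner(i, j, grid_rows, grid_cols):
    """Grid coordinates of the process that owns matrix block (i, j)
    under a standard 2D block-cyclic distribution."""
    return (i % grid_rows, j % grid_cols)


def panel_owners(panel, num_block_rows, grid_rows, grid_cols):
    """Processes holding the blocks of block-column `panel` on or below
    the diagonal, i.e. those that take part in factoring that panel."""
    return sorted(
        {block_owner(i, panel, grid_rows, grid_cols)
         for i in range(panel, num_block_rows)}
    )


def blocks_per_process(num_block_rows, num_block_cols, grid_rows, grid_cols):
    """Number of blocks assigned to each process (a load-balance check)."""
    return Counter(
        block_owner(i, j, grid_rows, grid_cols)
        for i in range(num_block_rows)
        for j in range(num_block_cols)
    )


if __name__ == "__main__":
    # Hypothetical 8 x 8 block matrix on two illustrative grid shapes.
    for rows, cols in [(2, 3), (4, 1)]:
        owners = panel_owners(0, 8, rows, cols)
        print(f"{rows} x {cols} grid: {len(owners)} processes share panel 0")
```

In this sketch a taller grid (more process rows) spreads each panel over more processes, which exposes more parallelism in the panel factorization but also involves more processes in the pivoting communication. This is one way to read the trade-off the abstract describes.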