QR Factorization Using Malleable BLAS on Multicore Processors.

ISC Workshops (2022)

Abstract
We demonstrate that significant performance benefits can be obtained by exploiting malleability in a framework designed to implement portable, high-performance BLAS-like kernels. To this end, we integrate thread-level malleability into the BLIS library and provide an experimental evaluation for a representative dense linear algebra operation: the QR factorization of dense matrices, enhanced with look-ahead.
Keywords
Malleability, Basic Linear Algebra Subprograms (BLAS), High performance, Multi-threading, Multicore processors