A large-scale heterogeneous computing framework for non-uniform sampling two-dimensional convolution applications

CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING(2023)

引用 0|浏览10
暂无评分
摘要
Non-uniform sampling two-dimensional convolution (NUSC) maps spatially sampling data with irregular distribution to a regular grid by convolution. As the data scale and growth rate continue to increase, accelerating NUSC with the heterogeneous computing platform is a feasible way. However, the complex hardware architecture and storage hierarchy of the heterogeneous computing platform poses a challenge to programming and performance tuning. Therefore, this paper proposes a heterogeneous parallel programming model and runtime framework named AutoNUSC. For the programming difficulties of NUSC in heterogeneous computing environments, AutoNUSC abstracts and encapsulates the parallel execution process of NUSC. Task scheduling, data division, node communication, fault-tolerant recovery, and other parallelization tasks are managed by AutoNUSC. For the performance tuning issues of NUSC, this paper implements performance optimization strategies for AutoNUSC, including vectorization, memory access optimization, data reuse, etc. The experiments show that AutoNUSC effectively reduces the workload of users in developing NUSC applications in heterogeneous computing environments. Performance acceleration of up to 339 times is achieved within a single node compared to the serial program. AutoNUSC can efficiently perform task scheduling and fault-tolerant recovery across multiple nodes, with desirable scalability and robustness.
更多
查看译文
关键词
Heterogeneous Computing,NUSC,Programming Model,Runtime Framework
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要