Implementation of a Lattice Boltzmann Method for Large Eddy Simulation on Multiple GPUs

High Performance Computing and Communication & 2012 IEEE 9th International Conference Embedded Software and Systems (2012)

Abstract
Recently, the graphics processing unit (GPU) has evolved into a highly parallel, multithreaded, many-core processor with tremendous computational horsepower and very high memory bandwidth. To improve the simulation efficiency of complex flow phenomena in computational fluid dynamics, we propose a CUDA-based algorithm for large eddy simulation on multiple GPUs. Our implementation adopts the "collision after propagation" scheme and performs the propagation step through global memory read transactions. For simplicity, the working set is split into equal sub-domains, one assigned to each GPU. With recently released hardware, up to four GPUs can be controlled by a single CPU thread and run in parallel. The results show that our multi-GPU implementation can run large-scale simulations (mesh: 10240×10240) even in double-precision floating point, and achieves a 190x speedup over the sequential CPU implementation.
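The "collision after propagation" ordering and the equal sub-domain split can be illustrated with a small CPU-side sketch. This is not the paper's CUDA code: it assumes a D2Q9 lattice, a plain BGK collision without the large eddy (subgrid) model, periodic boundaries, and a split into row slabs. The names `pull_step`, `split_rows`, and `total_mass` are hypothetical helpers introduced only for this illustration.

```python
# Assumed D2Q9 lattice: discrete velocities and their standard weights.
E = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1), (1, 1), (-1, 1), (-1, -1), (1, -1)]
W = [4 / 9] + [1 / 9] * 4 + [1 / 36] * 4


def equilibrium(rho, ux, uy):
    """Second-order BGK equilibrium distribution for one node."""
    usq = ux * ux + uy * uy
    feq = []
    for (ex, ey), w in zip(E, W):
        eu = ex * ux + ey * uy
        feq.append(w * rho * (1.0 + 3.0 * eu + 4.5 * eu * eu - 1.5 * usq))
    return feq


def pull_step(f, nx, ny, tau):
    """One time step in 'collision after propagation' order.

    f[y][x][i] holds distribution i at node (x, y). Each node first gathers
    (reads) the values streaming into it from its upwind neighbours, then
    collides locally. Periodic boundaries are an assumption of this sketch.
    """
    new = [[None] * nx for _ in range(ny)]
    for y in range(ny):
        for x in range(nx):
            # Propagation by reading: direction i arrives from (x - ex, y - ey).
            fin = [f[(y - ey) % ny][(x - ex) % nx][i] for i, (ex, ey) in enumerate(E)]
            rho = sum(fin)
            ux = sum(fi * ex for fi, (ex, _) in zip(fin, E)) / rho
            uy = sum(fi * ey for fi, (_, ey) in zip(fin, E)) / rho
            feq = equilibrium(rho, ux, uy)
            # BGK relaxation toward equilibrium (no subgrid model here).
            new[y][x] = [fi - (fi - fe) / tau for fi, fe in zip(fin, feq)]
    return new


def split_rows(ny, n_devices):
    """Split ny rows into equal contiguous slabs, one (start, stop) per device."""
    if ny % n_devices != 0:
        raise ValueError("ny must be divisible by n_devices for equal slabs")
    rows = ny // n_devices
    return [(d * rows, (d + 1) * rows) for d in range(n_devices)]


def total_mass(f):
    """Sum of all distributions over the lattice; conserved by pull_step."""
    return sum(sum(sum(node) for node in row) for row in f)
```

Because every node only reads from its neighbours and writes to its own location, the writes of such a step are local to the node, which is the property a read-based propagation on a GPU relies on. In a multi-device setting, each slab returned by `split_rows` would also need the boundary rows of its neighbouring slabs before the next step; how the paper exchanges that data is not stated in the abstract.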
Keywords
multiple gpus,sequential implementation,high memory bandwidth,simulation efficiency,large scale,multi-gpu implementation,large eddy simulation,computational fluid dynamic,global memory reading transaction,lattice boltzmann method,cuda-based simulation algorithm,computational fluid dynamics,floating point arithmetic,computational modeling,parallel processor,multi threading,lattice boltzmann methods,lattices,mathematical model,kernel