GPU Optimization of Lattice Boltzmann Method with Local Ensemble Transform Kalman Filter

2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH)(2022)

引用 1|浏览14
暂无评分
摘要
The ensemble data assimilation of computational fluid dynamics simulations based on the lattice Boltzmann method (LBM) and the local ensemble transform Kalman filter (LETKF) is implemented and optimized on a GPU supercomputer based on NVIDIA A100 GPUs. To connect the LBM and LETKF parts, data transpose communication is optimized by overlapping computation, file I/O, and communication based on data dependency in each LETKF kernel. In two dimensional forced isotropic turbulence simulations with the ensemble size of M = 64 and the number of grid points of N x = 128 2 , the optimized implementation achieved ×3.80 speedup from the naive implementation, in which the LETKF part is not parallelized. The main computing kernel of the local problem is the eigenvalue decomposition (EVD) of M×M real symmetric dense matrices, which is computed by a newly developed batched EVD in EigenG. The batched EVD in EigenGoutperforms that in cuSOLVER, and ×65.3 speedup was achieved.
更多
查看译文
关键词
Ensemble data assimilation,Local ensemble transform Kalman filter,Eigenvalue decomposition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要