A GPU-Accelerated Parallel Shooting Algorithm for Analysis of Radio Frequency and Microwave Integrated Circuits

IEEE Trans. VLSI Syst.(2015)

引用 16|浏览12
暂无评分
摘要
This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structure of the state matrix by using a periodic Arnoldi algorithm for computing the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is very amenable for massive parallel computing, such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on the recent NVIDIA Tesla GPU platforms. We further explore CUDA GPUs features, such as coalesced memory access and overlapping transfers with computation to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that when compared with the state-of-the-art implicit GMRES method under the same accuracy, the new parallel shooting-Newton method can lead up to $8times$ speedup.
更多
查看译文
关键词
periodic steady-state (pss) analysis,generalized minimal residual solver,graphic processing unit (gpu) parallelization,structured krylov-subspace,generalized minimal residual (gmres),fast periodic steady-state analysis,graphics processing units,matrix algebra,periodic arnoldi shooting solver,newton method,radiofrequency integrated circuits,arnoldi iteration,nvidia tesla platforms,graphic processing unit,gpu-accelerated parallel shooting algorithm,structured krylov subspace,structured krylov-subspace.,periodic structure,parallel computing,shooting-newton method,coalesced memory access,gmres solver,gapas,state matrix,microwave integrated circuits,algorithm design and analysis,mathematical model,instruction sets
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要