本文已被:浏览 2235次 下载 2403次
Received:September 19, 2017 Revised:October 10, 2017
Received:September 19, 2017 Revised:October 10, 2017
中文摘要: 针对非结构网格隐式算法在GPU上的加速效果不佳的问题,通过分析GPU的架构及并行模式,研究并实现了基于非结构网格格点格式的隐式LU-SGS算法的GPU并行加速.通过采用RCM和Metis网格重排序(重组)方法,优化非结构网格的数据局部性,改善非结构网格的隐式算法在GPU上的并行加速效果.通过三维机翼算例验证了本文实现的正确性及效率.结果表明两种网格重排序(重组)方法分别得到了63%和69%的加速效果提高.优化后的LU-SGS隐式GPU并行算法获得了相较于CPU串行算法27倍的加速比,充分说明了本文方法的高效性.
Abstract:With regard to the poor acceleration performance on GPU using the unstructured grids implicit method, this study realizes the GPU acceleration of LU-SGS implicit method based on unstructured grids with the cell-vertex scheme. With introduce the architecture of a GPU and its parallelization method, two grid reordering methods are set forth based on RCM and METIS, to improve data locality of unstructured grids and to improve acceleration performance on GPU using the unstructured grids implicit method. The ONERA M6 Wing test case is carried out to verify and validate this implementation. With two grid reordering methods, the GPU implementations achieve 63% and 69% improvements respectively. The GPU implementation obtains a speedup of 27 times compared to the CPU version running on a single core. It indicates that the proposed GPU implementation has a solid performance.
keywords: GPU acceleration parallel computing grid reordering computational fluid dynamics implicit schemes
文章编号: 中图分类号: 文献标志码:
基金项目:江苏高校优势学科建设工程资助项目
引用文本:
陈龙,徐添豪,田书玲.基于非结构网格隐式算法的GPU加速研究.计算机系统应用,2018,27(5):238-243
CHEN Long,XU Tian-Hao,TIAN Shu-Ling.Research on GPU Acceleration of Implicit Schemes Based on Unstructured Grids.COMPUTER SYSTEMS APPLICATIONS,2018,27(5):238-243
陈龙,徐添豪,田书玲.基于非结构网格隐式算法的GPU加速研究.计算机系统应用,2018,27(5):238-243
CHEN Long,XU Tian-Hao,TIAN Shu-Ling.Research on GPU Acceleration of Implicit Schemes Based on Unstructured Grids.COMPUTER SYSTEMS APPLICATIONS,2018,27(5):238-243