JOURNAL ARTICLE
Accelerating atmospheric physics parameterizations using graphics processing units.
Published In: International Journal of High Performance Computing Applications, 2024, v. 38, n. 4. P. 282 1 of 3
Database: Academic Search Ultimate 2 of 3
Authored By: Abdi, Daniel S; Jankov, Isidora 3 of 3
Abstract
This article focuses on porting two physics parameterizations from the Common Community Physics Package (CCPP)—the aerosol-aware Thompson microphysics scheme and the Grell–Freitas cumulus convection scheme—to Graphics Processing Units (GPUs) using the OpenACC directive programming language. The study demonstrates significant acceleration, achieving up to 120× speedup for the Thompson scheme and 90× for the Grell–Freitas scheme compared to single-core CPU runs, with realistic speedups around 10× when compared to multi-core CPUs. Multi-GPU implementations show good weak scaling efficiency, especially when using one GPU per node, due to the column-wise independence of physics computations that minimizes inter-node communication. The work highlights challenges such as the impact of automatic arrays on GPU performance and the necessity of porting entire numerical weather prediction models—including dynamics—to GPUs to realize overall speedups, as data transfer overheads can negate gains from accelerating physics alone. Validation against CPU results confirms the accuracy of the GPU implementations within acceptable tolerances, and ongoing efforts aim to integrate these accelerated kernels into operational weather models.
Additional Information
- Source:International Journal of High Performance Computing Applications. 2024/07, Vol. 38, Issue 4, p282
- Document Type:Article
- Subject Area:Earth and Atmospheric Sciences
- Publication Date:2024
- ISSN:1094-3420
- DOI:10.1177/10943420241238711
- Accession Number:178804290
- Copyright Statement:Copyright of International Journal of High Performance Computing Applications is the property of Sage Publications Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Looking to go deeper into this topic? Look for more articles on EBSCOhost.