Enable building and testing Omega in single precision #147

mwarusz · 2024-10-16T16:58:43Z

This PR enables configuring Omega to use single precision and adds a test (reusing the tendency terms test) checking that we can build and run in single precision. The main changes are:

Changing R8 to Real where appropriate
Reading mesh and state arrays into temporary R8 arrays before storing them into the class members
Always using R8 buffers for halo exchanges
CMake logic to build the single precision library when needed

The halo buffers change is not optimal for performance in single precision. It would need to be optimized if we decide
to pursue this option seriously in the future.

Checklist

Documentation:
- User's Guide has been updated
- Developer's Guide has been updated
- Documentation has been built locally and changes look as expected
Testing
- CTest unit tests for new features have been added per the approved design.
- Unit tests have passed. Please provide a relevant CDash build entry for verification.

sbrus89

This looks good to me and includes some nice clean-ups as well. I just had a couple questions related to the MPI calls and _Real suffix.

components/omega/test/ocn/AuxiliaryVarsTest.cpp

sbrus89 · 2024-10-29T01:21:25Z

components/omega/src/ocn/HorzMesh.cpp

+   // Read mesh cell coordinates
+   readCellArray(XCellH, "xCell");
+   readCellArray(YCellH, "yCell");
+   readCellArray(ZCellH, "zCell");


These function calls are a nice improvement, thanks.

components/omega/src/base/Halo.h

grnydawn

Both the double-precision and single-precision versions of the Omega unit tests were profiled using Nsight profilers, and the results indicate that this PR correctly generates the corresponding GPU kernels. However, I have not verified the validity of the algorithms.

The profiling results indicate that the unit tests in this PR are insufficient to determine whether single-precision improves performance, as the kernel sizes are too small and do not appear to be representative of typical climate algorithms.

I approve this PR, assuming it properly handles merging commits from external libraries as I noted in another comment.

cime

grnydawn · 2024-10-31T23:49:35Z

components/omega/test/ocn/TendencyTermsTest.cpp

I have profiled PLANE_TEST unit test with Nsight profilers and summarized the profiling result below. Further details are in (https://acme-climate.atlassian.net/l/cp/YSFk1Zra).

The PR correctly generates the single-precision version of the Omega TEND_PLANE unit test.

The elapsed time for both the single-precision and double-precision versions of the unit test is nearly the same, at around 155 ms, excluding initialization and finalization routines.

GPU resources are underutilized in both cases. For the double-precision version, Compute (SM) Throughput is 4.15%, and Memory Throughput is 1.39%. For single-precision, Compute (SM) Throughput is 7.27%, and Memory Throughput is 0.81%.

It appears that the kernels are too small to effectively compare the performance characteristics of different floating-point precisions. The longest kernel runs for about 27 µs, but most kernels run in under 20 µs.

The arithmetic intensity (AI) of the kernels also appears to be too high, meaning these kernels might not accurately represent the performance characteristics of the full Omega model. The typical AI of climate algorithms ranges between 0.1 and 1, but the AI of these unit test kernels reaches up to 460 FLOPs/byte.

brian-oneill · 2024-11-04T18:47:53Z

After the rebase, able to build and run tests successfully on Chrysalis and Perlmutter CPU & GPU. Everything looks good, approving.

sbrus89

Approving based my inspection and testing by @brian-oneill and @grnydawn.

mwarusz requested review from sbrus89, grnydawn and brian-oneill October 16, 2024 16:58

mark-petersen changed the title ~~Enable builing and testing Omega in single precision~~ Enable building and testing Omega in single precision Oct 21, 2024

sbrus89 reviewed Oct 29, 2024

View reviewed changes

Precision fixes for state, auxvars, and geometry arrays

8e6b89c

mwarusz force-pushed the mwarusz/omega/real-precision branch 2 times, most recently from 42e8b11 to 3c6a1b2 Compare October 30, 2024 14:28

grnydawn approved these changes Oct 31, 2024

View reviewed changes

mwarusz added 10 commits October 31, 2024 19:43

Precision fixes in reductions test

4cb500d

Use R8 buffers for halo exchange

b20f67a

Change spherical setup in aux vars test to avoid very large numbers

4d3f838

Use Real precision for geometry

150044d

Precision fixes in tracer terms

44a2192

Precision fixes for constants

6c3e1e4

Use checkErrors helper in tendency terms test

c19bfef

SINGLE_PRECISION -> OMEGA_SINGLE_PRECISION

c0382de

Enable setting single precision through CMake

07785e6

Use planar tendency terms test to test single precision

9b403ad

mwarusz force-pushed the mwarusz/omega/real-precision branch from 3c6a1b2 to 9b403ad Compare November 1, 2024 01:44

brian-oneill approved these changes Nov 4, 2024

View reviewed changes

sbrus89 approved these changes Nov 4, 2024

View reviewed changes

sbrus89 merged commit 801ceaa into E3SM-Project:develop Nov 6, 2024
2 checks passed

sbrus89 self-assigned this Nov 6, 2024

sbrus89 added clean up performance labels Nov 6, 2024

sbrus89 mentioned this pull request Nov 7, 2024

Omega/state time level change #152

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable building and testing Omega in single precision #147

Enable building and testing Omega in single precision #147

mwarusz commented Oct 16, 2024

sbrus89 left a comment

sbrus89 Oct 29, 2024

grnydawn left a comment

grnydawn Oct 31, 2024

brian-oneill commented Nov 4, 2024

sbrus89 left a comment

Enable building and testing Omega in single precision #147

Enable building and testing Omega in single precision #147

Conversation

mwarusz commented Oct 16, 2024

sbrus89 left a comment

Choose a reason for hiding this comment

sbrus89 Oct 29, 2024

Choose a reason for hiding this comment

grnydawn left a comment

Choose a reason for hiding this comment

grnydawn Oct 31, 2024

Choose a reason for hiding this comment

brian-oneill commented Nov 4, 2024

sbrus89 left a comment

Choose a reason for hiding this comment