
v2022.10.0

rhornung67 released this 12 Jan 22:47
57ee53e

This release contains new features, bug fixes, and build improvements.

Please download the RAJAPerf-v2022.10.0.tar.gz file below. The others will not work due to the way RAJAPerf uses git submodules.

Notable changes include:

  • Release version name change:

    • Following the naming scheme for coordinated RAJA Portability Suite releases, this release of the RAJA Performance Suite is v2022.10.0 to indicate that it corresponds to the v2022.10.x releases of RAJA and camp.

    • We have been coordinating releases of the RAJA Portability Suite projects (RAJA, Umpire, CHAI, and camp) for a while, and we changed the version naming scheme for those projects to reflect that. For example, the version number for the latest release of these projects is v2022.10.x, meaning the release occurred in October 2022. The intent is that the v2022.10.x project releases have consistent dependencies and are tested together. The 'x' patch version number is incremented for each project independently when a bugfix or other patch is needed. Any combination of v2022.10.x versioned libraries should be compatible.

  • New features and usage changes:

    • Add CONVECTION3DPA finite element kernel.
    • Add basic memory operation kernels MEMSET and MEMCPY.
  • Build changes / improvements:

    • Improve CI testing, including reuse of the test infrastructure in RAJA to eliminate redundancies.
    • Fix 'make install' so that the executable is installed as well.
    • Update all submodules to be consistent with RAJA v2022.10.4 release, including that version of RAJA.
  • Bug fixes / improvements:

    • Fix race condition in FIRST_MIN kernel (Thanks C. Robeck from AMD).
    • Fix broken OpenMP target variant of REDUCE_STRUCT kernel.
    • Fix MPI hang that occurred when rank zero did not enter a barrier because no path name was given for creating directories.
    • Support long double in MPI all-reduce operations even when the MPI implementation does not support long double.
    • Fix message printing to be rank zero only.
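
The version naming scheme described under "Release version name change" can be illustrated with a short sketch. It is not part of the RAJA Performance Suite; the names `SuiteVersion`, `parse_version`, and `same_coordinated_release` are hypothetical, and the check assumes only what the notes state: tags have the form vYYYY.MM.x, and libraries that share the same year and month belong to the same coordinated release regardless of patch number.

```python
import re
from typing import NamedTuple


class SuiteVersion(NamedTuple):
    """Hypothetical container for a vYYYY.MM.x tag."""
    year: int
    month: int
    patch: int


# Assumed tag layout: 'v', four-digit year, two-digit month, patch number.
_VERSION_RE = re.compile(r"^v(\d{4})\.(\d{2})\.(\d+)$")


def parse_version(tag: str) -> SuiteVersion:
    """Split a tag such as 'v2022.10.4' into year, month, and patch."""
    match = _VERSION_RE.match(tag)
    if match is None:
        raise ValueError(f"not a vYYYY.MM.x tag: {tag!r}")
    year, month, patch = (int(group) for group in match.groups())
    if not 1 <= month <= 12:
        raise ValueError(f"month out of range in tag: {tag!r}")
    return SuiteVersion(year, month, patch)


def same_coordinated_release(tag_a: str, tag_b: str) -> bool:
    """True when both tags share year and month; the patch number is ignored."""
    a, b = parse_version(tag_a), parse_version(tag_b)
    return (a.year, a.month) == (b.year, b.month)


if __name__ == "__main__":
    # Same v2022.10.x release, different patch numbers.
    print(same_coordinated_release("v2022.10.0", "v2022.10.4"))
    # Different month, so not the same coordinated release.
    print(same_coordinated_release("v2022.10.0", "v2022.03.0"))
```

Under these assumptions, the first call reports that v2022.10.0 and v2022.10.4 belong to the same coordinated release, while the second does not. A build script could use a check like this to warn when submodule tags come from different coordinated releases.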