ROCm-afar OpenMP target offload by awnawab · Pull Request #137 · ecmwf-ifs/field_api

awnawab · 2026-02-06T15:25:09Z

This PR contributes an OpenMP target offload backend for AMD's ROCm-afar compiler.

All tests pass, other than tests/sync_device.F90. We hit a runtime problem when trying to access FIELD%DEVPTR on device, which seems linked to trying to access an abstract type on device. This is not an access pattern we use in our transformations, so for now I have just marked this as expected to fail for offloaded rocm-afar builds, and added the tests/copy_struct.F90 test which is more representative of the code we generate with loki. OWNER_GET_DEVICE_DATA had a similar problem during the on-device initialisation, and there I've used an associate block to work around it.

The big thing that is missing here is multi-precision builds with flang. I've understood the problem, and it's to do with flang being much stricter about module imports than other compilers. To regain that ability with flang, we have to make the "core" of the build precision independent again. This means removing INIT_DEBUG_VALUE_JPRB from field_defaults_module.F90. I understand this requires a change in IAL, so for now I've setup the CI to only build SP for flang. I will create an issue to track this and assign it to you @dareg if that's ok. It would be great to also support multi-precision builds in flang.

In a future PR I will also contribute an hpc-ci entry for running field_api on LUMI so that we can actually test it on AMD GPUs too.

…for flang builds

…bs because of leaking imports

awnawab · 2026-02-17T10:33:40Z

Hi @dareg, @pmarguinaud,

Have you had the chance to take a look at this PR yet? A big PR like this will conflict with many other bugfixes/developments, so it would be better to merge this sooner rather than later.

dareg · 2026-02-23T13:37:48Z

I see no problem in validating this PR, I've tested it on our systems (so without AMD compiler neither GPU) and it doesn’t seems to break anything.

But I just got access (today) to a system with AMD card and I'm trying to compile on it.
I get a problem in ecbuild because it doesn't recognise the compiler and then cannot load the right option (nothing complicated, see below). So I was wondering which compiler did you use to try this PR on?

CMake Error at build/ecbuild/cmake/ecbuild_log.cmake:190 (message):
  CRITICAL - Variable 'ECBUILD_Fortran_COMPILE_OPTIONS_REAL4' must be
  defined for compiler with ID LLVMFlang.

   Description:
     Compile options to convert all unqualified reals to 32 bit (single precision)
   Please submit a patch. In the mean time you can provide the variable to the CMake configuration.

awnawab · 2026-02-23T14:06:06Z

I see no problem in validating this PR, I've tested it on our systems (so without AMD compiler neither GPU) and it doesn’t seems to break anything.

But I just got access (today) to a system with AMD card and I'm trying to compile on it. I get a problem in ecbuild because it doesn't recognise the compiler and then cannot load the right option (nothing complicated, see below). So I was wondering which compiler did you use to try this PR on?
CMake Error at build/ecbuild/cmake/ecbuild_log.cmake:190 (message):
  CRITICAL - Variable 'ECBUILD_Fortran_COMPILE_OPTIONS_REAL4' must be
  defined for compiler with ID LLVMFlang.

   Description:
     Compile options to convert all unqualified reals to 32 bit (single precision)
   Please submit a patch. In the mean time you can provide the variable to the CMake configuration.

I'll look into that error and find a proper fix for it. for now you can just downgrade to ecbuild 3.9.0 which doesn't have that error message.

awnawab · 2026-02-23T18:26:27Z

I see no problem in validating this PR, I've tested it on our systems (so without AMD compiler neither GPU) and it doesn’t seems to break anything.

But I just got access (today) to a system with AMD card and I'm trying to compile on it. I get a problem in ecbuild because it doesn't recognise the compiler and then cannot load the right option (nothing complicated, see below). So I was wondering which compiler did you use to try this PR on?
CMake Error at build/ecbuild/cmake/ecbuild_log.cmake:190 (message):
  CRITICAL - Variable 'ECBUILD_Fortran_COMPILE_OPTIONS_REAL4' must be
  defined for compiler with ID LLVMFlang.

   Description:
     Compile options to convert all unqualified reals to 32 bit (single precision)
   Please submit a patch. In the mean time you can provide the variable to the CMake configuration.

In fact this problem is fixed again in ecbuild 3.12, so it's better to use that. Can I push an update here to set the minimum required ecbuild version to 3.12?

dareg · 2026-02-24T09:13:45Z

Yes sure, better to set it to the right version of ecbuild

awnawab · 2026-02-24T09:56:49Z

The tests are failing because of a fiat bug, which should be fixed once ecmwf-ifs/fiat#96 is merged.

mlange05

Excellent work, looks very good to me. GTG from my side. 👍

awnawab · 2026-02-24T15:59:54Z

@dareg were you able to test this successfully with an AMD GPU?

dareg · 2026-02-25T14:09:42Z

cmake/field_api_get_offload_model.cmake

                       DESCRIPTION "Enable GPU offload via OpenMP"
-                       CONDITION CMAKE_Fortran_COMPILER_ID MATCHES "PGI|NVHPC" AND ${_HAVE_OMP_OFFLOAD} )
+                       CONDITION
+                         (CMAKE_Fortran_COMPILER_ID MATCHES "PGI|NVHPC" OR  CMAKE_Fortran_COMPILER MATCHES "amdflang")


I think you are missing the _ID at the end of CMAKE_Fortran_COMPILER

Ah no, I misunderstood. CMAKE_Fortran_COMPILER is amdflang, and the CMAKE_Fortran_COMPILER_ID is LLVMFlang

dareg · 2026-02-25T14:10:03Z

cmake/field_api_get_offload_model.cmake

+       else()
+          set(FIELD_API_OFFLOAD_MODEL "NVHPCOpenMP")
+       endif()
+     elseif( CMAKE_Fortran_COMPILER MATCHES "amdflang")


Same here, I think you are missing the _ID at the end of CMAKE_Fortran_COMPILER

awnawab added 16 commits February 6, 2026 15:19

TESTS: add missing INT64 import in mem_pool test

5546e93

ROCMAFAR: openmp offload cmake setup

9da9a28

field_api_compile_options: define custom preprocessor macro for flang

ccb5c86

FIELD_RANKSUFF_MODULE: remove CONTIGUOUS attribrute from SELF%DEVPTR …

4fe08ee

…for flang builds

ROCMAFAR: don't access SELF members in map clauses

3a26574

ROCMAFAR: add amdflang openmp backend

187fa85

ROCMAFAR: add openmp+hip backend

1e2ce6e

HIPFORT: add separate HIPHOSTREGISTER import

f5d0c1c

Enable pinning in INIT_OWNER_GPU test

eb2513a

AAC7: add env for rocm-afar 22.2

c4a58f8

Add sync_device test to ABOR1_TEST_FILES for omp offload with rocm-afar

9d221a4

TESTS: add copy_struct.F90

3a464b7

TESTS: adapt for hipfort

6f5fc33

LLVMFlang workaround: link field_api_defaults to downstream object li…

130113c

…bs because of leaking imports

Readme: update for AMD offload backend

a0e5197

INIT_OWNER_DELAYED_GPU2: don't offload field members

4b06496

github-actions bot added the contributor label Feb 6, 2026

awnawab requested review from dareg, mlange05 and pmarguinaud February 6, 2026 15:25

awnawab added the approved-for-ci Approved to run hpc-ci label Feb 6, 2026

awnawab added 4 commits February 6, 2026 17:34

OWNER_GET_DEVICE_DATA: don't offload SELF in on-device initialisation

7f7d180

Add CXX as language to link to libstdc++

59fdbd6

CI: add macos flang runner

6d76b27

CI: use ninja build backend

d60ba3d

awnawab force-pushed the naan-openmp-offload-amd-rebase branch from 3cfa25f to d60ba3d Compare February 6, 2026 17:35

github-actions bot removed the approved-for-ci Approved to run hpc-ci label Feb 6, 2026

awnawab added the approved-for-ci Approved to run hpc-ci label Feb 6, 2026

dareg approved these changes Feb 23, 2026

View reviewed changes

ecbuild: upgrade minimum version to 3.12

1047f2e

github-actions bot removed the approved-for-ci Approved to run hpc-ci label Feb 24, 2026

awnawab added the approved-for-ci Approved to run hpc-ci label Feb 24, 2026

mlange05 approved these changes Feb 24, 2026

View reviewed changes

dareg reviewed Feb 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ROCm-afar OpenMP target offload#137

ROCm-afar OpenMP target offload#137
awnawab wants to merge 21 commits intoecmwf-ifs:mainfrom
awnawab:naan-openmp-offload-amd-rebase

awnawab commented Feb 6, 2026

Uh oh!

awnawab commented Feb 17, 2026

Uh oh!

dareg commented Feb 23, 2026

Uh oh!

awnawab commented Feb 23, 2026

Uh oh!

awnawab commented Feb 23, 2026

Uh oh!

dareg commented Feb 24, 2026

Uh oh!

awnawab commented Feb 24, 2026

Uh oh!

mlange05 left a comment

Uh oh!

awnawab commented Feb 24, 2026

Uh oh!

dareg Feb 25, 2026

Uh oh!

dareg Feb 25, 2026

Uh oh!

dareg Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

awnawab commented Feb 6, 2026

Uh oh!

awnawab commented Feb 17, 2026

Uh oh!

dareg commented Feb 23, 2026

Uh oh!

awnawab commented Feb 23, 2026

Uh oh!

awnawab commented Feb 23, 2026

Uh oh!

dareg commented Feb 24, 2026

Uh oh!

awnawab commented Feb 24, 2026

Uh oh!

mlange05 left a comment

Choose a reason for hiding this comment

Uh oh!

awnawab commented Feb 24, 2026

Uh oh!

dareg Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

dareg Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

dareg Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants