
Conversation

@FlorisBuwaldaDeltares (Contributor) commented Dec 3, 2025

What was done

  • pol_to_cellmask was split into three optimized routines (init, calculate, cleanup)
  • Added a new pinpok_elemental routine for faster ray casting (a rough sketch of the idea follows this list)
  • Use OpenMP to parallelize the O(N^2) operation when not running in MPI mode
  • Added two unit tests: one with an enclosure polygon containing a dry-area polygon, and one with two nested dry-area polygons
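The ray-casting idea behind pinpok_elemental is, roughly, counting how often a horizontal ray from the query point crosses the polygon edges. The sketch below is an illustration only and does not reproduce the actual pinpok_elemental interface; the module, routine, and argument names are assumptions.

module pinpok_sketch
   implicit none
   integer, parameter :: dp = kind(1.0d0)
contains

   !> Ray-casting (crossing-number) point-in-polygon test.
   pure logical function point_in_polygon(xp, yp, xpoly, ypoly) result(inside)
      real(dp), intent(in) :: xp, yp              !< query point
      real(dp), intent(in) :: xpoly(:), ypoly(:)  !< polygon vertices (implicitly closed)
      integer :: i, j

      inside = .false.
      j = size(xpoly)
      do i = 1, size(xpoly)
         ! Does edge (j,i) straddle the horizontal line y = yp?
         if ((ypoly(i) > yp) .neqv. (ypoly(j) > yp)) then
            ! Is the crossing point to the right of the query point?
            if (xp < xpoly(i) + (xpoly(j) - xpoly(i)) * (yp - ypoly(i)) / (ypoly(j) - ypoly(i))) then
               inside = .not. inside
            end if
         end if
         j = i
      end do
   end function point_in_polygon

end module pinpok_sketch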

Evidence of the work done

  • Video/figures
    <add video/figures if applicable>
  • Clear from the issue description
  • Not applicable

Tests

  • Tests updated
    <add testcase numbers if applicable, Issue number>
  • [x] Not applicable

Documentation

  • Documentation updated
    <add description of changes if applicable, Issue number>
  • [x] Not applicable

Issue link

@harmenwierenga (Contributor) left a comment

Partial review


private

!> dbpinpol routines are public to avoid PetSC dependency in unit tests

Contributor:

Could you explain how it helped to make them public?

Contributor Author:

Maybe the comment is too vague. dbpinpol became a separate module from the pol_to_cellmask module so that the PETSc dependency of pol_to_cellmask does not bleed into the unit test. But maybe I should remove this comment, as it will only cause questions.

@@ -1,4 +1,4 @@
!----- AGPL --------------------------------------------------------------------
!----- AGPL --------------------------------------------------------------------

Contributor:

Accidental space

integer :: temp_threads
#endif
integer :: k
if (allocated(cellmask)) deallocate (cellmask)

Contributor:

Style guide: no single-line if statements (see also below). A converter is currently under review 😄

Contributor Author:

oops

#ifdef _OPENMP
temp_threads = omp_get_max_threads() !> Save old number of threads
if (jampi == 0) then
call omp_set_num_threads(OMP_GET_NUM_PROCS()) !> Set number of threads to max for this O(N^2) operation

Contributor:

Does this circumvent the user settings for the number of threads? Are we sure that they do not set that number for a reason?

Contributor Author:

Yes, this circumvents the user settings. Most users do not set the number of threads, since they use MPI to parallelize and OpenMP threading does not otherwise speed up Dflowfm meaningfully. However, a year ago I introduced this same principle for another O(N^2) operation in find1dcells and it has not caused any problems.
Worst-case scenario (you are running many test cases in parallel, but not with MPI), it might be a tiny bit slower due to thread-scheduling overhead. In my opinion the benefits outweigh the drawbacks. Ideally you would offload O(N^2) operations to the GPU or something similar; with 5 million cells you already get 25 trillion function evaluations.
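To make the pattern under discussion concrete: the overall structure is roughly as follows. This is a sketch, not a verbatim copy of the PR; it assumes use omp_lib is in scope, that ndx stands in for the number of cells, and that the thread count saved in the init routine is restored in the cleanup routine.

#ifdef _OPENMP
   temp_threads = omp_get_max_threads()               ! save the current thread setting
   if (jampi == 0) then                               ! only when not running under MPI
      call omp_set_num_threads(omp_get_num_procs())   ! use all cores for the O(N^2) loop
   end if
#endif

   !$OMP PARALLEL DO SCHEDULE(DYNAMIC, 100)
   do k = 1, ndx
      ! point-in-polygon evaluations for cell k
   end do
   !$OMP END PARALLEL DO

#ifdef _OPENMP
   call omp_set_num_threads(temp_threads)             ! restore the previous setting
#endif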

call omp_set_num_threads(OMP_GET_NUM_PROCS()) !> Set number of threads to max for this O(N^2) operation
end if !> no else, in MPI mode omp num threads is already set to 1
#endif
!$OMP PARALLEL DO SCHEDULE(DYNAMIC, 100)

Contributor:

How did you choose the chunk size of 100? Same question for find1dcells

Contributor Author:

I added dynamic scheduling because the worst-case evaluation is quite a bit slower than the best case. Without it, 7 out of 8 threads could already be finished with their best-case blocks and then have to wait for thread 8 to finish its worst-case block. A chunk size of 100 seemed like a good starting point, but to be honest I haven't put a lot of thought into it. Will do some research.
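One option (a suggestion, not code from this PR) would be to give the chunk size a name, so that tuning it later only touches one line:

   integer, parameter :: chunk_size = 100   ! tuning knob for dynamic scheduling

   !$OMP PARALLEL DO SCHEDULE(DYNAMIC, chunk_size)
   do k = 1, ndx
      ! per-cell work whose cost varies strongly between best and worst case
   end do
   !$OMP END PARALLEL DO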

VISUAL_STUDIO_PATH engines_gpl/dflowfm/test
)

# Force sequential build order for the test projects (workaround, to be fixed properly)

Contributor:

Should be possible to remove this soon 😄


! Setup nested dry point polygons:
! 1. Outer dry point polygon: x=[0,40], y=[0,40] with zpl=1
! 2. Inner dry point polygon: x=[10,30], y=[10,30] with zpl=1

Contributor:

What does zpl = 1 or -1 do?

Contributor Author:

Ah, it's explained in the code but not in the test: zpl = 1 marks a dry-area polygon, zpl = -1 an enclosure polygon. I'll update the description.
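In terms of polygon data, the nested dry-area setup described in the test boils down to something like the sketch below. The array names xpl/ypl/zpl and the dmiss separator between polygons are assumptions based on the surrounding code, not quoted from the test itself.

   ! Two nested dry-area polygons (zpl = 1); an enclosure polygon would use zpl = -1.
   ! Polygons are assumed to be concatenated in xpl/ypl/zpl and separated by dmiss.
   xpl = [ 0d0, 40d0, 40d0,  0d0, dmiss, 10d0, 30d0, 30d0, 10d0]
   ypl = [ 0d0,  0d0, 40d0, 40d0, dmiss, 10d0, 10d0, 30d0, 30d0]
   zpl = [ 1d0,  1d0,  1d0,  1d0, dmiss,  1d0,  1d0,  1d0,  1d0]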
