From faeaf927df6644e0228876952a786ce958ffa944 Mon Sep 17 00:00:00 2001
From: Eric Phipps
Date: Tue, 5 Sep 2023 18:44:04 -0600
Subject: [PATCH] Squashed 'tpls/kokkos/' changes from aa1f48f31..1a3ea28f6

1a3ea28f6 Merge pull request #6231 from ndellingwood/master
3e85bd920 Fix windows symlink configure issue (#6241)
ea7b12448 CHANGELOG fixup following merge
25592c571 Update master_history.txt
adde1e6aa Merge branch 'release-candidate-4.1.00' for 4.1.00
9e8443018 Merge pull request #6228 from masterleinad/cherry_pick_6223
dd81ecb3d Merge pull request #6223 from masterleinad/fix_simd_on_gpus
5c3e68392 [4.1.00] Changelog for 4.1.00 (#6226)
cd96a740b Merge pull request #6219 from masterleinad/fix_sycl_makefile_4_1_00
23aadf490 Fix compiling SYCL with KOKKOS_IMPL_DO_NOT_USE_PRINTF_USAGE
afc192988 Update version to 4.1.00
6ca60c395 Improve OpenMP affinity warning to include MPI concerns (#6185)
e200ba117 [HIP] Improve heuristic deciding the number of blocks used in parallel_reduce (#6160)
43a797b59 Left align demangled stacktrace output. (#6191)
a40637298 Fix global fence in Kokkos::resize(DynRankView) (#6184)
8661773eb Merge pull request #6195 from fnrizzi/is_trait_v
98f9b4c62 add trait and test
e30f04011 shortcut value for is_dynamic_view
789b62c61 Weed out verbose output from `dynamic_view` container unit test (#6173)
e2a7f085d Merge pull request #6171 from rgayatri23/openmptarget_nvhpc
8266abd1b Merge pull request #6183 from ldh4/simd_replace_unavailable_loadu_storeu_instr
ad966bda0 OpenMPTarget: include desul changes.
c72615afe Merge remote-tracking branch 'upstream/develop' into openmptarget_nvhpc
7b0e378e6 Replace _mm512_loadu_epi64 and _mm512_storeu_epi64 with _mm512_loadu_si512 and _mm512_storeu_si512
18c539504 Merge pull request #5982 from masterleinad/cleanup_functor_analysis
6c134afda Merge pull request #6172 from masterleinad/remove_desul_sycl_extended_namespace
0b7bed581 Allow passing a temporary std::vector to partition_space (#6167)
65ffe4c5d Also create symlinks for CMake configuration files to cmake_packages/Kokkos for TriBITS (#6163)
915c17466 SIMD: make binary op tests to test against all data types (#5913)
62ba94c88 Merge pull request #6175 from dalg24/changelog_372
502dc03c3 Merge pull request #6176 from bartlettroscoe/tril-11938-tribits-hwloc
2bc7b96b7 Clean up FunctorAnalysis
9df5a01a8 Kokkos: Mark HWLOC as a TriBITS TPL as well (trilinos/Trilinos#11938)
1af137999 Cherry-pick v3.7.02 changelog into develop [ci skip]
bf3457349 OpenMPTarget: Restore desul changes.
925aca1b1 OpenMPTarget: Replace kokkos macros in desul.
538d18d31 OpenMPTarget: update fixme comment.
e832781a3 Remove extended_namespace template paramter for SYCLMemoryOrder/Scope
c23cfb8d0 Update Makefile.kokkos
d1ecf9acb OpenMPTarget: Add a fixme.
bbd9a7882 OpenMPTarget: Changes for OpenMPTarget backend with nvhpc compiler.
ab6f7565b Implement `HPX::in_parallel` (#6143)
e88537f62 Allow linking against build tree (#6078)
b3f9f7825 sorting: add to binsort support for strided views and reorg tests (#6081)
2a5c949c7 Add `Kokkos::all_libs` alias target for compatibility with TriBITS/Trilinos (#6157)
2a382b42b Merge pull request #6126 from masterleinad/fix_uninitialized_value_in_combined_reducer
461310de4 Merge pull request #6156 from masterleinad/fix_cuda_lambda_trilinos
12e9645e7 KokkosTools: Don't call callbacks before backends are initialized (#6114)
f8a2a8085 `BinSort`, `BinOp1D`, `BinOp3D`: mark default constructor as deleted (#6131)
d92158c69 Fix bogus warnings in nested CUDA parallel_reduce
31a5f21ae Merge pull request #6136 from masterleinad/fix_nd_builtin_reductions_with_loc
5d81422da Merge pull request #6155 from dalg24/fixup_dual_view
85b014b33 Fix Kokkos_ENABLE_CUDA_LAMBDA for Trilinos
131503d8d Revert to `DualView` when deprecated code 4 is enabled
382f0bea7 Merge pull request #6150 from dalg24/drop_profiling_load_print_option
b2645f80c OpenMPTarget: Enable Cray compiler for the OpenMPTarget backend. (#5889)
6c0adb571 Merge pull request #6149 from dalg24/fixup_cuda_lambda
d74df9b66 [ci skip] Add nightly ci for spack (#6135)
8ede4a496 Merge pull request #6142 from dalg24/cleanup_exported_kokkos_options
d92988f3f Suppress bogus warning about CUDA_LAMBDA being ON
57226c978 Drop Kokkos_ENABLE_PROFILING_LOAD_PRINT option
87c7be94f Merge pull request #6047 from masterleinad/simplify_sycl_reductions
3f565bbb5 Export Kokkos_ENABLE_