Add mkl::linalg_svd relative Ops #1511


Open · wants to merge 21 commits into main

Conversation

yucai-intel
Contributor

@yucai-intel yucai-intel commented Mar 26, 2025

This pull request introduces support for Singular Value Decomposition (SVD) on XPU devices using oneMKL. It includes the implementation of SVD kernels, necessary utility functions, and test updates. The following ops are enabled on XPU devices:

  • _linalg_svd
  • _linalg_svd.U
  • linalg_svd
  • linalg_svd.U
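
For reference, `torch.linalg.svd` follows the same conventions as NumPy's SVD; a small NumPy sketch of the shapes and the reconstruction identity these ops must satisfy (illustrative, not the XPU kernel itself):

```python
import numpy as np

# For A of shape (m, n) with k = min(m, n):
#   full_matrices=True  -> U is (m, m), S is (k,), Vh is (n, n)
#   full_matrices=False -> U is (m, k), S is (k,), Vh is (k, n)
A = np.random.rand(5, 3)
U, S, Vh = np.linalg.svd(A, full_matrices=True)
assert U.shape == (5, 5) and S.shape == (3,) and Vh.shape == (3, 3)

# Reconstruction identity: A == U[:, :k] @ diag(S) @ Vh
k = min(A.shape)
assert np.allclose(U[:, :k] @ np.diag(S) @ Vh, A)
```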

@yucai-intel
Contributor Author

yucai-intel commented Mar 27, 2025

Test cases related to complex data types fail because oneDNN does not support them.

@yucai-intel yucai-intel changed the title [WIP] Add mkl::linalg_svd relative Ops Add mkl::linalg_svd relative Ops Apr 9, 2025
@yucai-intel yucai-intel requested a review from CuiYifeng April 11, 2025 01:53
Contributor

@CuiYifeng CuiYifeng left a comment


Please reactivate some skipped cases such as https://github.com/intel/torch-xpu-ops/blob/main/test/xpu/skip_list_common.py#L1397. Otherwise, CI cannot test with these svd cases.

@CuiYifeng CuiYifeng force-pushed the slogdet branch 4 times, most recently from 56cebb3 to 9ac4024 Compare April 25, 2025 08:54
@CuiYifeng CuiYifeng force-pushed the slogdet branch 2 times, most recently from e13dc4f to da85fe6 Compare May 7, 2025 05:27
@CuiYifeng CuiYifeng requested a review from Copilot May 13, 2025 13:11
Contributor

@Copilot Copilot AI left a comment


Pull Request Overview

This PR adds support for MKL-based SVD operations on the XPU backend, including new composite functions and updates to native function definitions for linalg_svd and related ops. Key changes include:

  • New YAML definitions for _linalg_svd and linalg_svd (and their variants) exposing the compute_uv flag.
  • Removal of several SVD-related skip tests in the xpu test skip list.
  • New MKL-based implementation files for BatchLinearAlgebra (header and source) and updated XPU dispatch registration.

Reviewed Changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
yaml/native/native_functions.yaml Added definitions for new SVD ops and composite functions.
test/xpu/skip_list_common.py Removed skip tests for SVD float64 variants on XPU.
src/ATen/native/xpu/mkl/BatchLinearAlgebra.h Declared the svd_mkl prototype.
src/ATen/native/xpu/mkl/BatchLinearAlgebra.cpp Implemented the svd_mkl function using oneMKL calls.
src/ATen/native/xpu/BatchLinearAlgebra.cpp Registered the XPU dispatch for the SVD kernel.
Files not reviewed (1)
  • src/ATen/native/xpu/XPUFallback.template: Language not supported

@CuiYifeng
Contributor

@fengyuan14 @majing921201 Please help review, thanks.

#if defined(USE_ONEMKL)
native::xpu::svd_mkl(A, full_matrices, compute_uv, driver, U, S, Vh, info);
#else
const auto A_cpu = A.to(A.options()
Contributor


Why do we need to fall back to CPU when oneMKL is not installed, instead of throwing an error? If the CPU path fails and throws an error on the CPU side, that will confuse users.

Contributor

@CuiYifeng CuiYifeng May 20, 2025


@majing921201 There is still an op registration on the XPU device when USE_ONEMKL is OFF. Please note that USE_ONEMKL will be renamed to USE_ONEMKL_XPU in #1642, which is unrelated to USE_MKL in stock PyTorch.
Before the oneMKL XPU build is ON by default, such a fallback path is needed in case functionality is broken.
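
The fallback behavior under discussion can be sketched roughly as follows (an illustrative Python sketch, with NumPy standing in for the CPU kernel; `svd_with_fallback` is a hypothetical name, not the actual API):

```python
import numpy as np

def svd_with_fallback(A, use_onemkl_xpu=False):
    # Hypothetical sketch: when the oneMKL XPU path is unavailable, run the
    # decomposition on a host (CPU) copy instead of failing at registration.
    if use_onemkl_xpu:
        raise NotImplementedError("oneMKL XPU path not modeled in this sketch")
    A_cpu = np.asarray(A)  # stands in for copying the tensor to CPU
    U, S, Vh = np.linalg.svd(A_cpu, full_matrices=False)
    return U, S, Vh  # results would then be copied back to the XPU device

U, S, Vh = svd_with_fallback(np.eye(3))
assert np.allclose(U @ np.diag(S) @ Vh, np.eye(3))
```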

@@ -0,0 +1,352 @@
#if defined(USE_ONEMKL)
Contributor


When the scope is the whole source file, please change the logic in CMake instead, to filter out the file when USE_ONEMKL is off.

Contributor

@CuiYifeng CuiYifeng May 21, 2025


Done. src/ATen/native/xpu/mkl/*.cpp is filtered in src/ATen/CMakeLists.txt when oneMKL XPU support is OFF.
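
The filtering described above could look roughly like this (an illustrative sketch only; the actual variable and list names in src/ATen/CMakeLists.txt may differ):

```cmake
# Illustrative only: exclude the oneMKL-backed sources when oneMKL XPU is OFF.
file(GLOB ATen_XPU_MKL_SRCS "${CMAKE_CURRENT_SOURCE_DIR}/native/xpu/mkl/*.cpp")
if(NOT USE_ONEMKL_XPU)
  list(REMOVE_ITEM ATen_XPU_SRCS ${ATen_XPU_MKL_SRCS})
endif()
```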

const Tensor& U,
const Tensor& S,
const Tensor& Vh,
const Tensor& info) {
Contributor


Can you align this op implementation with the svd_stub logic in stock PyTorch? The previous code in IPEX is based on a legacy implementation from PyTorch 1.x. For our current upstream solution, those copy and resize steps should already be covered by common code in stock PyTorch; please check.

Contributor


I tried to remove those resize and copy calls, but the current implementation breaks without them. My suggestion is to ensure functionality on XPU first and refine this code in another PR.

Contributor


@CuiYifeng, is there any justification for ensuring the functionality of these operations first? Do any models depend on these operations? If not, why don't we polish this PR until it is good enough?

Contributor


SVD was requested by Argonne. I get your point.

@CuiYifeng CuiYifeng requested a review from Copilot May 21, 2025 05:17
Contributor

@Copilot Copilot AI left a comment


Pull Request Overview

This pull request adds support for Singular Value Decomposition (SVD) on XPU devices using oneMKL by introducing new op definitions, implementing corresponding MKL-based kernels, and updating test skip lists.

  • Updated YAML configuration to expose SVD ops (_linalg_svd, linalg_svd, and their U variants).
  • Added new SVD kernel implementations and supporting utility functions in the xpu/mkl directories.
  • Adjusted test skip lists to reflect the updated SVD support on XPU float64 inputs.

Reviewed Changes

Copilot reviewed 6 out of 9 changed files in this pull request and generated no comments.

Show a summary per file
File Description
yaml/native/native_functions.yaml Adds new op definitions for SVD and its variants on XPU devices.
test/xpu/skip_list_common.py Removes several float64-related SVD test skips to enable testing on XPU devices.
src/ATen/native/xpu/mkl/SpectralOps.cpp Removes conditional compilation guard for oneMKL usage.
src/ATen/native/xpu/mkl/BatchLinearAlgebra.h Introduces the declaration for the new svd_mkl function.
src/ATen/native/xpu/mkl/BatchLinearAlgebra.cpp Implements the MKL-based SVD functionality with type specializations.
src/ATen/native/xpu/BatchLinearAlgebra.cpp Provides an XPU dispatch wrapper that calls the new oneMKL-based implementation.
Files not reviewed (3)
  • cmake/ONEMKL.cmake: Language not supported
  • src/ATen/CMakeLists.txt: Language not supported
  • src/ATen/native/xpu/XPUFallback.template: Language not supported
Comments suppressed due to low confidence (1)

src/ATen/native/xpu/mkl/SpectralOps.cpp:1

  • The removal of the conditional compilation guard for USE_ONEMKL in this file may impact systems without oneMKL. Please verify that this change is intentional and that necessary build guards are applied elsewhere if needed.
#include <ATen/native/Resize.h>

@xytintel xytintel requested a review from CuiYifeng May 21, 2025 05:26
@CuiYifeng CuiYifeng requested a review from fengyuan14 May 21, 2025 05:36
@chuanqi129 chuanqi129 removed the xpu-op label May 21, 2025
@EikanWang
Contributor

Please address @majing921201's comments.
