You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: enable deepgemm jit for fp8 block-scale on SM90 (#1969)
<!-- .github/pull_request_template.md -->
## π Description
Enable JIT compile for the FP8 DeepGEMM kernels, NVRTC is currently
disabled it uses NVCC by default.
## π Pull Request Checklist
Thank you for contributing to FlashInfer! Before we review your pull
request, please make sure the following items are complete.
### β Pre-commit Checks
- [x] I have installed `pre-commit` by running `pip install pre-commit`
(or used your preferred method).
- [x] I have installed the hooks with `pre-commit install`.
- [x] I have run the hooks manually with `pre-commit run --all-files`
and fixed any reported issues.
> If you are unsure about how to set up `pre-commit`, see [the
pre-commit documentation](https://pre-commit.com/).
## π§ͺ Tests
- [x] Tests have been added or updated as needed.
- [x] All tests are passing (`unittest`, etc.).
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
* **Refactor**
* JIT include directory discovery now uses the flashinfer-python package
instead of the previous package.
* Updated resolved include path to the flashinfer data location.
* Runtime compilation now consistently uses NVCC; the prior
environment-variable toggle was removed.
* Updated warning text when the expected package installation cannot be
found.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
---------
Signed-off-by: Duncan Moss <[email protected]>
0 commit comments