v0.9.0

@joerunde released this 04 Sep 23:09
· 81 commits to main since this release
79577c0

This release

  • Adds support for reranker models
  • Adds support for vLLM 0.10.1
  • Adds extra debug options for tensor parallel operation
  • Fixes a bug where VLLM_SPYRE_MAX_LOAD_PROCESSES did not work properly

What's Changed

Full Changelog: v0.8.0...v0.9.0