
Commit 0ae7872

🔧 Update default simple compile backend (#506)
# Description

Updates the default simple compile backend to `inductor` to match vLLM. This could be considered slightly risky because it updates default behavior, but we do want to run with `inductor` where possible. This was previously set to `eager` because we had issues linking g++ correctly and needed a fast workaround.

Impact here should be minimal. If users run into problems with environments that don't have python headers and a valid compiler toolchain, they can run with `VLLM_SPYRE_SIMPLE_COMPILE_BACKEND=eager`.

Signed-off-by: Joe Runde <[email protected]>
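The description says `inductor` needs Python headers and a working compiler toolchain. A rough pre-flight check can be sketched as follows. This is a minimal sketch and not part of the commit: the helper names `toolchain_looks_usable` and `suggested_backend` are hypothetical, and checking for `Python.h` plus the compiler named by `CXX` (falling back to `g++`) is an assumption about what is sufficient.

```python
import os
import shutil
import sysconfig


def toolchain_looks_usable() -> bool:
    """Heuristic (assumption): Python.h exists and a C++ compiler is on PATH."""
    include_dir = sysconfig.get_paths()["include"]
    has_headers = os.path.isfile(os.path.join(include_dir, "Python.h"))
    # Assumption: the compiler is named by CXX, otherwise g++.
    cxx = os.environ.get("CXX", "g++")
    has_compiler = shutil.which(cxx) is not None
    return has_headers and has_compiler


def suggested_backend() -> str:
    """Suggest "inductor" when the toolchain looks usable, else "eager"."""
    return "inductor" if toolchain_looks_usable() else "eager"
```

One possible use is to export `VLLM_SPYRE_SIMPLE_COMPILE_BACKEND` with the value returned by `suggested_backend()` before launching, so that environments without a toolchain keep the previous `eager` behavior.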
1 parent 33872d0 commit 0ae7872

1 file changed

+2
-2
lines changed

vllm_spyre/envs.py

Lines changed: 2 additions & 2 deletions
@@ -23,7 +23,7 @@
 VLLM_SPYRE_WORKER_LOG_REDIRECT_DIR: str = ""
 VLLM_SPYRE_GLOO_TIMEOUT_MINUTES: int = 60
 VLLM_SPYRE_REQUIRE_PRECOMPILED_DECODERS: bool = False
-VLLM_SPYRE_SIMPLE_COMPILE_BACKEND: str = "eager"
+VLLM_SPYRE_SIMPLE_COMPILE_BACKEND: str = "inductor"
 VLLM_SPYRE_NUM_CPUS: int = 0
 
 logger = init_logger(__name__)
@@ -172,7 +172,7 @@ def _backend_backwards_compat() -> str:
 # Defaults to eager, iductor can be used if python headers and a compiler
 # are available.
 "VLLM_SPYRE_SIMPLE_COMPILE_BACKEND":
-lambda: os.getenv("VLLM_SPYRE_SIMPLE_COMPILE_BACKEND", "eager"),
+lambda: os.getenv("VLLM_SPYRE_SIMPLE_COMPILE_BACKEND", "inductor"),
 
 # Configures the number of CPUs used when determining multi-threading
 # configurations
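The changed lookup can be reproduced in isolation to see how the new default interacts with an explicit override. This is a minimal standalone sketch, not the module's actual structure: `ENV_NAME`, `DEFAULT_BACKEND`, and `simple_compile_backend` are illustrative names, and only the `os.getenv` call with the `"inductor"` default comes from the diff.

```python
import os

ENV_NAME = "VLLM_SPYRE_SIMPLE_COMPILE_BACKEND"
DEFAULT_BACKEND = "inductor"  # default after this commit; previously "eager"


def simple_compile_backend() -> str:
    """Read the backend at call time, so later environment changes are seen."""
    return os.getenv(ENV_NAME, DEFAULT_BACKEND)
```

With the variable unset, the function returns `"inductor"`; after setting `VLLM_SPYRE_SIMPLE_COMPILE_BACKEND=eager` it returns `"eager"`, which is the workaround named in the commit description.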
