
feat: Add feature gate #798

Open
wants to merge 1 commit into
base: main

Conversation

yeahdongcn
Contributor

This PR introduces a feature gate mechanism in ktransformers, providing centralized control over specific features. This streamlines feature management and reduces frequent invocations of get_compute_capability.
Vendors can extend this file to implement more fine-grained controls.

Testing Done

$ python ./ktransformers/local_chat.py --force_think true --cpu_infer 64 --model_path deepseek-ai/DeepSeek-R1 --gguf_path /home/gpuserver/models/DeepSeek-R1-Q4_K_M --port 10002
flashinfer not found, use triton for linux
Feature gate initialized: KTRANSFORMERS_USE_TORCH_NATIVE=False, KTRANSFORMERS_USE_FLASHINFER=False
using custom modeling_xxx.py.
using default_optimize_rule for DeepseekV3ForCausalLM
Injecting model as ktransformers.operators.models . KDeepseekV2Model
...

@yeahdongcn
Contributor Author

#787 is no longer necessary.

Signed-off-by: Xiaodong Ye <[email protected]>
@yeahdongcn
Contributor Author

Rebased from upstream/main.
