
Loading base model/lora adapters from s3 fails and throws HF error #766

Open
robert-moyai opened this issue Feb 28, 2025 · 0 comments
Labels: area/lora, kind/bug


🐛 Describe the bug

I'm trying to initialize AIBrix using model files hosted on S3, but the vllm-openai container fails to start because it incorrectly treats the S3 URI as a Hugging Face repo id:

huggingface_hub.errors.HFValidationError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': 's3://s3-bucket/llama-3.1-nemoguard-8b-topic-control'. Use repo_type argument if needed.
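
The validation failure can be reproduced outside the cluster: the error text comes from huggingface_hub's repo-id check, which rejects anything that is not of the form `repo_name` or `namespace/repo_name`. A minimal sketch, assuming a recent huggingface_hub release where `validate_repo_id` is exposed under `huggingface_hub.utils` (the URI is simply the one from this report):

```python
# Minimal repro of the validation error, outside of AIBrix/vLLM.
# Assumes huggingface_hub is installed; the URI is the one from this report.
from huggingface_hub.errors import HFValidationError
from huggingface_hub.utils import validate_repo_id

try:
    validate_repo_id("s3://s3-bucket/llama-3.1-nemoguard-8b-topic-control")
except HFValidationError as err:
    # Prints: Repo id must be in the form 'repo_name' or 'namespace/repo_name': ...
    print(err)
```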

Steps to Reproduce

  1. Upload the model llama-3.1-nemoguard-8b-topic-control to an S3 bucket.
  2. Deploy AIBrix following the docs: https://aibrix.readthedocs.io/latest/features/lora-dynamic-loading.html#create-base-model, referencing the model by its S3 path: s3://s3-bucket/llama-3.1-nemoguard-8b-topic-control
  3. Inspect the pod logs and find the error above; the pod crashes when the manifest is applied.
  4. The same issue occurs when loading only the LoRA adapter from S3.

Expected behavior

The pod should start and serve the model, just as it does when the model is downloaded from the Hugging Face Hub.
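
For illustration only, the expected behavior amounts to dispatching on the URI scheme before handing anything to huggingface_hub. The sketch below is an assumption about what such a loader could look like, not AIBrix's actual implementation; the `fetch_model` helper and all names in it are hypothetical.

```python
# Illustrative sketch only -- not AIBrix's implementation.
# Assumes boto3 and huggingface_hub are installed; fetch_model is hypothetical.
import os
from urllib.parse import urlparse

import boto3
from huggingface_hub import snapshot_download


def fetch_model(model_ref: str, local_dir: str) -> str:
    """Download model files to local_dir, choosing S3 or the HF Hub by URI scheme."""
    if model_ref.startswith("s3://"):
        parsed = urlparse(model_ref)
        bucket, prefix = parsed.netloc, parsed.path.lstrip("/")
        s3 = boto3.client("s3")
        # Copy every object under the prefix into local_dir.
        paginator = s3.get_paginator("list_objects_v2")
        for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
            for obj in page.get("Contents", []):
                rel = obj["Key"][len(prefix):].lstrip("/")
                if not rel or obj["Key"].endswith("/"):
                    continue  # skip folder-marker objects
                dest = os.path.join(local_dir, rel)
                os.makedirs(os.path.dirname(dest), exist_ok=True)
                s3.download_file(bucket, obj["Key"], dest)
        return local_dir
    # Anything else is treated as a Hugging Face repo id, which is where the
    # reported HFValidationError comes from when an s3:// URI slips through.
    return snapshot_download(repo_id=model_ref, local_dir=local_dir)
```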

Environment

  • AIBrix version: 0.20.0
  • Kubernetes deployment
  • AWS S3 bucket containing the model files
@Jeffwan added the kind/bug and area/lora labels on Mar 1, 2025