Skip to content

Pretrained Llama 2 Model URI leads to a .tar.gz with dummy.txt fileΒ #4122

Open
@abdullahsych

Description

@abdullahsych

Describe the bug
A clear and concise description of what the bug is.

I get the error: not a gzip file when I run the following code:

model_id, model_version = (
    "meta-textgeneration-llama-2-7b-f",
    "1.2.0",
)
scope = "inference" 

model_uri = model_uris.retrieve(
    model_id=model_id,
    model_version=model_version,
    model_scope=scope,
)

print(f"model_url: {model_uri}\n")

script_uri = script_uris.retrieve(
    model_id=model_id,
    model_version=model_version,
    script_scope=scope,
)

print(f"script_uri: {script_uri}\n")

instance_type = instance_types.retrieve_default(
    model_id=model_id, model_version=model_version, scope=scope
)

print(f"instance_type: {instance_type}\n")

image_uri = image_uris.retrieve(
    region=None,
    framework=None,
    image_scope=scope,
    model_id=model_id,
    model_version=model_version,
    instance_type=instance_type,
)

print(f"image_uri: {image_uri}\n")

model = Model(
    image_uri=image_uri,
    model_data=model_uri,
    source_dir=script_uri,
    entry_point="inference.py",
    role=role,
    name="some-model-name",
    predictor_cls=Predictor,
)

health_check_timeout = 300

predictor = model.deploy(
    initial_instance_count=1,
    instance_type=instance_type,
    endpoint_name="some-endpoint-name",
)

After inspecting the model uri on s3 https://s3.console.aws.amazon.com/s3/object/jumpstart-cache-prod-us-east-1?region=us-east-1&prefix=meta-infer%2Finfer-meta-textgeneration-llama-2-7b-f.tar.gz I noticed that it only contains a dummy.txt file.

To reproduce
A clear, step-by-step set of instructions to reproduce the bug.

Expected behavior
A clear and concise description of what you expected to happen.

It get a valid Model URI with model artifacts.

Screenshots or logs
If applicable, add screenshots or logs to help explain your problem.

System information
A description of your system. Please provide:

  • SageMaker Python SDK version: version 2.185.0
  • Framework name (eg. PyTorch) or algorithm (eg. KMeans):
  • Framework version:
  • Python version: 3.10.2
  • CPU or GPU:
  • Custom Docker image (Y/N):

Additional context
Add any other context about the problem here.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions