Skip to content

[Feature] Support Google Gemma ModelsΒ #72

@sahiljoshi515

Description

@sahiljoshi515

πŸš€ Feature Request

Describe the feature you'd like
A clear and concise description of the new functionality.
Google Gemma uses a softcap in attention computation which needs to be supported in skylight.

Why is it needed?
Explain the problem it solves or the value it adds.
Need more model support

Additional context (optional)
Any links, references, screenshots, or related issues.
https://github.com/huggingface/transformers/blob/main/src/transformers/models/gemma/modeling_gemma.py

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions