About implementation of MLP layer.

Hi, excellent work!

I noticed it is a little different from the vanilla LLama MLP layer in terms of the implementation of MLP layer.

In this paper, the `feedforward_channels` is defined as follows  https://github.com/Meituan-AutoML/VisionLLaMA/blob/33fa5618044a8be5ce3d2f102d5b16249058cd3c/mmpretrain/mmpretrain/models/utils/swiglu_ffn.py#L118 while the vanilla LLama feedforward_channels is just what it is.

Is there any consideration for this modification?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About implementation of MLP layer. #7

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

About implementation of MLP layer. #7

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions