chore: add llama mlp_bias reading from hf_config #3476


Open
wants to merge 2 commits into
base: main

Conversation

WhatGhost

When I tried to convert and build a Llama model with attn_bias and mlp_bias, I noticed that in the config of the trtllm checkpoint, mlp_bias is always false, even though I added mlp_bias to the hf_model's config.

So I checked the trtllm code and found that llama's config.py doesn't read the mlp_bias param the way it reads attn_bias.
So I added it, and the model can now be converted and built correctly.
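
For readers without the diff at hand, the kind of change described above can be sketched as follows. This is a minimal illustration, not the actual patch: the helper name `read_bias_flags` is hypothetical, the Hugging Face field names `attention_bias` and `mlp_bias` are assumed from the `LlamaConfig` convention, and a `SimpleNamespace` stands in for a loaded HF config.

```python
from types import SimpleNamespace


def read_bias_flags(hf_config):
    """Return (attn_bias, mlp_bias) read from an HF-style config object.

    Hypothetical helper: both flags fall back to False when the
    attribute is absent, so older configs keep working unchanged.
    """
    # Assumed HF field name for the attention projection bias.
    attn_bias = getattr(hf_config, "attention_bias", False)
    # The flag the PR says was not being read before.
    mlp_bias = getattr(hf_config, "mlp_bias", False)
    return bool(attn_bias), bool(mlp_bias)


# Stand-in for a loaded Hugging Face config with both biases enabled.
hf_config = SimpleNamespace(attention_bias=True, mlp_bias=True)
attn_bias, mlp_bias = read_bias_flags(hf_config)

# A config that predates the mlp_bias field still resolves to False.
legacy_flags = read_bias_flags(SimpleNamespace(attention_bias=True))
```

Reading with a `False` default mirrors how an optional flag is usually handled, so checkpoints converted from configs without the field would be unaffected.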

@juney-nvidia juney-nvidia added Community want to contribute PRs initiated from Community Community Engagement help/insights needed from community labels Apr 11, 2025
@juney-nvidia juney-nvidia changed the title chore: add llama mpl_bias reading from hf_config chore: add llama mlp_bias reading from hf_config Apr 11, 2025
@juney-nvidia
Collaborator

@WhatGhost Hi, thanks for the contribution. Please sign off your commits with your own information first so that the DCO check passes.

Thanks
June

lucifer1004 and others added 2 commits April 14, 2025 03:59
@WhatGhost
Author

@juney-nvidia Hi, I have signed off my commits following the guide.

@WhatGhost
Author

@juney-nvidia Hi, is there anything else I need to do?

@poweiw poweiw added Generic Runtime General operational aspects of TRTLLM execution not in other categories. triaged Issue has been triaged by maintainers labels Jun 5, 2025
5 participants