chore: add llama mlp_bias reading from hf_config #3476

WhatGhost · 2025-04-11T06:42:30Z

I want to convert and build a llama model that has both attn_bias and mlp_bias. I noticed that in the config of the trtllm checkpoint, the mlp bias is always false, even when I add mlp_bias to the hf_model's config.

I checked the trtllm code and found that llama's config.py doesn't read the mlp_bias param the way it reads attn_bias.
So I added it, and the model can now be converted and built correctly.
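
For readers following along, here is a minimal sketch of the kind of lookup described above. It is not the actual diff from this PR: the helper name `read_bias_flags` and the use of `SimpleNamespace` as a stand-in for the HF config object are assumptions made for illustration. The only thing taken from the Hugging Face side is that the Llama config exposes `attention_bias` and `mlp_bias` fields; older configs may lack `mlp_bias`, so the sketch defaults it to `False`.

```python
from types import SimpleNamespace


def read_bias_flags(hf_config):
    """Hypothetical helper: collect bias flags from an HF-style config object.

    Missing attributes fall back to False so that configs written before
    `mlp_bias` existed still load.
    """
    return {
        "attn_bias": bool(getattr(hf_config, "attention_bias", False)),
        "mlp_bias": bool(getattr(hf_config, "mlp_bias", False)),
    }


# Stand-in for a config with both biases enabled.
hf_config = SimpleNamespace(attention_bias=True, mlp_bias=True)
flags = read_bias_flags(hf_config)

# Stand-in for an older config that defines neither field.
legacy_flags = read_bias_flags(SimpleNamespace())
```

The point of the `getattr` default is that skipping the lookup entirely and defaulting to `False` would produce the symptom reported here: the converted checkpoint config reports no MLP bias regardless of what the HF config says.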

juney-nvidia · 2025-04-11T11:48:03Z

@WhatGhost Hi, thanks for the contribution. Please sign off your commits with your own information first so that the DCO check passes.

Thanks
June

WhatGhost · 2025-04-14T04:09:57Z

@juney-nvidia Hi, I have signed off my commits following the guide.

WhatGhost · 2025-04-17T02:34:54Z

@juney-nvidia Hi, is there any other work I need to do?

juney-nvidia added the "Community want to contribute" (PRs initiated from Community) and "Community Engagement" (help/insights needed from community) labels Apr 11, 2025

juney-nvidia changed the title ~~chore: add llama mpl_bias reading from hf_config~~ chore: add llama mlp_bias reading from hf_config Apr 11, 2025

lucifer1004 and others added 2 commits April 14, 2025 03:59

fix: remove DeepGEMM line info (NVIDIA#3411)

e6d8052

Signed-off-by: Zihua Wu <[email protected]> Signed-off-by: whatghost <[email protected]>

add mpl_bias reading from hf_config

c517c61

Signed-off-by: whatghost <[email protected]>

WhatGhost force-pushed the dev/llama_bias branch from 581ce15 to c517c61 Compare April 14, 2025 04:07

poweiw assigned Shixiaowei02 Jun 5, 2025

poweiw added the "Generic Runtime" (General operational aspects of TRTLLM execution not in other categories.) and "triaged" (Issue has been triaged by maintainers) labels Jun 5, 2025
