@hhhhzl commented Feb 14, 2025

Add GLM-4

Files changed:

  • add glm.py and glm_layer.py
  • add GLM4 to AutoModel
  • add the GLM4 template to templates.py (double-checked that it is the correct template)
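The exact template string lives in templates.py and isn't shown in this PR; for reviewers, here is a minimal sketch of how a glm4-style chat template renders a prompt. The `[gMASK]<sop>` prefix and `<|user|>`/`<|assistant|>` role tags are assumptions based on GLM-4's public tokenizer configuration, not copied from this repo.

```python
# Sketch of a glm4-style chat template. The exact string is defined in
# templates.py; the special tokens below are assumptions taken from
# GLM-4's public tokenizer config.
GLM4_PREFIX = "[gMASK]<sop>"

def render_glm4(messages):
    """Render a list of {"role", "content"} dicts into one prompt string."""
    parts = [GLM4_PREFIX]
    for m in messages:
        # Each turn is a role tag followed by a newline and the content.
        parts.append(f"<|{m['role']}|>\n{m['content']}")
    # Trailing assistant tag cues the model to generate the reply.
    parts.append("<|assistant|>\n")
    return "".join(parts)

prompt = render_glm4([{"role": "user", "content": "Hello"}])
# → "[gMASK]<sop><|user|>\nHello<|assistant|>\n"
```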

Test:

  • python generate.py --model THUDM/glm-4-9b-chat --template glm4
  • HF reference result: python hf_generate.py --model THUDM/glm-4-9b-chat

Challenges/Questions:

  • I got different results after switching AutoModelForCausalLM to GlmForCausalLM. However, with GlmForCausalLM I can get a consistent (though unreadable) output if I set top_k, top_p, and temperature to 0. My intuition is that GlmForCausalLM uses slightly different settings for GLM-style models.
  • generate.py also produced different, unreadable outputs, which makes the two hard to compare given challenge 1.
  • Do we need to pass trust_remote_code=True for GLMs?
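For challenge 1, a more reliable way to get a reproducible comparison than zeroing the sampling parameters is to disable sampling entirely (`do_sample=False`), since with greedy decoding top_k/top_p/temperature are ignored and both code paths should emit identical token sequences. A sketch, with the model-loading part shown only as a usage note (the model name is the one from this PR; whether trust_remote_code is still required with the native GlmForCausalLM is exactly the open question above):

```python
def greedy_kwargs(max_new_tokens=128):
    """Deterministic generation settings: greedy decoding, no sampling.
    With do_sample=False, top_k/top_p/temperature have no effect, so two
    implementations should produce identical token sequences."""
    return {
        "do_sample": False,      # greedy: always pick the argmax token
        "num_beams": 1,          # plain greedy, no beam search
        "max_new_tokens": max_new_tokens,
    }

if __name__ == "__main__":
    # Usage sketch (requires transformers and the model weights; not run here).
    # trust_remote_code=True should only matter when loading the repo's custom
    # modeling code rather than the built-in GlmForCausalLM -- an assumption.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    tok = AutoTokenizer.from_pretrained("THUDM/glm-4-9b-chat", trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained("THUDM/glm-4-9b-chat", trust_remote_code=True)
    ids = tok("Hello", return_tensors="pt").input_ids
    out = model.generate(ids, **greedy_kwargs())
    print(tok.decode(out[0], skip_special_tokens=True))
```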
