You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
elephantpanda
changed the title
Bug in 0.5.2
Bug DMLFusedNode_0_0 on second token in 0.5.2 (DML)
Dec 3, 2024
elephantpanda
changed the title
Bug DMLFusedNode_0_0 on second token in 0.5.2 (DML)
Bug DMLFusedNode_0_0 on second token in 0.5.2 (DML) (Wrong tensor shape)
Dec 3, 2024
I updated to 0.5.2 DirectML mode. Quadro P5000 GPU. Windows. C# DirectML 1.15.4. Model: microsoft/Phi-3-mini-4k-instruct-onnx
I get the following bug. (Bug only comes up in DML mode not CPU mode. It worked in version 0.4.0).
For any prompt e.g.
"<|user|>Hello <|end|><|assistant|>"
(tokens:)
1,32010,15043,29871,32007,32001
It outputs one token but then when trying to output the second token it errors out with:
The relevant part seems to be: Current shape:{1,32,7,96} Requested shape:{1,32,2048,96}
It appears to be not padding the tokens to the max_tokens or something.
The text was updated successfully, but these errors were encountered: