Lookahead in forward #1447
Labels
bug
Something isn't working
help wanted
Extra attention is needed
low priority
(will be worked on after all priority issues)
System Info
transformers '4.47.0', bitsanbytes '0.45.0', torch '2.5.1+cu124', NVIDIA RTX A4000
Reproduction
During assisted greedy generation with the verifier loaded in 8-bit, I have noticed that it diverges from vanilla greedy generation. I have investigated this problem and discovered lookahead during forward on a particular sequence. Here's a snippet of the notebook without unnecessary code. You can run it and see the problem, but you need a Llama2 chat token because it's the model I was using.
https://gist.github.com/Dionysour/24b352bb685f7d4a8ffd18896455700d
Expected behavior
change of [n] token doesn't affect logits of [:n] tokens
The text was updated successfully, but these errors were encountered: