Hi
I noticed that the result of torch.allclose(xbow, xbow2), torch.allclose(xbow, xbow3) are all false when running the Collab example gpt-dev.ipynb in The mathematical trick in self-attention section. Here is what I got, has anyone encountered the same issue?

Hi

I noticed that the result of
torch.allclose(xbow, xbow2), torch.allclose(xbow, xbow3)are allfalsewhen running the Collab examplegpt-dev.ipynbin The mathematical trick in self-attention section. Here is what I got, has anyone encountered the same issue?