### 🐛 Describe the bug hf_Reformer inference with bs=128 performance regression from 505ms to 623ms due to https://github.com/pytorch/pytorch/commit/e2917a38a000a3699a2bc4ea29e5549cbf5c3ed6 ### Versions b580