Even with GGUF models and the memory arguments set, it crashes with an OOM error right after the initial cell. I'm using the free Colab environment with 12 GB of VRAM.
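
To narrow down whether system RAM or GPU memory is what runs out, a quick check along these lines could be run in a cell before the one that crashes. This is only a rough sketch, assuming a Linux runtime (as on Colab) with `/proc/meminfo` available and, optionally, `nvidia-smi` on the PATH. The helper names `parse_meminfo`, `kib_to_gib`, and `report_memory` are made up for illustration and are not part of any library.

```python
import os
import shutil
import subprocess


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of {field: value}.

    Most fields are reported in kB (KiB); a few are plain counts.
    """
    info = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            info[name.strip()] = int(parts[0])
    return info


def kib_to_gib(kib):
    """Convert KiB to GiB."""
    return kib / (1024 * 1024)


def report_memory():
    """Print system RAM and, if nvidia-smi is present, GPU memory."""
    # System RAM: assumes a Linux runtime that exposes /proc/meminfo.
    if os.path.exists("/proc/meminfo"):
        with open("/proc/meminfo") as f:
            info = parse_meminfo(f.read())
        for key in ("MemTotal", "MemAvailable"):
            if key in info:
                print(f"{key}: {kib_to_gib(info[key]):.2f} GiB")

    # GPU memory: skipped when no NVIDIA driver tools are installed.
    if shutil.which("nvidia-smi"):
        result = subprocess.run(
            [
                "nvidia-smi",
                "--query-gpu=memory.total,memory.used",
                "--format=csv,noheader",
            ],
            capture_output=True,
            text=True,
            check=False,
        )
        print("GPU (total, used):", result.stdout.strip())


if __name__ == "__main__":
    report_memory()
```

If `MemAvailable` is already low before the model loads, the crash is more likely a system RAM limit than a VRAM one, though that is only a heuristic.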