Skip to content

srka99/dockerLLM

This branch is 3 commits ahead of, 14 commits behind TheBlokeAI/dockerLLM:main.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

26d933d · Oct 18, 2023

History

41 Commits
Jun 20, 2023
Jun 17, 2023
Aug 5, 2023
Jun 30, 2023
Oct 18, 2023
Jul 10, 2023
Jul 23, 2023
Jul 23, 2023
Jul 10, 2023
Jun 19, 2023
Jun 19, 2023
Aug 1, 2023
Aug 1, 2023
Jun 5, 2023
Jul 23, 2023
Jul 23, 2023
Jul 23, 2023
Aug 5, 2023
Oct 16, 2023

Repository files navigation

TheBloke's Docker templates

Update: 23rd July 2023 - Llama 2 support, including Llama 2 70B in ExLlama

  • Llama 2 models, including Llama 2 70B, are now fully supported
  • Updated to latest text-generation-webui requirements.txt
  • Removed the exllama pip package installed by text-generation-webui
    • Therefore the ExLlama kernel will build automatically on first use
    • This ensures that ExLlama is always up-to-date with any new ExLlama commits (which are pulled automatically on each boot)
  • Added simple build script for building the Docker containers

Update: 28th June 2023 - SuperHOT fixed

  • Updated to latest ExLlama code, fixing issue with SuperHOT GPTQs
  • ExLlama now automaticaly updates on boot, like text-generation-webui already did
    • This should result in the template automatically supporting new ExLlama features in future

Update: 19th June 2023

  • Major update to the template
  • text-generation-webui is now integrated with:
    • AutoGPTQ with support for all Runpod GPU types
    • ExLlama, turbo-charged Llama GPTQ engine - performs 2x faster than AutoGPTQ (Llama 4bit GPTQs only)
    • CUDA-accelerated GGML support, with support for all Runpod systems and GPUs.
  • All text-generation-webui extensions are included and supported (Chat, SuperBooga, Whisper, etc).
  • text-generation-webui is always up-to-date with the latest code and features.
  • Automatic model download and loading via environment variable MODEL.
  • Pass text-generation-webui parameters via environment variable UI_ARGS.

Runpod: TheBloke's Local LLMs UI

Runpod template link

Full documentation is available here

Runpod: TheBloke's Local LLMs UI & API

Runpod template link

Full documentation is available here

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 65.7%
  • Python 22.9%
  • Vim Script 6.4%
  • Dockerfile 5.0%