Popular repositories Loading
-
lit-llama
lit-llama PublicForked from Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Python 1
-
DistiLlama
DistiLlama PublicForked from shreyaskarnik/DistiLlama
Chrome Extension to Summarize Web Pages Using locally running LLMs
TypeScript 1
-
QuaRot
QuaRot PublicForked from spcl/QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
Python 1
-
R2R
R2R PublicForked from SciPhi-AI/R2R
The framework for fast development and deployment of RAG backends.
Python 1
-
-
llm_finetuning
llm_finetuning PublicForked from taprosoft/llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes)
Python
If the problem persists, check the GitHub status page or contact support.