Skip to content

QuantKit is a flexible and modular framework for generating quantized machine learning models with highly customizable workflows. It supports layer-wise quantization, asymmetric/symmetric schemes, post-training and dynamic quantization, and integrates seamlessly with the Hugging Face Transformers ecosystem.

Notifications You must be signed in to change notification settings

MuhammadAliS/QuantKit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 

Repository files navigation

QuantKit πŸ”§πŸ“¦

A flexible toolkit for customizable transformer model quantization

License Python Transformers Status


QuantKit is a modular and extensible framework for building and exporting quantized transformer models with fine-grained control. Whether you're targeting edge deployment, reducing inference latency, or experimenting with quantization strategies, QuantKit gives you all the knobs.

Designed with researchers and engineers in mind, QuantKit supports layer-wise quantization, asymmetric/symmetric schemes, 4/8-bit precision, and LoRA integration for efficient fine-tuning.


πŸš€ Features

  • βœ… Layer-wise, selective, or full-model quantization
  • πŸ”’ Supports 4-bit, 8-bit, mixed-precision
  • βš™οΈ Implements symmetric and asymmetric quantization
  • 🧠 Compatible with Hugging Face transformers and bitsandbytes
  • πŸ“¦ Easily save/load quantized models
  • πŸ€– Works with LoRA/PEFT for fine-tuning
  • πŸ§ͺ Offers a Python API and CLI for full control

πŸ“¦ Installation

git clone https://github.com/your-username/quantkit.git
cd quantkit
pip install -e .

About

QuantKit is a flexible and modular framework for generating quantized machine learning models with highly customizable workflows. It supports layer-wise quantization, asymmetric/symmetric schemes, post-training and dynamic quantization, and integrates seamlessly with the Hugging Face Transformers ecosystem.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published