LeanModels — Making Foundation Models Leaner and Meaner

Welcome to LeanModels, an organization founded by Tianyi Zhang dedicated to making foundation models, such as LLMs and diffusion models, more memory- and compute-efficient through practical compression and inference optimization techniques.

Explore our key projects:

  • DFloat11: A lossless LLM compression framework enabling efficient GPU inference
  • Bagel-DFloat11: DFloat11-compressed version of Bagel, a unified multimodal model
  • LeanQuant: Scalable, loss-error-aware quantization for LLMs

We welcome contributors, collaborators, and feedback! If you're working on model compression or efficient inference, feel free to reach out.

Pinned

  1. DFloat11 Public

    DFloat11: Lossless LLM Compression for Efficient GPU Inference

    Python · 552 stars · 33 forks

  2. Bagel-DFloat11 Public

    Forked from ByteDance-Seed/Bagel

    Python · 95 stars · 7 forks

  3. LeanQuant Public

    Code repository for ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"

    Python · 23 stars · 3 forks

Repositories

  • ComfyUI-DFloat11
    Python · 9 stars · 2 forks · 4 issues · 2 pull requests · Updated Aug 26, 2025
  • DFloat11 Public

    DFloat11: Lossless LLM Compression for Efficient GPU Inference

    Python · 552 stars · Apache-2.0 · 33 forks · 19 issues · 0 pull requests · Updated Aug 24, 2025
  • SketchTune Public

    Code for the ICML 2025 paper "Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation"

    Python · 5 stars · 0 forks · 1 issue · 0 pull requests · Updated Jun 30, 2025
  • OmniGen2-DFloat11
    Jupyter Notebook · 11 stars · Apache-2.0 · 2 forks · 1 issue · 0 pull requests · Updated Jun 27, 2025
  • .github Public
    0 stars · 0 forks · 0 issues · 0 pull requests · Updated May 26, 2025
  • Bagel-DFloat11
    Python · 95 stars · Apache-2.0 · 449 forks · 0 issues · 0 pull requests · Updated May 25, 2025
  • LeanQuant Public

    Code repository for ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"

    Python · 23 stars · 3 forks · 3 issues · 0 pull requests · Updated Mar 2, 2025
