
Support FLUX OneTrainer LoRA formats (incl. DoRA) #7590

Merged
21 commits merged into main on Jan 28, 2025

Conversation

RyanJDick (Collaborator) commented on Jan 24, 2025

Summary

This PR adds support for the FLUX LoRA model format produced by OneTrainer.

Specifically, this PR adds:

  • Support for DoRA patches
  • Support for patch models that modify the FLUX T5 encoder
  • Probing / loading support for OneTrainer models (a rough key-detection sketch follows this list)
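
As a rough illustration of the probing step, a OneTrainer DoRA checkpoint can be recognized from its state-dict keys. The snippet below is a simplified sketch, not the probe code added in this PR; the key suffixes (`lora_down.weight`, `lora_up.weight`, `dora_scale`) are assumptions about OneTrainer-style checkpoints.

```python
from typing import Dict

import torch


def looks_like_onetrainer_dora(state_dict: Dict[str, torch.Tensor]) -> bool:
    """Heuristic check for a OneTrainer-style DoRA state dict (illustrative only)."""
    # Classic LoRA checkpoints ship paired down/up projection weights...
    has_lora_pairs = any(k.endswith("lora_down.weight") for k in state_dict) and any(
        k.endswith("lora_up.weight") for k in state_dict
    )
    # ...while DoRA checkpoints additionally carry a per-module magnitude vector.
    has_dora_scale = any("dora_scale" in k for k in state_dict)
    return has_lora_pairs and has_dora_scale
```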

Known limitations

  • DoRA patches cannot currently be applied to base weights that are quantized with bitsandbytes. The DoRA algorithm needs access to the original model weight in order to compute the patch diff, and the bitsandbytes quantization layers make that difficult. DoRA patches can be applied to non-quantized and GGUF-quantized layers without issue. (A short sketch of the DoRA patch math follows this list.)
  • This PR causes a slight speed regression for one specific inference combination: a quantized base model plus a LoRA with diffusers keys (i.e. one that uses the MergedLayerPatch). Now that more LoRA formats use the MergedLayerPatch, maintaining the previous optimization for this case was becoming too much work. Speed regresses from ~1.7 it/s to ~1.4 it/s.
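
For reference, this is roughly why the base weight is needed: DoRA re-normalizes the combined weight (base + LoRA delta) and rescales it by a learned magnitude vector, so the patch cannot be computed from the LoRA matrices alone. The sketch below is a simplified illustration, not the MergedLayerPatch / sidecar code from this PR; the tensor names and the norm axis follow common DoRA implementations and are assumptions.

```python
import torch


def apply_dora_patch(
    w_base: torch.Tensor,      # [out_features, in_features]; must be a plain (dequantized) tensor
    lora_up: torch.Tensor,     # [out_features, rank]
    lora_down: torch.Tensor,   # [rank, in_features]
    dora_scale: torch.Tensor,  # [out_features, 1]; learned magnitude vector
    alpha_scale: float = 1.0,
) -> torch.Tensor:
    """Compute a DoRA-patched weight (illustrative sketch, not InvokeAI's implementation)."""
    # Standard LoRA delta.
    delta = alpha_scale * (lora_up @ lora_down)
    # DoRA normalizes the *combined* weight, so the original w_base is required here.
    # With a bitsandbytes-quantized layer, w_base is not directly available as a plain
    # tensor, which is the source of the limitation described above.
    combined = w_base + delta
    norm = combined.norm(dim=1, keepdim=True)  # per-output-row norm (common convention)
    return dora_scale * combined / norm
```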

Future Notes

  • We may want to consider dropping support for bitsandbytes quantization. It is very difficult to maintain compatibility across features like partial loading and LoRA patching.
  • At a future time, we should refactor the LoRA parsing logic to be more generalized rather than handling each format independently.
  • There are some redundant device casts and dequantizations in autocast_linear_forward_sidecar_patches(...) (and its sub-calls). Optimizing this is left for future work.

Related Issues / Discussions

QA Instructions

OneTrainer test models:

The following tests were repeated with each of the OneTrainer test models:

  • Test with non-quantized base model
  • Test with GGUF-quantized base model
  • Test with BnB-quantized base model
  • Test with non-quantized base model that is partially-loaded onto the GPU

Other regression tests:

  • Test some SD1 LoRAs
  • Test some SDXL LoRAs
  • Test a variety of existing FLUX LoRA formats
  • Test a FLUX Control LoRA on all base model quantization formats.

Merge Plan

No special instructions.

Checklist

  • The PR has a short but descriptive title, suitable for a changelog
  • Tests added / updated (if applicable)
  • Documentation added / updated (if applicable)
  • Updated What's New copy (if doing a release after this PR)

@github-actions bot added the python, invocations, backend, frontend, and python-tests labels on Jan 24, 2025
@RyanJDick force-pushed the ryan/flux-dora-one-trainer-concatenated branch from 14c5f31 to 5085a8c on January 24, 2025 at 19:50
@RyanJDick marked this pull request as ready for review on January 24, 2025 at 20:24
@RyanJDick force-pushed the ryan/flux-dora-one-trainer-concatenated branch from 07d83b7 to 229834a on January 28, 2025 at 15:11
@RyanJDick merged commit debcbd6 into main on Jan 28, 2025 (15 checks passed)
@RyanJDick deleted the ryan/flux-dora-one-trainer-concatenated branch on January 28, 2025 at 17:50