Update dependency peft to v0.18.0 #61
This PR contains the following updates:
peft: ==0.3.0 -> ==0.18.0
Warning
Some dependencies could not be looked up. Check the warning logs for more information.
Release Notes
huggingface/peft (peft)
v0.18.0: RoAd, ALoRA, Arrow, WaveFT, DeLoRA, OSF, and more (Compare Source)
Highlights
New Methods
RoAd
@ppetrushkov added RoAd: 2D Rotary Adaptation to PEFT in #2678. RoAd learns 2D rotation matrices that are applied using only element-wise multiplication, thus promising very fast inference with adapters in unmerged state.
Remarkably, besides LoRA, RoAd is the only PEFT method that supports mixed adapter batches. This means that when you have loaded a model with multiple RoAd adapters, you can use all of them for different samples in the same batch, which is much more efficient than switching adapters between batches.
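As a rough sketch (the base model, adapter paths, and adapter names below are placeholders), mixed adapter batches can look like this:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Placeholder base model and adapter checkpoints.
base = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-125m")
tokenizer.padding_side = "left"  # decoder-only models generate from the right

model = PeftModel.from_pretrained(base, "path/to/road-adapter-a", adapter_name="adapter_a")
model.load_adapter("path/to/road-adapter-b", adapter_name="adapter_b")

prompts = ["Translate to French: cheese", "Summarize: PEFT supports many adapters.", "Hello there,"]
inputs = tokenizer(prompts, return_tensors="pt", padding=True)

# One adapter name per sample in the batch; "__base__" routes a sample to the base model.
with torch.no_grad():
    out = model.generate(
        **inputs,
        adapter_names=["adapter_a", "adapter_b", "__base__"],
        max_new_tokens=20,
    )
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```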
ALoRA
Activated LoRA is a technique added by @kgreenewald in #2609 for causal language models. It allows LoRA adapters to be enabled selectively, depending on a specific token invocation sequence in the input. This has the major benefit that most of the KV cache can be re-used during inference when the adapter is only used to generate part of the response, after which the base model takes over again.
Arrow & GenKnowSub
@TheTahaaa contributed not only support for Arrow (#2644), a dynamic routing algorithm between multiple loaded LoRAs, but also GenKnowSub, a technique built on top of Arrow in which the 'library' of LoRAs available to Arrow is first modified by subtracting general-knowledge adapters (e.g., trained on subsets of Wikipedia) to enhance task-specific performance.
WaveFT
Thanks to @Bilican, Wavelet Fine-Tuning (WaveFT) was added to PEFT in #2560. This method trains sparse updates in the wavelet domain of residual matrices, which is especially parameter efficient. It is very interesting for image generation, as it promises to generate diverse outputs while preserving subject fidelity.
DeLoRA
Decoupled Low-rank Adaptation (DeLoRA) was added by @mwbini in #2780. This new PEFT method is similar to DoRA insofar as it decouples the angle and magnitude of the learned adapter weights. However, DeLoRA implements this in a way that promises to better prevent divergence. Moreover, it constrains the deviation of the learned weight by imposing an upper limit on its norm, which can be adjusted via the delora_lambda parameter.
OSF
Orthogonal Fine-Tuning (OSF) was added by @NikhilNayak-debug in #2685. By freezing the high-rank subspace of the targeted weight matrices and projecting gradient updates to a low-rank subspace, OSF achieves good performance on continual learning tasks. While it is a bit memory intensive for standard fine-tuning processes, it is definitely worth checking out on tasks where performance degradation of previously learned tasks is a concern.
Enhancements
Text generation benchmark
In #2525, @ved1beta added the text generation benchmark to PEFT. This is a framework to determine and compare metrics with regard to text generation of different PEFT methods, e.g. runtime and memory usage. Right now, this benchmark is still lacking experimental settings and a visualization, analogous to what we have in the MetaMathQA benchmark. If this is something that interests you, we encourage you to let us know or, even better, contribute to this benchmark.
Reliable interface for integrations
PEFT has integrations with other libraries like Transformers and Diffusers. To facilitate this integration, PEFT now provides a stable interface of functions that should be used if applicable. For example, the set_adapter function can be used to switch between PEFT adapters on the model, even if the model is not a PeftModel instance. We commit to keeping these functions backwards compatible, so it's safe for other libraries to build on top of them.
Handling of weight tying
Some Transformers models can have tied weights. This is especially prevalent when it comes to the embedding and the LM head. Currently, the way that this is handled in PEFT is not obvious. We thus drafted an issue to illustrate the intended behavior in #2864. This shows what our goal is, although not everything is implemented yet.
In #2803, @romitjain added the ensure_weight_tying argument to LoraConfig. This argument, if set to True, enforces weight tying of the modules targeted with modules_to_save. Thus, if the embedding and LM head are tied, they will share weights, which is important to allow, for instance, weight merging. Therefore, for most users, we recommend enabling this setting if they want to fully fine-tune the embedding and LM head. For backwards compatibility, the setting is off by default.
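A minimal sketch of enabling this setting is shown below; the model and the module names (q_proj, v_proj, embed_tokens, lm_head) are illustrative and architecture-dependent:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# A model whose input embedding and LM head share weights (tie_word_embeddings=True).
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")

config = LoraConfig(
    target_modules=["q_proj", "v_proj"],          # regular LoRA targets
    modules_to_save=["embed_tokens", "lm_head"],  # fully fine-tune the tied modules
    ensure_weight_tying=True,                     # keep them tied; off by default
)
peft_model = get_peft_model(model, config)
```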
Note that, in accordance with #2864, the functionality of ensure_weight_tying=True will be expanded to also include trainable tokens (#2870) and LoRA (tbd.) in the future.
Support Conv1d and 1x1 Conv2d layers in LoHa and LoKr
@grewalsk extended LoHa and LoKr to support nn.Conv1d layers, as well as nn.Conv2d layers with 1x1 kernels, in #2515.
New prompt tuning initialization
Thanks to @macmacmacmac, we now have a new initialization option for prompt tuning, random discrete initialization (#2815). This option should generally work better than random initialization, as corroborated on our PEFT method comparison suite. Give it a try if you use prompt tuning.
Combining LoRA adapters with negative weights
If you use multiple LoRA adapters, you can merge them into a single adapter using model.add_weighted_adapter. So far, however, this only worked with positive weights per adapter. Thanks to @sambhavnoobcoder and @valteu, it is now possible to pass negative weights too, as sketched below.
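As a rough sketch (base model, adapter paths, and adapter names are placeholders), a negative weight subtracts that adapter's contribution from the merge:

```python
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
model = PeftModel.from_pretrained(base, "path/to/adapter_a", adapter_name="adapter_a")
model.load_adapter("path/to/adapter_b", adapter_name="adapter_b")

# Negative weight: adapter_b's delta is subtracted instead of added.
model.add_weighted_adapter(
    adapters=["adapter_a", "adapter_b"],
    weights=[1.0, -0.5],
    adapter_name="merged",
    combination_type="linear",  # assumes both adapters use the same rank
)
model.set_adapter("merged")
```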
Changes
Transformers compatibility
At the time of writing, the Transformers v5 release is imminent. This Transformers version will be incompatible with PEFT < 0.18.0. If you plan to use Transformers v5 with PEFT, please upgrade PEFT to 0.18.0+.
Python version
This PEFT version no longer supports Python 3.9, which has reached its end of life. Please use Python 3.10+.
Updates to OFT
The OFT method was updated in #2805 to make it slightly faster and to stabilize its numerics. This means, however, that existing checkpoints may give slightly different results after upgrading to PEFT 0.18.0. Therefore, if you use OFT, we recommend retraining the adapter.
All Changes
- hub_online_once in trainable token tests by @githubnemo in #2701
- … issue for 8-bit model by @yao-matrix in #2797
- trainable_token_indices for lm_head by @aflueckiger in #2863
- max_length to replace max_seq_length; correct README for … by @kaixuanliu in #2862
New Contributors
Full Changelog: huggingface/peft@v0.17.1...v0.18.0
v0.17.1 (Compare Source)
This patch release contains a few fixes (via #2710) for the newly introduced target_parameters feature, which allows LoRA to target nn.Parameters directly (useful for mixture-of-experts layers). Most notably, using model.add_adapter or model.load_adapter did not work correctly; since a solution is not trivial, PEFT now raises an error to prevent this situation.
v0.17.0: SHiRA, MiSS, LoRA for MoE, and more (Compare Source)
Highlights
New Methods
SHiRA
@kkb-code contributed Sparse High Rank Adapters (SHiRA, paper), which promise a potential gain in performance over LoRA; in particular, the concept loss observed when using multiple adapters is improved. Since the adapters only train on 1-2% of the weights and are inherently sparse, switching between adapters may be cheaper than with LoRA. (#2584)
MiSS
@JL-er added a new PEFT method, MiSS (Matrix Shard Sharing) in #2604. This method is an evolution of Bone, which, according to our PEFT method comparison benchmark, gives excellent results when it comes to performance and memory efficiency. If you haven't tried it, you should do so now.
At the same time, Bone will be deprecated in favor of MiSS and will be removed in PEFT v0.19.0. If you already have a Bone checkpoint, you can use scripts/convert-bone-to-miss.py to convert it into a MiSS checkpoint and proceed with training using MiSS.
Enhancements
LoRA for nn.Parameter
LoRA is now able to target nn.Parameter directly (#2638, #2665)! Ever had a complicated nn.Module with promising parameters inside, but it was too custom to be supported by your favorite fine-tuning library? No worries: you can now target nn.Parameters directly using the target_parameters config attribute, which works similarly to target_modules.
This option can be especially useful for models with Mixture of Experts (MoE) layers, as those often use nn.Parameters directly and cannot be targeted with target_modules. For example, for the Llama4 family of models, a config along the lines of the sketch below can target the MoE weights.
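A rough sketch follows; the parameter names are assumptions for a Llama4-style MoE block and will differ between models:

```python
from peft import LoraConfig

config = LoraConfig(
    r=16,
    target_parameters=[
        # Assumed names of the MoE expert weights (nn.Parameters) in a Llama4-style block.
        "feed_forward.experts.gate_up_proj",
        "feed_forward.experts.down_proj",
    ],
)
```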
Note that this feature is still experimental, as it comes with a few caveats, and might therefore change in the future. Also, MoE weights with many experts can be quite large, so expect higher memory usage compared to targeting normal nn.Linear layers.
Injecting adapters based on a state_dict
Sometimes there is a PEFT adapter checkpoint, but the corresponding PEFT config is not known for whatever reason. To inject the PEFT layers for this checkpoint, you would usually have to reverse-engineer the corresponding PEFT config, most notably the target_modules argument, based on the state_dict from the checkpoint. This can be cumbersome and error-prone. To avoid this, it is also possible to call inject_adapter_in_model and pass the loaded state_dict as an argument, as in the sketch below.
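A rough sketch of this workflow (the checkpoint path is a placeholder; check the PEFT docs for the exact usage):

```python
from safetensors.torch import load_file
from transformers import AutoModelForCausalLM
from peft import LoraConfig, inject_adapter_in_model

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
state_dict = load_file("adapter_model.safetensors")  # adapter checkpoint with unknown config

# The layers to inject are derived from the state_dict keys, so the config
# does not need to spell out target_modules by hand.
model = inject_adapter_in_model(LoraConfig(), model, state_dict=state_dict)
```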
Find more on state_dict-based injection in the docs.
Changes
Compatibility
A bug in prompt learning methods caused modules_to_save to be ignored. Classification tasks are especially affected, since they usually add the classification/score layer to modules_to_save. As a consequence, these layers were neither trained nor stored after training. This has now been corrected. (#2646)
All Changes
New Contributors
Full Changelog: huggingface/peft@v0.16.0...v0.17.0
v0.16.0: LoRA-FA, RandLoRA, C³A, and much more (Compare Source)
Highlights
New Methods
LoRA-FA
In #2468, @AaronZLT added the LoRA-FA optimizer to PEFT. This optimizer is based on AdamW and it increases the memory efficiency of LoRA training. This means that you can train LoRA with less memory or, with the same memory budget, use higher LoRA ranks, potentially getting better results.
RandLoRA
Thanks to @PaulAlbert31, a new PEFT method called RandLoRA was added to PEFT (#2464). Similarly to VeRA, it uses non-learnable random low-rank matrices that are combined through learnable matrices. This way, RandLoRA can approximate full-rank updates of the weights. Training models quantized with bitsandbytes is supported.
C³A
@Phoveran added Circular Convolution Adaptation, C3A, in #2577. This new PEFT method can overcome the low-rank limitation of methods such as LoRA, while still promising to be fast and memory efficient.
Enhancements
- Thanks to @gslama12 and @SP1029, LoRA now supports Conv2d layers with groups != 1. This requires the rank r to be divisible by groups. See #2403 and #2567 for context, and the sketch after this list for an example.
- @dsocek added support for Intel Neural Compressor (INC) quantization to LoRA in #2499.
- DoRA now supports Conv1d layers thanks to @EskildAndersen (#2531).
- Passing init_lora_weights="orthogonal" now enables orthogonal weight initialization for LoRA (#2498).
- @gapsong brought us Quantization-Aware LoRA training in #2571. This can make QLoRA training more efficient; please check the included example. Right now, only GPTQ is supported.
- There has been a big refactor of Orthogonal Finetuning (OFT), thanks to @zqiu24 (#2575). This makes the PEFT method run more quickly and require less memory. It is, however, incompatible with old OFT checkpoints. If you have old OFT checkpoints, either pin the PEFT version to <0.16.0 or retrain them with the new PEFT version.
- Thanks to @keepdying, LoRA hotswapping with compiled models no longer leads to CUDA graph re-records (#2611).
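As referenced above, here is a minimal sketch of LoRA on a grouped Conv2d layer (the toy model is made up for illustration); the rank r must be divisible by groups:

```python
import torch.nn as nn
from peft import LoraConfig, get_peft_model

class TinyConvNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(8, 16, kernel_size=3, groups=4)  # grouped convolution
        self.head = nn.Linear(16, 2)

    def forward(self, x):
        x = self.conv(x).mean(dim=(2, 3))  # global average pool
        return self.head(x)

config = LoraConfig(r=8, target_modules=["conv"])  # r=8 is divisible by groups=4
model = get_peft_model(TinyConvNet(), config)
model.print_trainable_parameters()
```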
Changes
Compatibility
- requires_grad of modules_to_save is now set to True when used directly with inject_adapter. This is relevant for PEFT integrations, e.g. Transformers or Diffusers.
- Due to a model refactor in Transformers, if you previously applied PEFT on vlm.language_model, it will no longer work; please apply it to vlm directly (see #2554 for context). Moreover, the refactor results in different checkpoints. We managed to ensure backwards compatibility in PEFT, i.e. old checkpoints can be loaded successfully. There is, however, no forward compatibility, i.e. loading checkpoints trained after the refactor is not possible with package versions from before the refactor. In this case, you need to upgrade PEFT and Transformers. More context in #2574.
- Alternatively, pin PEFT and Transformers to older versions (<0.16.0 and <4.52.0, respectively).
All Changes
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about these updates again.
To execute skipped test pipelines, write the comment /ok-to-test.
Documentation
Find out how to configure dependency updates in MintMaker documentation or see all available configuration options in Renovate documentation.