
Conversation

@zongzhenyang

Implements CABS (Conflict-Aware and Balanced Sparsification) model merging technique from "CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging".

CABS aims to improve merged model quality by mitigating parameter interference through sequential conflict-aware pruning and applying n:m structural pruning to task vector components.

Key Features:

  • Sequential Conflict-Aware Pruning: Processes task vectors in a user-defined pruning_order, masking out parameters already claimed by prior models in the sequence before subsequent pruning. This minimizes destructive overlap.
  • N:M Structural Pruning:
    • Applies n:m pruning (retaining n largest magnitude weights out of every m consecutive weights) to the conflict-masked task vector components.
    • n and m values are configurable globally (default_n_val, default_m_val) and per-model (n_val, m_val).
  • Weighted Aggregation: Pruned task vectors are scaled by a weight (lambda) and added to the base model.
  • Added cabs.py implementing CABSMerge and CABSTask.
  • Added CABS example configuration in examples/cabs.yml.
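For illustration, the pipeline described above (conflict masking, then n:m pruning, then weighted aggregation) could be sketched roughly like this. Names and signatures are illustrative only, not mergekit's actual API:

```python
import torch

def nm_prune(tensor: torch.Tensor, n: int, m: int):
    """Keep the n largest-magnitude values in every group of m
    consecutive weights; return the pruned tensor and a bool mask."""
    flat = tensor.flatten()
    pad = (-flat.numel()) % m  # pad so length is a multiple of m
    groups = torch.cat([flat, flat.new_zeros(pad)]).view(-1, m)
    # indices of the n largest |w| in each group of m
    topk = groups.abs().topk(n, dim=1).indices
    mask = torch.zeros_like(groups, dtype=torch.bool).scatter_(1, topk, True)
    mask = mask.flatten()[: flat.numel()].view_as(tensor)
    return tensor * mask, mask

def cabs_merge(base, task_vectors, weights, n=1, m=4):
    """Sequential conflict-aware n:m sparsification and weighted sum.
    task_vectors must already be ordered by the desired pruning_order."""
    merged = base.clone()
    claimed = torch.zeros_like(base, dtype=torch.bool)  # cumulative mask
    for tv, lam in zip(task_vectors, weights):
        tv = tv.masked_fill(claimed, 0.0)   # zero out already-claimed params
        pruned, mask = nm_prune(tv, n, m)
        merged += lam * pruned              # weighted aggregation
        claimed |= mask                     # record what this model claimed
    return merged
```

Because each model's mask is accumulated into `claimed` before the next model is pruned, later models in the order can only claim parameters the earlier ones left free, which is the conflict-avoidance step.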

@github-actions

github-actions bot commented May 9, 2025

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

@zongzhenyang
Author

I have read the CLA Document and I hereby sign the CLA

@cg123
Collaborator

cg123 commented May 10, 2025

Thanks for the PR! I'd love to have your method in mergekit.

Two things:

  • Could you please run the pre-commit hook to format the code and push the changes?
  • Would you like to add your method to the table in the README?

@CasualAutopsy

This is an amazing merging method; however, it needs better support for gradients. It is technically compatible if you supply as many values to the array as there are blocks, but it would be nice to have it round interpolated values to the closest whole number so it doesn't throw an error.
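A minimal sketch of the rounding being asked for, assuming per-block `n` values are linearly interpolated between the gradient's anchor points (the function name and signature are hypothetical, not part of this PR):

```python
def interpolated_n_vals(anchors, num_blocks):
    """Linearly interpolate per-block n values between anchor points,
    rounding each result to the nearest whole number so that n:m
    pruning always receives integers instead of fractional values."""
    if num_blocks == 1:
        return [round(anchors[0])]
    vals = []
    for i in range(num_blocks):
        # position of this block along the anchor axis
        pos = i / (num_blocks - 1) * (len(anchors) - 1)
        lo = int(pos)
        hi = min(lo + 1, len(anchors) - 1)
        frac = pos - lo
        vals.append(round(anchors[lo] * (1 - frac) + anchors[hi] * frac))
    return vals
```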


@cursor cursor bot left a comment


This PR is being reviewed by Cursor Bugbot


        return tensor.clone(), torch.ones_like(tensor, dtype=torch.bool)
    if n_val < 0 or n_val > m_val:
        logging.error(f"Tensor {original_shape}: n_val ({n_val}) invalid.")
        return tensor.clone(), torch.ones_like(tensor, dtype=torch.bool)

Bug: Validation Failure Causes Incorrect Masking

When validation fails (m_val <= 0 or invalid n_val), the function returns torch.ones_like(tensor, dtype=torch.bool) as the mask. This means ALL parameters are marked as retained/claimed. In the CABS conflict-aware algorithm, this causes the cumulative_param_mask to be filled with True values, preventing subsequent models from claiming any parameters and breaking the conflict-aware merging logic. The function should either return torch.zeros_like (no parameters retained) or raise an exception on validation failure.
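A sketch of the fix the bot suggests, following the variable names in the quoted snippet; this is not the merged patch, just one way the early-return could look:

```python
import logging
import torch

def nm_prune_with_validation(tensor, n_val, m_val):
    """On invalid n/m values, keep the tensor untouched but report an
    all-False mask, so nothing is marked as retained/claimed and the
    cumulative_param_mask downstream is left unaffected."""
    if m_val <= 0 or n_val < 0 or n_val > m_val:
        logging.error(
            f"Tensor {tuple(tensor.shape)}: invalid n:m pair ({n_val}:{m_val})."
        )
        # zeros, not ones: no parameters retained on failure
        return tensor.clone(), torch.zeros_like(tensor, dtype=torch.bool)
    ...  # normal n:m pruning path
```

Raising a `ValueError` here instead would surface misconfiguration immediately; returning an all-False mask merely keeps the merge running without silently blocking later models.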


weight: 0.4
n_val: 8 # Per-model n
m_val: 32 # Per-model m
# n_val and m_val not set for zephyr_beta, will use global defaults

The comment on line 13 is incorrect - it states that n_val and m_val are not set for zephyr-7b-beta, but these parameters are actually defined in lines 11-12 directly above the comment. This comment should either be removed or corrected to accurately reflect the configuration.

Suggested change
# n_val and m_val not set for zephyr_beta, will use global defaults
# n_val and m_val are set for zephyr_beta above

Spotted by Graphite Agent

