handle inputs from Siglip/Siglip2 non-automapped encoder layers #41930
base: main
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
lgtm, I would just revert the registry default change. Shouldn't be needed and I don't wanna risk breaking executorch (or similar trace ops)
        
          
src/transformers/utils/generic.py (Outdated)

  # _can_record_outputs is None by default
- capture_flags = _CAN_RECORD_REGISTRY.get(str(self.__class__)) or {}  # there is a weak ref for executorch
+ capture_flags = _CAN_RECORD_REGISTRY.get(str(self.__class__)) or getattr(self, "_can_record_outputs", {})
I think we can revert this, the registry should already be either applied to the class or we get the default {}. See
transformers/src/transformers/modeling_utils.py, line 1851 in 4d0b675:

_CAN_RECORD_REGISTRY[str(self.__class__)] = self._can_record_outputs  # added for executorch support only

And I honestly don't wanna risk breaking executorch
yeah the latter reason is compelling 😬
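For context, here is a minimal sketch of the two lookup strategies discussed in this thread. _CAN_RECORD_REGISTRY and _can_record_outputs are the names from the hunk above; the toy class and its values are invented purely for illustration and are not the real transformers code.

# Toy illustration only: lookup order of the registry-only version vs. the
# proposed getattr fallback. Not the real transformers implementation.
_CAN_RECORD_REGISTRY = {}  # keyed by str(cls); populated at model init for executorch


class ToyModel:
    _can_record_outputs = {"hidden_states": "ToyEncoderLayer"}

    def __init__(self, register=True):
        if register:
            # mirrors what modeling_utils does at init (see the line referenced above)
            _CAN_RECORD_REGISTRY[str(self.__class__)] = self._can_record_outputs

    def flags_registry_only(self):
        # original behaviour: empty dict if the class was never registered
        return _CAN_RECORD_REGISTRY.get(str(self.__class__)) or {}

    def flags_with_fallback(self):
        # proposed change: fall back to the instance attribute
        return _CAN_RECORD_REGISTRY.get(str(self.__class__)) or getattr(self, "_can_record_outputs", {})


unregistered = ToyModel(register=False)
print(unregistered.flags_registry_only())   # {} -> nothing would be recorded
print(unregistered.flags_with_fallback())   # {'hidden_states': 'ToyEncoderLayer'}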
return self.vision_model.embeddings.patch_embedding

@check_model_inputs(tie_last_hidden_states=False)
@auto_docstring
Nit: I guess this was already inherited?
uh, interesting. yes it should have been, entirely unrelated to this lol
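As a toy illustration of the "already inherited" point: a decorator applied to a parent class's forward only stays in effect for a subclass as long as the subclass does not redefine forward, which is why an overriding entrypoint needs the decorator applied again. The decorator and classes below are stand-ins, not the real check_model_inputs.

import functools


def record_calls(fn):
    # toy stand-in for a decorator such as check_model_inputs
    @functools.wraps(fn)
    def wrapper(self, *args, **kwargs):
        wrapper.calls += 1
        return fn(self, *args, **kwargs)

    wrapper.calls = 0
    return wrapper


class ParentModel:
    @record_calls
    def forward(self, x):
        return x


class InheritingModel(ParentModel):
    pass  # no forward override: the decorated parent forward is used


class OverridingModel(ParentModel):
    def forward(self, x):  # redefined forward: the decorator has to be re-applied here
        return x * 2


InheritingModel().forward(1)
OverridingModel().forward(1)
print(ParentModel.forward.calls)  # 1 -> only the inheriting model went through the wrapper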
run-slow: siglip, siglip2
This comment contains run-slow, running the specified jobs: models: ['models/siglip', 'models/siglip2']
…ce/transformers into siglip_and_check_model_changes
ended up down the rabbit hole of wrong
[For maintainers] Suggested jobs to run (before merge): run-slow: siglip, siglip2
Awkward situation with the auto model, can you also check out clip? We should face very similar issues there as well + do we need to adjust tests maybe?
@unittest.skip(reason="This test is broken on A10 multi runners for now")
def test_multi_gpu_data_parallel_forward(self):
    pass
We shouldn't skip these; it will be hard to revert because everyone will forget, imo
nn.init.normal_(module.fc1.bias, std=1e-6)
nn.init.normal_(module.fc2.bias, std=1e-6)
- elif isinstance(module, SiglipMultiheadAttentionPoolingHead):
+ elif "MultiheadAttentionPoolingHead" in module.__class__.__name__:
Seems unrelated, no?
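If the intent of that hunk is what it looks like, a shared _init_weights would have to match the pooling head of both Siglip and Siglip2. The snippet below is only a guess at why the string check was chosen over isinstance; the classes are toy stand-ins patterned on the real names.

# Toy stand-ins patterned on SiglipMultiheadAttentionPoolingHead /
# Siglip2MultiheadAttentionPoolingHead; not the real classes.
class SiglipMultiheadAttentionPoolingHead:
    pass


class Siglip2MultiheadAttentionPoolingHead:
    pass


for module in (SiglipMultiheadAttentionPoolingHead(), Siglip2MultiheadAttentionPoolingHead()):
    # isinstance(module, SiglipMultiheadAttentionPoolingHead) matches only the first class,
    # while the name-based check matches any *MultiheadAttentionPoolingHead variant.
    print("MultiheadAttentionPoolingHead" in module.__class__.__name__)  # True, True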
What does this PR do?
Should fix #41929. The check_model_inputs / can_record_outputs interaction is not always trivial, and models with several entrypoints such as VisionModel vs VisionTransformer are missing some; adding it here. Also added a modification in generic to make sure the flag was captured, not 100% sure it's needed.
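A rough way to exercise the entrypoint this PR decorates; the checkpoint id, the random pixel values, and the exact expectation are assumptions for a sanity check, not part of the PR.

# Sanity-check sketch, assuming the google/siglip-base-patch16-224 checkpoint; with
# check_model_inputs on SiglipVisionModel.forward, the requested outputs should come
# back populated when calling the vision model directly.
import torch
from transformers import SiglipVisionModel

model = SiglipVisionModel.from_pretrained("google/siglip-base-patch16-224")
pixel_values = torch.randn(1, 3, 224, 224)  # stand-in for a processed image

with torch.no_grad():
    out = model(pixel_values=pixel_values, output_hidden_states=True, output_attentions=True)

print(out.hidden_states is not None, out.attentions is not None)  # expect: True True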