Add DINOv3ViTForImageClassification support#41224
dimidagd wants to merge 16 commits into huggingface:main from
Conversation
Force-pushed from ebb610b to 0462fb0
molbap
left a comment
Thanks! Left two comments
Discussed the reviewers' comments; no immediate points of action, left for the future.
@molbap would you like to approve this PR?
molbap
left a comment
Sure, I just need to check `get_input_embeddings`: in most cases it's not needed to add it, and we prefer to add as little code as possible. It's not needed in Dinov2, so why here?
Otherwise LGTM, and OK to merge once this is sorted out!
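For context, the override under discussion is the standard pattern where a vision model exposes its patch-embedding module so generic utilities can locate it. The sketch below is illustrative, not the merged code: `TinyViTForImageClassification` and its layer sizes are hypothetical stand-ins.

```python
import torch
from torch import nn

# Hypothetical minimal sketch of the get_input_embeddings pattern being
# debated; names and sizes are illustrative, not the actual DINOv3 code.
class PatchEmbeddings(nn.Module):
    def __init__(self, num_channels=3, hidden_size=16, patch_size=4):
        super().__init__()
        self.projection = nn.Conv2d(
            num_channels, hidden_size, kernel_size=patch_size, stride=patch_size
        )

    def forward(self, pixel_values):
        # (batch, hidden, h', w') -> (batch, num_patches, hidden)
        return self.projection(pixel_values).flatten(2).transpose(1, 2)

class TinyViTForImageClassification(nn.Module):
    def __init__(self, num_labels=2, hidden_size=16):
        super().__init__()
        self.embeddings = PatchEmbeddings(hidden_size=hidden_size)
        self.classifier = nn.Linear(hidden_size, num_labels)

    def get_input_embeddings(self):
        # The override in question: expose the patch-embedding module so
        # shared utilities (and tests copied from Dinov2) can find it.
        return self.embeddings

    def forward(self, pixel_values):
        hidden = self.embeddings(pixel_values).mean(dim=1)  # mean-pool tokens
        return self.classifier(hidden)

model = TinyViTForImageClassification()
logits = model(torch.randn(1, 3, 8, 8))
print(tuple(logits.shape))  # (1, 2)
```

The reviewer's point is that if nothing in the library actually calls this accessor for the model, the override (and its test) is extra maintenance surface.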
@molbap what is the process for merging PRs after they have been approved? Trying to understand the contribution workflow better. Best
molbap
left a comment
I have a pending question about which test was failing for the embeddings, please let me know! Apart from that, on the review process: once the reviewers have approved, we ping the core maintainers to approve and merge your PR!
This one in particular launched an internal discussion, since we're trying to minimize the maintenance surface (see #41450), but that will wait for a follow-up PR.
Force-pushed from fce9f88 to 548597e
Here is the failing test: https://app.circleci.com/jobs/github/huggingface/transformers/1982122
On second thought, I removed the associated test in 1fb6000. Looking forward to your feedback!
Force-pushed from 1fb6000 to 416d0b2
@molbap upon rereading your comments, I think you implied that adding `get_input_embeddings` should not be necessary. Here is the failing test, which was copied over from dinov2: https://app.circleci.com/jobs/github/huggingface/transformers/1993707
Hi @dimidagd, circling back to this: this is to support the classification head released by Meta for dinov3, right? In that case, a small test would also be needed that loads that head and checks it works, to avoid future regressions. I'm referring to this: https://github.com/facebookresearch/dinov3?tab=readme-ov-file#pretrained-heads---image-classification. That way there'd be more ground to support the implementation of a new head!
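The requested regression test would load Meta's released head via `AutoModelForImageClassification.from_pretrained(...)` and pin expected logits. Since that 7B checkpoint is far too large to fetch here, the sketch below only shows the test shape with a hypothetical `TinyClassifier` stand-in; the class names and sizes are assumptions, not the real checkpoint.

```python
import unittest
import torch
from torch import nn

# Hedged sketch of the regression test being requested. The real version
# would call AutoModelForImageClassification.from_pretrained on Meta's
# released classification head; TinyClassifier is a hypothetical stand-in
# so the sketch stays self-contained and runnable.
class TinyClassifier(nn.Module):
    def __init__(self, hidden_size=8, num_labels=3):
        super().__init__()
        self.head = nn.Linear(hidden_size, num_labels)

    def forward(self, features):
        return self.head(features)

class ClassificationHeadRegressionTest(unittest.TestCase):
    def test_head_loads_and_predicts(self):
        torch.manual_seed(0)
        model = TinyClassifier()
        model.eval()
        with torch.no_grad():
            logits = model(torch.randn(1, 8))
        # Shape check only here; the real test would also compare a logits
        # slice against known-good values from the released checkpoint.
        self.assertEqual(logits.shape, torch.Size([1, 3]))

suite = unittest.TestLoader().loadTestsFromTestCase(ClassificationHeadRegressionTest)
result = unittest.TextTestRunner(verbosity=0).run(suite)
print(result.wasSuccessful())
```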
@molbap I will see if I can find the time for this. What are your thoughts on the `get_input_embeddings` test?
Hey @dimidagd, thanks a lot for working on this! Meta has three checkpoint heads: image classification (a linear layer), object detection (DINOv3+DETR), and depth estimation (DINOv3+DPT). Would you like to load the image-classification weights and push them to the Hub for people's convenience? We have DETR and DPT officially supported in transformers. Would you be down to convert the depth and detector checkpoints too?
Hi @merveenoyan! My dev machine is a Codespace with limited memory, so I can't even load their checkpoints. Do you have access to resources I could use?
@dimidagd can you use a Colab? 👀 Using small checkpoints to validate the implementation is OK!
@merveenoyan hi again, Meta released the classification head only for the 7B model. I can't fit that on Colab resources either (GPU or CPU). I can't imagine implementing such tests from a Colab env, to be honest.
@dimidagd Just in case, we have a
Force-pushed from 63639c5 to 2c0786b
- Slow test checking logits and the predicted class on the COCO cat sample
- Migrated tests from Dinov2
…dinov3`. When the backbone is stored under the wrong attribute, both `AutoModelForImageClassification.from_pretrained` and `DINOv3ViTForImageClassification` created from a headless checkpoint fail to load weights correctly because the state dict cannot map the backbone parameters. This change aligns the class with the expected naming used by the loader utilities and restores correct weight loading behavior.
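The failure mode described in that commit message can be reproduced in miniature: PyTorch derives state-dict keys from attribute names, so a backbone stored under the wrong attribute leaves its parameters unmatched by the checkpoint. The sketch below is illustrative only; `Backbone`, `WrongName`, and `RightName` are hypothetical stand-ins, not the transformers classes.

```python
import torch
from torch import nn

# Illustrative sketch (not the actual transformers code) of the bug fixed
# above: state-dict keys come from attribute names, so the backbone
# attribute must match the prefix used in the checkpoint.
class Backbone(nn.Module):
    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(4, 4)

class WrongName(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = Backbone()  # produces keys "backbone.proj.*"

class RightName(nn.Module):
    def __init__(self):
        super().__init__()
        self.dinov3 = Backbone()  # produces keys "dinov3.proj.*"

# A headless checkpoint whose keys use the expected "dinov3" prefix.
checkpoint = {f"dinov3.{k}": v for k, v in Backbone().state_dict().items()}

missing, unexpected = WrongName().load_state_dict(checkpoint, strict=False)
print(missing)    # backbone weights stay randomly initialized
missing2, unexpected2 = RightName().load_state_dict(checkpoint, strict=False)
print(missing2)   # [] -> every checkpoint tensor maps onto the model
```

With the mismatched name, loading silently leaves the backbone untrained (under `strict=False`), which is exactly why both `AutoModelForImageClassification.from_pretrained` and the headless-checkpoint path broke.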
Force-pushed from ea1c1fb to fb8d4c4
run-slow: dinov3_vit
This comment contains models: ["models/dinov3_vit"]
CI Results: Model CI Report (❌ failed tests)
Force-pushed from 73c9fca to 83ca9ca
@molbap I moved the tests to CPU using the large runner; I had a GPU OOM error before, so hopefully it has enough RAM to run the test. And by the way, tests based on
Force-pushed from 83ca9ca to 6914a43
Force-pushed from 6914a43 to 2e9a1c2
@molbap let me know if any further improvements are needed on this :)
@molbap pinging you to hear more about this PR :)
[For maintainers] Suggested jobs to run (before merge): run-slow: auto, dinov3_vit
View the CircleCI test summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=41224&sha=466177
molbap
left a comment
Hello! Now that this branch is merged with main, some things have changed a bit in terms of code quality. The rest is OK.
pixel_values: Optional[torch.Tensor] = None,
labels: Optional[torch.Tensor] = None,
**kwargs: Unpack[TransformersKwargs],
With v5 around the corner: you should now run `make style && make fix-repo` before pushing to make CI happy (or at least happier). In this particular occurrence we don't use `Optional` anymore, as you can see in the failed tests (defaulting to `None` and using `|` is enough).
self.assertIsNotNone(model)

@slow
def test_model_for_image_classification_from_pretrained(self):
This will also require the large accelerator, no? Let's remove it; it's covered by the integration test below.
Let's add an example to the usage snippet in the docs, showing the usage now possible with the image classification head, using the checkpoint converted from Meta's.
@yonigozlan @molbap