Support `.to(device)` or Device Aware Handling for Segmentation Labels in `EOMTImageProcessor`

### Feature request

Enable device-aware handling of *segmentation label tensors* (specifically `class_labels` and `mask_labels`) returned by `EOMTImageProcessor`.

Currently, the processor outputs *lists of tensors*, which cannot be moved to a device using `.to(device)`, leading to device mismatches during training.

### Motivation

I’ve been fine-tuning *EOMT* on the `segments/sidewalk-semantic` dataset using the Transformers image processor.
Here is [the code](https://gist.github.com/ariG23498/f4eb5ce6fe57f474225f01741e035c24) reproducing the issue.


### Your contribution

Happy to help brainstorm and work on this if need be.

CC: @merveenoyan @NielsRogge @molbap 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support `.to(device)` or Device Aware Handling for Segmentation Labels in `EOMTImageProcessor` #42205

Feature request

Motivation

Your contribution

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Support .to(device) or Device Aware Handling for Segmentation Labels in EOMTImageProcessor #42205

Description

Feature request

Motivation

Your contribution

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Support `.to(device)` or Device Aware Handling for Segmentation Labels in `EOMTImageProcessor` #42205