As @ajjimeno mentioned, the encoder is not supported on MPS, but the decoder is the bottleneck, and it can run on a CUDA or MPS backend for GPU acceleration. The MPS backend is supported natively by the PyTorch framework (see the PyTorch backend support docs).
The change would be to check whether MPS is available, split the encoder and decoder instead of calling `model.generate` when MPS is detected, and place the decoder's computational graph on the `mps` device (see the Hugging Face example on the MPS backend).
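A minimal sketch of the device check described above, assuming PyTorch 1.12+ (where `torch.backends.mps` was introduced); the `model.encoder`/`model.decoder` attribute names are illustrative and depend on the actual model class:

```python
import torch

# Prefer CUDA, fall back to MPS, then CPU.
if torch.cuda.is_available():
    device = torch.device("cuda")
elif torch.backends.mps.is_available():
    device = torch.device("mps")
else:
    device = torch.device("cpu")

# Hypothetical split: keep the encoder on CPU (unsupported on MPS),
# move only the decoder to the accelerated device.
# encoder_out = model.encoder(inputs)            # runs on CPU
# model.decoder.to(device)                       # decoder on CUDA/MPS
# logits = model.decoder(encoder_out.to(device)) # accelerated decoding
print(device.type)
```

The decoder inputs must be moved to the same device as the decoder before the forward pass, since PyTorch does not transfer tensors between devices implicitly.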