MPT 30B inference on Mac M1 #353
RonanKMcGovern started this conversation in Ideas
Is it realistic to get inference running on a Mac M1 with results of similar quality to a GPU?

I find that 7B and 13B models are not good enough to work reliably with functions. I also like that MPT has an extendable context, unlike Falcon and LLaMA.

If I were to try to get MPT 30B running, could I bootstrap from the existing LLaMA work? Thanks.
Replies: 1 comment

This is now supported in llama.cpp: ggml-org/llama.cpp#3417
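For anyone landing here later, here is a minimal sketch of what running a quantized MPT model on Apple Silicon could look like, assuming the llama-cpp-python bindings (the thread itself only confirms support in llama.cpp proper, via ggml-org/llama.cpp#3417) and a hypothetical GGUF file name. A ~4-bit quant of a 30B model is roughly 18-20 GB, so it fits more comfortably on a 32 GB M1 than on a 16 GB one:

```python
# Sketch only: assumes the llama-cpp-python package and a locally downloaded,
# pre-quantized GGUF build of MPT-30B (the file name below is illustrative).
from llama_cpp import Llama

llm = Llama(
    model_path="mpt-30b.Q4_K_M.gguf",  # hypothetical ~4-bit quantized model file
    n_ctx=4096,       # MPT's ALiBi positional scheme allows extended context
    n_gpu_layers=-1,  # offload all layers to Metal on Apple Silicon
)

# Simple completion call; the response follows the OpenAI-style dict layout
# that llama-cpp-python returns from create_completion.
out = llm("Q: Why does ALiBi allow longer contexts? A:", max_tokens=64)
print(out["choices"][0]["text"])
```

The same model file also works with the llama.cpp CLI directly; the Python wrapper is just one convenient way to drive it.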