Running on a MacBook 2019 (Intel i9 + AMD Radeon Pro 5500) #2119
Replies: 5 comments 7 replies
-
The assert suggests that you may be using a type of quantization that is not yet supported in Metal: Lines 757 to 766 in 31cfbb1
-
I'm pretty sure that your Intel Mac with an AMD GPU does not support Metal. The specs list the supported APIs as DirectX, OpenGL, OpenCL and Vulkan. I'd suggest you try a CLBlast build, which uses OpenCL. The good news is that the Vicuna model you have should work with CLBlast.
-
Hi i run But when i run
I think this happens because on my system, as on older Macs, RAM and video memory are separate, and 8GB of video memory is not enough for Metal to run a 13B model. New Macs have unified memory, and on a Mac with 16GB of RAM you can run a 13B model with Metal.
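To put rough numbers on that, here is a minimal sketch of the arithmetic, not a measurement. It assumes a nominal 13e9 parameters, a single quantization format for every tensor, and the usual ggml block layouts for the bits-per-weight figures; the helper name `weight_gib` is made up for illustration. It counts weights only, so the KV cache and scratch buffers come on top.

```python
# Rough weight-memory estimate for a quantized model (weights only).
# Assumption: every tensor uses the same format; real files mix formats.
BITS_PER_WEIGHT = {
    "F16": 16.0,
    "Q8_0": 8.5,  # 32 int8 weights + one fp16 scale = 34 bytes per block
    "Q4_0": 4.5,  # 32 4-bit weights + one fp16 scale = 18 bytes per block
}


def weight_gib(n_params: float, fmt: str) -> float:
    """Approximate size of the weights in GiB (hypothetical helper)."""
    return n_params * BITS_PER_WEIGHT[fmt] / 8 / 2**30


if __name__ == "__main__":
    n_params = 13e9   # nominal "13B"
    vram_gib = 8.0    # the 8GB of video memory mentioned above
    for fmt in BITS_PER_WEIGHT:
        size = weight_gib(n_params, fmt)
        print(f"{fmt}: {size:.1f} GiB weights, "
              f"{vram_gib - size:+.1f} GiB left of {vram_gib:.0f} GiB")
```

Under these assumptions the Q4_0 weights alone come to about 6.8 GiB, which leaves little of an 8GB card for the KV cache and other buffers, while Q8_0 and F16 do not fit at all.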
-
My MacBook Pro 2018 can run Metal. Apple's Metal documentation says:
-
Reviving a dead thread, but my specs are similar to yours except for a W5700X. The examples run, but ggml_metal_init seems to label all floating-point operations as not supported, and it falls back to the CPU. You can try the Vulkan build with: I can confirm this does use the GPU, but the example models' output is bugged. I tried Mistral-7B-Instruct-v0.3-Q4_K_M.gguf, which outputs Spanish. See issue: #10984. Browsing through numerous threads and issues, it does not seem like anyone has gotten it to work on the Mac/Intel/AMD combo while actually utilising the GPU. I could be mistaken.
-
Situation: it runs on the CPU. When I try to run it on the GPU (after compiling correctly with Metal + OpenBLAS), it reaches a point where it breaks and the model does not run.
My machine:
Does anyone have a clue what I'm missing?