You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Bitnet.cpp seems to be a great lightweight model that would benefit from the hardware acceleration benefits available from many Rockchip processors. However given the unique nature of the LLM architecture it seems to be built using a custom framework outside of ONNX, Tensorflow, etc. Is there a way to run bitnet using the available NPU hardware acceleration?