Releases · ggml-org/llama.cpp
b5825
batch : add n_used count (#14512)
b5824
CANN: Replace aclrtMemsetSync with aclnnInplaceZero operator (#14002)
Co-authored-by: luyuhong <[email protected]>
b5823
ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)
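For context, GEGLU is a gated GELU: the input row is split in half along the feature dimension, one half is passed through GELU and used to scale the other. The _ERF and _QUICK suffixes distinguish the exact erf-based GELU from the sigmoid approximation. A minimal element-wise sketch, assuming the first half is the one that gets activated (the split/gating convention here is an assumption, not taken from the PR):

```cpp
#include <cmath>
#include <cstddef>

// Exact GELU using erf: 0.5 * x * (1 + erf(x / sqrt(2)))
static float gelu_erf(float x) {
    return 0.5f * x * (1.0f + std::erf(x / std::sqrt(2.0f)));
}

// "Quick" GELU approximation: x * sigmoid(1.702 * x)
static float gelu_quick(float x) {
    return x / (1.0f + std::exp(-1.702f * x));
}

// GEGLU over a row of 2*n floats: the first n values are activated,
// the second n values gate them (which half is gated is an assumption).
static void geglu_row(const float * src, float * dst, size_t n, bool quick) {
    const float * a = src;      // activated half
    const float * b = src + n;  // gating half
    for (size_t i = 0; i < n; ++i) {
        const float g = quick ? gelu_quick(a[i]) : gelu_erf(a[i]);
        dst[i] = g * b[i];
    }
}
```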
b5822
opencl : broadcast for soft_max (#14510)
b5821
vulkan: support mixed/deepseekR1 FA head sizes (#14509)
* vulkan: better parameterize FA by head sizes
* vulkan: support mixed/deepseekR1 FA head sizes
b5820
ggml: backward pass for split swiglu (#14483)
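SwiGLU gates one half of the input with the SiLU of the other half; "split" here refers to that halving of the feature dimension. A rough sketch of the forward value and the gradients a backward pass has to produce, assuming y = silu(a) * b with a and b the two halves (the gating convention is an assumption, not taken from the PR):

```cpp
#include <cmath>

static float sigmoid(float x) { return 1.0f / (1.0f + std::exp(-x)); }

// Forward: y = silu(a) * b, with silu(a) = a * sigmoid(a)
static float swiglu(float a, float b) {
    return a * sigmoid(a) * b;
}

// Backward: given dL/dy, produce dL/da and dL/db.
// d silu(a)/da = sigmoid(a) * (1 + a * (1 - sigmoid(a)))
static void swiglu_grad(float a, float b, float dy, float & da, float & db) {
    const float s     = sigmoid(a);
    const float silu  = a * s;
    const float dsilu = s * (1.0f + a * (1.0f - s));
    da = dy * b * dsilu;
    db = dy * silu;
}
```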
b5819
Fix conditional enabling following arch checks for ggml-sycl (#14504)
Signed-off-by: nscipione <[email protected]>
b5817
kv-cache : use ggml_set_rows (#14285)
* kv-cache : use ggml_set_rows
* graph : separate k and v indices
* cont : remove redundant ifs
* kv-cache : improve find_slot impl
* kv-cache : bounds-check when accessing slot_info indices
* kv-cache : add comments
* ggml : add TODOs for adding GGML_OP_SET_ROWS support in the backends
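The idea behind a set-rows operation is to scatter rows of a source tensor into arbitrary row positions of a destination, which lets the KV cache write new K/V entries into whichever slots find_slot picked without requiring them to be contiguous. A conceptual sketch of those semantics (plain C++, not the actual ggml API or its signature):

```cpp
#include <cstdint>
#include <cstring>
#include <vector>

// Scatter: for each source row r, copy it into dst at row indices[r].
// dst holds n_dst_rows * n_cols floats, src holds indices.size() * n_cols.
static void set_rows(std::vector<float> & dst, size_t n_cols,
                     const std::vector<float> & src,
                     const std::vector<int64_t> & indices) {
    for (size_t r = 0; r < indices.size(); ++r) {
        const int64_t i = indices[r]; // destination slot chosen by the cache
        std::memcpy(&dst[i * n_cols], &src[r * n_cols], n_cols * sizeof(float));
    }
}
```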
b5816
ggml : fix FA mask dim 2 and 3 (#14505)
* ggml : fix FA mask dim 2 and 3
* backends : mark batched FA as unsupported in CUDA and Vulkan
* vulkan : disable FA for mask->ne[2] != 1
b5815
ggml : remove kompute backend (#14501)