-
Notifications
You must be signed in to change notification settings - Fork 12k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
llama : add thread safety test
devops
improvements to build systems and github actions
testing
Everything test related
#14035
opened Jun 5, 2025 by
slaren
Loading…
sycl: Adding additional cpy dbg print output
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14034
opened Jun 5, 2025 by
ShanoToni
Loading…
cuda : fix device sync on buffer clear
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14033
opened Jun 5, 2025 by
slaren
Loading…
cpu: Update RISC-V condition to require GCC version 14 or higher
ggml
changes relating to the ggml tensor library for machine learning
#14032
opened Jun 5, 2025 by
Ghosts381937
Loading…
llama : support qwen3 rerank and embeddings
python
python script changes
#14029
opened Jun 5, 2025 by
ngxson
Loading…
ggml-cpu: fix uncaught underscore terminators for s390x
ggml
changes relating to the ggml tensor library for machine learning
#14023
opened Jun 5, 2025 by
taronaeo
Loading…
tests : add test-tokenizers-repo
testing
Everything test related
#14017
opened Jun 4, 2025 by
CISC
Loading…
server: Enable mtmd in llama-server
/completion
endpoint
examples
server
#14016
opened Jun 4, 2025 by
92MING
Loading…
llama: Attempt to add ModernBert
python
python script changes
#14014
opened Jun 4, 2025 by
huydt84
Loading…
opencl: preliminary support for Q4_0 mul_mat_id using matvec
ggml
changes relating to the ggml tensor library for machine learning
#14003
opened Jun 4, 2025 by
lhez
Loading…
[CANN]:Replace aclrtMemsetSync with InplaceZero operator for zero tensor creation
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14002
opened Jun 4, 2025 by
luyhcsu
Loading…
chore(server): split context-server to its own file
examples
server
#13987
opened Jun 3, 2025 by
mudler
Loading…
llama : allow building all tests on windows when not using shared libs
devops
improvements to build systems and github actions
testing
Everything test related
#13980
opened Jun 2, 2025 by
slaren
Loading…
sycl: GGML_SYCL_DISABLE_OPT on by default for all Intel Devices
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13973
opened Jun 2, 2025 by
ShanoToni
Loading…
ci: add LoongArch cross-compile build
devops
improvements to build systems and github actions
#13944
opened May 31, 2025 by
wojiushixiaobai
Loading…
llama : support multiple classifier outputs and labels
examples
#13940
opened May 31, 2025 by
CISC
Loading…
chat
: improve llama 3.x handling of <|python_tag|> (+ allow --special combo)
testing
[CANN]Support Acl Graph
ggml
changes relating to the ggml tensor library for machine learning
#13915
opened May 30, 2025 by
noemotiovon
•
Draft
[Ascend NPU] Enable labeler
devops
improvements to build systems and github actions
#13914
opened May 30, 2025 by
shink
Loading…
remove WIP since PR has been merged
documentation
Improvements or additions to documentation
#13912
opened May 30, 2025 by
pepijndevos
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.