Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

server : (web ui) Enable gzip compression for local storage demo Demonstrate some concept or idea, not intended to be merged examples server
#10945 opened Dec 22, 2024 by exxocism Loading…
2 tasks
vulkan: im2col and matmul optimizations for stable diffusion ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#10942 opened Dec 22, 2024 by jeffbolznv Loading…
server: allow filtering llama server response fields examples python python script changes server
#10940 opened Dec 21, 2024 by nvrxq Loading…
llama : the WPM vocabs use the CLS token as BOS
#10930 opened Dec 21, 2024 by ggerganov Loading…
Allow user to compile with any cuda version using github actions devops improvements to build systems and github actions
#10928 opened Dec 21, 2024 by jianlins Loading…
llamafile_sgemm API - INT8 implementation ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#10912 opened Dec 20, 2024 by amritahs-ibm Loading…
llama : refactor src/llama.cpp devops improvements to build systems and github actions examples server
#10902 opened Dec 19, 2024 by ggerganov Draft
3 tasks
llama : add support for Cohere2ForCausalLM python python script changes
#10900 opened Dec 19, 2024 by dranger003 Loading…
ASCII/Romanization for OuteTTS Multilingual Processing demo Demonstrate some concept or idea, not intended to be merged examples
#10894 opened Dec 19, 2024 by edwko Loading…
llama: Ensure KV cache is fully defragmented.
#10873 opened Dec 17, 2024 by jessegross Loading…
SYCL: Fixes for building SYCL backend for AMD GPUs documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10851 opened Dec 16, 2024 by lhl Loading…
vulkan: multi-row k quants ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#10846 opened Dec 16, 2024 by netrunnereve Loading…
Fix compilation on Pop!_OS 22.04 LTS CUDA ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#10835 opened Dec 15, 2024 by mika314 Loading…
add ggml_backend_sched_dump_dot ggml changes relating to the ggml tensor library for machine learning
#10825 opened Dec 14, 2024 by foldl Loading…
Bamba architecture Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#10810 opened Dec 12, 2024 by gabe-l-hart Draft
3 tasks
server: bench: minor fixes examples performance Speed related topics python python script changes server
#10765 opened Dec 10, 2024 by phymbert Draft
Cuda build doc documentation Improvements or additions to documentation
#10743 opened Dec 10, 2024 by YannFollet Loading…
more perfo with llamafile tinyblas on x86_64. examples ggml changes relating to the ggml tensor library for machine learning python python script changes script Script related server
#10714 opened Dec 8, 2024 by Djip007 Draft
Make->CMake devops improvements to build systems and github actions
#10663 opened Dec 4, 2024 by jboero Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.