Pull requests: ggml-org/llama.cpp

Pull requests list

dummy PR [labels: ggml]
#14124 opened Jun 11, 2025 by Vithulep
Update multimodal.md [labels: documentation]
#14122 opened Jun 11, 2025 by ddpasa
Pass --keep to llama-server [labels: examples, server]
#14120 opened Jun 11, 2025 by MightyAlex200
kv-cache : relax SWA masking condition
#14119 opened Jun 11, 2025 by ggerganov
llama-model : add dots.llm1 architecture support (#14044) [labels: python]
#14118 opened Jun 11, 2025 by Noeda
ggml: aarch64: Implement SVE Kernels for Int 8 Quantization [labels: ggml]
#14117 opened Jun 11, 2025 by Vithulep
vulkan: Better thread-safety for command pools/buffers [labels: ggml, Vulkan]
#14116 opened Jun 11, 2025 by jeffbolznv
scripts: Fix remote option in Windows (#14102) [labels: python]
#14100 opened Jun 10, 2025 by pqnet
Bump ROCm versions, re-enable in GHA [labels: devops]
#14098 opened Jun 10, 2025 by gremlinofthemysticarts
server: add model alias presets [labels: examples, python, server]
#14083 opened Jun 9, 2025 by am17an
Implement GGML_CPU_ALL_VARIANTS for ARM [labels: ggml]
#14080 opened Jun 9, 2025 by ckastner
llama: automatically set runtime parameters such as --n-gpu-layers to fit VRAM [labels: ggml]
#14067 opened Jun 8, 2025 by JohannesGaessler Draft
vulkan : fix build failure caused by vulkan-shaders-gen install [labels: ggml, Vulkan]
#14047 opened Jun 6, 2025 by AsbjornOlling
ggml-cpu: optimise assembly calls for hsum on s390x
#14037 opened Jun 5, 2025 by taronaeo
llama : add thread safety test [labels: devops, ggml, testing]
#14035 opened Jun 5, 2025 by slaren
sycl: Adding additional cpy dbg print output [labels: ggml, SYCL]
#14034 opened Jun 5, 2025 by ShanoToni
cpu: Update RISC-V condition to require GCC version 14 or higher [labels: ggml]
#14032 opened Jun 5, 2025 by Ghosts381937
llama : support qwen3 rerank and embeddings [labels: examples, python, server]
#14029 opened Jun 5, 2025 by ngxson
ggml-cpu: fix uncaught underscore terminators for s390x [labels: ggml]
#14023 opened Jun 5, 2025 by taronaeo
tests : add test-tokenizers-repo [labels: testing]
#14017 opened Jun 4, 2025 by CISC
llama: Attempt to add ModernBert [labels: python]
#14014 opened Jun 4, 2025 by huydt84
llama-chat : Do not throw when tool parsing fails
#14012 opened Jun 4, 2025 by p1-0tr