Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

ci : fix rocm archive name devops improvements to build systems and github actions
#19808 opened Feb 22, 2026 by CISC Loading…
server: add --log-output option examples server
#19807 opened Feb 22, 2026 by tarruda Loading…
CUDA: add CDNA3 MFMA support for flash attention MMA kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#19806 opened Feb 22, 2026 by Jayluci4 Loading…
3 of 4 tasks
model: Add Kanana-2 model support python python script changes
#19803 opened Feb 22, 2026 by HelloKS Loading…
llama: end-to-end tests testing Everything test related
#19802 opened Feb 22, 2026 by JohannesGaessler Draft
common : add more aliases for sampler CLI params
#19797 opened Feb 22, 2026 by ddh0 Loading…
vulkan: fix coopmat1 without bf16 support ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#19793 opened Feb 22, 2026 by jeffbolznv Loading…
tools : add learning-cache tool for persistent latent context examples
#19791 opened Feb 22, 2026 by arkavo-com Loading…
7 of 10 tasks
vulkan: fix data race in mul_mat_id shader ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#19790 opened Feb 21, 2026 by jeffbolznv Loading…
jinja: correct stats for tojson and string filters jinja parser Issues related to the jinja parser testing Everything test related
#19785 opened Feb 21, 2026 by ngxson Loading…
cli : provide model with text filename examples server
#19783 opened Feb 21, 2026 by CISC Loading…
[WIP] ggml-hexagon: convert f32 to f16 - fa opt part4 ggml changes relating to the ggml tensor library for machine learning
#19780 opened Feb 21, 2026 by chraac Draft
Clean up per-thread parameter buffer pool and job submission logic ggml changes relating to the ggml tensor library for machine learning
#19772 opened Feb 20, 2026 by nikhilJain17 Draft
WIP: ggml : add NVFP4 quantization type support Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes testing Everything test related Vulkan Issues specific to the Vulkan backend
#19769 opened Feb 20, 2026 by richarddd Loading…
vulkan: check for memory overlap before doing fusion ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#19768 opened Feb 20, 2026 by jeffbolznv Loading…
Proposal: Guidelines for quantization schemes
#19762 opened Feb 20, 2026 by pwilkin Loading…
grammar : Fix grammar root symbol check bugfix fixes an issue or bug grammar Issues related to the GBNF (grammar) code
#19761 opened Feb 20, 2026 by AsbjornOlling Loading…
ProTip! Add no:assignee to see everything that’s not assigned.