-
Notifications
You must be signed in to change notification settings - Fork 15k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ci : fix rocm archive name
devops
improvements to build systems and github actions
#19808
opened Feb 22, 2026 by
CISC
Loading…
CUDA: add CDNA3 MFMA support for flash attention MMA kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#19806
opened Feb 22, 2026 by
Jayluci4
Loading…
3 of 4 tasks
common : fix improper trimming in XML parser on complete message
#19805
opened Feb 22, 2026 by
aldehir
Loading…
model: Add Kanana-2 model support
python
python script changes
#19803
opened Feb 22, 2026 by
HelloKS
Loading…
llama: end-to-end tests
testing
Everything test related
#19802
opened Feb 22, 2026 by
JohannesGaessler
•
Draft
model: add qwen3omnimoe architecture support (text-only)
#19800
opened Feb 22, 2026 by
SiaoZeng
Loading…
chat: fix llama-server image placeholder issue for PaddleOCR-VL
#19799
opened Feb 22, 2026 by
megemini
Loading…
Add model metadata loading from huggingface for use with tests requiring real model data
testing
Everything test related
#19796
opened Feb 22, 2026 by
bartowski1182
Loading…
vulkan: fix coopmat1 without bf16 support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#19793
opened Feb 22, 2026 by
jeffbolznv
Loading…
tools : add learning-cache tool for persistent latent context
examples
#19791
opened Feb 22, 2026 by
arkavo-com
Loading…
7 of 10 tasks
vulkan: fix data race in mul_mat_id shader
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#19790
opened Feb 21, 2026 by
jeffbolznv
Loading…
jinja: correct stats for tojson and string filters
jinja parser
Issues related to the jinja parser
testing
Everything test related
#19785
opened Feb 21, 2026 by
ngxson
Loading…
[WIP] ggml-hexagon: convert f32 to f16 - fa opt part4
ggml
changes relating to the ggml tensor library for machine learning
Clean up per-thread parameter buffer pool and job submission logic
ggml
changes relating to the ggml tensor library for machine learning
#19772
opened Feb 20, 2026 by
nikhilJain17
•
Draft
Document custom default webui preferences in server readme
examples
server
#19771
opened Feb 20, 2026 by
woof-dog
Loading…
WIP: ggml : add NVFP4 quantization type support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#19769
opened Feb 20, 2026 by
richarddd
Loading…
vulkan: check for memory overlap before doing fusion
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#19768
opened Feb 20, 2026 by
jeffbolznv
Loading…
Server: allow immediate reuse of http port
examples
server
#19763
opened Feb 20, 2026 by
jpm-canonical
Loading…
grammar : Fix grammar root symbol check
bugfix
fixes an issue or bug
grammar
Issues related to the GBNF (grammar) code
#19761
opened Feb 20, 2026 by
AsbjornOlling
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.