Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

common : do not wrap raw strings in schema parser for tagged parsers
#22827 opened May 8, 2026 by aldehir Contributor Loading…
Feature hexagon tri ggml changes relating to the ggml tensor library for machine learning Hexagon
#22822 opened May 7, 2026 by pdhinaka Loading…
HIP: Adds 4x packed Q8_1 activation for Q4_K_M models in MMVQ ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#22821 opened May 7, 2026 by jiachengjason Contributor Loading…
CUDA: lower-case PCI bus id, standardize for ggml ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#22820 opened May 7, 2026 by JohannesGaessler Contributor Loading…
Feature hexagon l2 norm ggml changes relating to the ggml tensor library for machine learning Hexagon
#22816 opened May 7, 2026 by pdhinaka Draft
Add flash attention MMA / Tiles to support MiMo-V2.5 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes testing Everything test related
#22812 opened May 7, 2026 by AesSedai Contributor Loading…
ggml-virtgpu: include missing mutex header ggml changes relating to the ggml tensor library for machine learning
#22810 opened May 7, 2026 by olliewalsh Contributor Loading…
ggml-webgpu: address precision issues for multimodel ggml changes relating to the ggml tensor library for machine learning WebGPU
#22808 opened May 7, 2026 by Constannnnnt Contributor Loading…
Gemma4_26B_A4B_NvFp4 hf checkpoint convert to gguf format fixes model Model specific python python script changes
#22804 opened May 7, 2026 by ynankani Contributor Loading…
ggml: use dynamic allocation for split graph inputs ggml changes relating to the ggml tensor library for machine learning
#22789 opened May 7, 2026 by AgoraPete Loading…
spec : refactor ctx Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning python python script changes server
#22787 opened May 7, 2026 by ggerganov Member Loading…
1 of 2 tasks
ggml-sycl : use malloc_shared for UMA/integrated GPU devices ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#22766 opened May 6, 2026 by vmartirosyan Loading…
Draft: ggml-opencl: Early proof-of-concept implementation of plans via command buffers ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#22764 opened May 6, 2026 by jansol Draft
2 tasks done
android: extract GgufMetadataReader factory to break cyclic dependency android Issues specific to Android examples
#22763 opened May 6, 2026 by Juste-Leo2 Contributor Loading…
[ggml] fix vulkan spv shadowing ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#22760 opened May 6, 2026 by miyanyan Loading…
ggml-opencl: add opt-in Adreno xmem F16xF32 GEMM for prefill ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#22755 opened May 6, 2026 by happyyzy Loading…
ggml-cpu: extend RVV quantization vec dot to higher VLENs ggml changes relating to the ggml tensor library for machine learning
#22754 opened May 6, 2026 by rehan-10xengineer Contributor Loading…
ProTip! no:milestone will show everything without a milestone.