Pull requests: ggml-org/llama.cpp
Pull requests list
common : do not wrap raw strings in schema parser for tagged parsers
#22827
opened May 8, 2026 by
aldehir
Contributor
server: preserve context checkpoint coverage
examples
server
#22826
opened May 8, 2026 by
jacekpoplawski
Contributor
Feature hexagon tri
ggml
changes relating to the ggml tensor library for machine learning
Hexagon
#22822
opened May 7, 2026 by
pdhinaka
HIP: Adds 4x packed Q8_1 activation for Q4_K_M models in MMVQ
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22821
opened May 7, 2026 by
jiachengjason
Contributor
CUDA: lower-case PCI bus id, standardize for ggml
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22820
opened May 7, 2026 by
JohannesGaessler
Contributor
Add flash attention MMA / Tiles to support MiMo-V2.5
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
testing
Everything test related
#22812
opened May 7, 2026 by
AesSedai
Contributor
ggml-virtgpu: include missing mutex header
ggml
changes relating to the ggml tensor library for machine learning
#22810
opened May 7, 2026 by
olliewalsh
Contributor
ggml-webgpu: address precision issues for multimodel
ggml
changes relating to the ggml tensor library for machine learning
WebGPU
#22808
opened May 7, 2026 by
Constannnnnt
Contributor
Gemma4_26B_A4B_NvFp4 hf checkpoint convert to gguf format fixes
model
Model specific
python
python script changes
#22804
opened May 7, 2026 by
ynankani
Contributor
webui: Add Import/Export of Settings configuration + improve architecture
examples
server/webui
server
#22803
opened May 7, 2026 by
allozaur
Contributor
Add new config file options for saving and loading configuration for llama tools in INI format
#22802
opened May 7, 2026 by
bartowski1182
Contributor
•
Draft
webui: page title use app name variable
examples
server/webui
server
#22801
opened May 7, 2026 by
jpm-canonical
mtmd-cli: load GPU backends before arg parsing to fix false 'no GPU' warning
examples
#22790
opened May 7, 2026 by
saga08003137
ggml: use dynamic allocation for split graph inputs
ggml
changes relating to the ggml tensor library for machine learning
#22789
opened May 7, 2026 by
AgoraPete
spec : refactor ctx
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
server
#22787
opened May 7, 2026 by
ggerganov
Member
1 of 2 tasks
convert : add --fuse-qkv flag to fuse Q/K/V into QKV during HF-to-GGUF conversion
python
python script changes
ggml-sycl : use malloc_shared for UMA/integrated GPU devices
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#22766
opened May 6, 2026 by
vmartirosyan
android: extract GgufMetadataReader factory to break cyclic dependency
android
Issues specific to Android
examples
#22763
opened May 6, 2026 by
Juste-Leo2
Contributor
server: fix /infill prompt placement after FIM_MID
examples
server
#22761
opened May 6, 2026 by
Aayush7g
[ggml] fix vulkan spv shadowing
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#22760
opened May 6, 2026 by
miyanyan
ggml-opencl: add opt-in Adreno xmem F16xF32 GEMM for prefill
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#22755
opened May 6, 2026 by
happyyzy
ggml-cpu: extend RVV quantization vec dot to higher VLENs
ggml
changes relating to the ggml tensor library for machine learning
#22754
opened May 6, 2026 by
rehan-10xengineer
Contributor
Add more wav-compatible MIME types and enhance MIME type normalization
examples
server/webui
server
#22744
opened May 6, 2026 by
guangchenli
•
Draft