Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

SYCL: add full support for ABS unary op documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17126 opened Nov 9, 2025 by shani-f Loading…
llama: introduce support for model-embedded sampling parameters python python script changes
#17120 opened Nov 9, 2025 by taronaeo Loading…
rpc : fix alloc size logic Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#17116 opened Nov 9, 2025 by ggerganov Draft
2 tasks
Refactor: convert_hf_to_gguf.py python python script changes refactoring Refactoring
#17114 opened Nov 9, 2025 by pwilkin Draft
CPU SIMD and pipeline optimizations across vec/mmq/ops/kv-cache/repack ggml changes relating to the ggml tensor library for machine learning
#17113 opened Nov 8, 2025 by NoahOksuz Loading…
batched-bench : add "separate text gen" mode examples
#17103 opened Nov 8, 2025 by ggerganov Loading…
CUDA: support F32 kernel type for CONV_TRANSPOSE_2D ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17094 opened Nov 8, 2025 by AgainstEntropy Loading…
add version to all shared object files Apple Metal https://en.wikipedia.org/wiki/Metal_(API) Ascend NPU issues specific to Ascend NPUs examples ggml changes relating to the ggml tensor library for machine learning IBM zDNN issues specific to IBM zDNN Accelerator Nvidia GPU Issues specific to Nvidia GPUs OpenCL Issues specific to the OpenCL backend SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#17091 opened Nov 7, 2025 by furrysalamander Loading…
opencl: add fastdiv and use it in set_rows, ported from cuda ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#17090 opened Nov 7, 2025 by lhez Draft
metal : enable tensor API for A19 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#17087 opened Nov 7, 2025 by ggerganov Loading…
convert: (demo) repacking compressed_tensor format of kimi-k2 python python script changes
#17083 opened Nov 7, 2025 by ngxson Draft
HIP: RDNA4 tensor core support for MMF ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17077 opened Nov 7, 2025 by zhang-hui-yulo Loading…
[RFC] ggml: new backend for API Remoting Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#17072 opened Nov 7, 2025 by kpouget Loading…
Fix NetBSD compilation error
#17068 opened Nov 7, 2025 by xinitrcn1 Loading…
Add ops needed for new hybrid models: SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17063 opened Nov 6, 2025 by pwilkin Loading…
cmake: add option to build and link BoringSSL build Compilation issues
#17062 opened Nov 6, 2025 by angt Loading…
cuda: extended MMF_ROWS_PER_BLOCK ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17051 opened Nov 6, 2025 by zhang-hui-yulo Loading…
Add MoE dynamic routing with expert caching build Compilation issues documentation Improvements or additions to documentation examples
#17044 opened Nov 6, 2025 by jmangold23 Draft
ggml-hexagon: fix test-backend-ops failures on specific binary ops ggml changes relating to the ggml tensor library for machine learning
#17042 opened Nov 6, 2025 by chraac Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.