-
Notifications
You must be signed in to change notification settings - Fork 11.7k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
metal : use FA-vec kernel up to batch size 20
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#13496
opened May 13, 2025 by
ggerganov
Loading…
metal : optimize multi-sequence FA vec kernel
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#13493
opened May 13, 2025 by
ggerganov
Loading…
llama: Add configuration presets for chat and reranking servers
#13462
opened May 12, 2025 by
heyyymonth
Loading…
mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change)
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
python
python script changes
#13460
opened May 11, 2025 by
ngxson
Loading…
scripts : support arbitrary input file formats in compare-llama-bench.py
python
python script changes
script
Script related
#13455
opened May 11, 2025 by
CISC
Loading…
CUDA: faster Deepseek FA, add Turing support
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13435
opened May 10, 2025 by
JohannesGaessler
Loading…
Break down main function in llama-server
examples
server
#13425
opened May 10, 2025 by
ericcurtin
Loading…
Update README.md for using llama.cpp in Microsoft Word locally
#13401
opened May 9, 2025 by
GPTLocalhost
Loading…
grammar: handle misplaced special regex chars [*+?]
#13391
opened May 8, 2025 by
rick-github
Loading…
sycl: simplify bin_bcast_kernel
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13383
opened May 8, 2025 by
AD2605
Loading…
musa: restore MUSA graph settings in CMakeLists.txt
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13382
opened May 8, 2025 by
yeahdongcn
•
Draft
gguf-py: Optimize python script changes
GGUFReader
read-only mode performance
python
#13378
opened May 8, 2025 by
Isotr0py
Loading…
CUDA: update build CTK version to 12.8
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13360
opened May 7, 2025 by
thevishalagarwal
Loading…
python : bump transformers version
python
python script changes
#13351
opened May 7, 2025 by
ngxson
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.