Georgi Gerganov
|
db3010bd23
vulkan : fix compile warnings on macos (#15340)
|
5 months ago |
Aaron Teo
|
ff27f80a74
ggml: initial IBM zDNN backend (#14975)
|
5 months ago |
Sigbjørn Skjæret
|
d3248d9b65
ci : fix ios-xcode-build (#15324)
|
5 months ago |
Diego Devesa
|
7aeee88cfe
ci : move ccache action to ggml-org fork (#15328)
|
5 months ago |
Johannes Gäßler
|
b07791aa1d
test-opt: fix backend support check (#15317)
|
5 months ago |
Johannes Gäßler
|
4227c9be42
CUDA: fix negative KV_max values in FA (#15321)
|
5 months ago |
Georgi Gerganov
|
df36bce667
eval-callback : stop on first NaN (#15320)
|
5 months ago |
Diego Devesa
|
f75b830647
chat : include kwargs in template example (#15309)
|
5 months ago |
Daniel Bevenius
|
7a0de96045
llama : add 18-layer model type for Gemma 3-270m (#15319)
|
5 months ago |
simevo
|
e4e915912c
devops : fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 24.04 (#15005)
|
5 months ago |
uvos
|
5ba36f6103
HIP: Cleanup hipification header (#15285)
|
5 months ago |
Aldehir Rojas
|
b204a5a234
gpt-oss: implement harmony parsing (#15181)
|
5 months ago |
Christian Kastner
|
646944cfa8
docker : Enable GGML_CPU_ALL_VARIANTS for ARM (#15267)
|
5 months ago |
Georgi Gerganov
|
1a01899b61
readme : update hot topics (#15315)
|
5 months ago |
Jeff Bolz
|
863d341eeb
vulkan: perf_logger improvements (#15246)
|
5 months ago |
Georgi Gerganov
|
d32e03f449
server : add SWA checkpoints (#15293)
|
5 months ago |
Georgi Gerganov
|
3973163bff
sync : ggml
|
5 months ago |
Jason Ni
|
5ade3000bd
ggml: fix ggml_conv_1d_dw bug (ggml/1323)
|
5 months ago |
Georgi Gerganov
|
8b2483730f
tests : remove unused includes (ggml/0)
|
5 months ago |
kallewoof
|
810b9fc8b9
perplexity : provide a helpful hint for has_cpl case in split_equal error. (#15304)
|
5 months ago |
Sigbjørn Skjæret
|
4ebd0c125b
cuda : fix GGML_CUDA_GRAPHS=OFF (#15300)
|
5 months ago |
Jonathan Graehl
|
5cdb27e091
finetune: SGD optimizer, more CLI args (#13873)
|
5 months ago |
kallewoof
|
3ea913f1ce
perplexity: give more information about constraints on failure (#15303)
|
5 months ago |
uvos
|
29c8fbe4e0
HIP: bump requirement to rocm 6.1 (#15296)
|
5 months ago |
Bas Nijholt
|
1adc9812bd
fix(nix): remove non-functional llama-cpp cachix cache from flake.nix (#15295)
|
5 months ago |
Sigbjørn Skjæret
|
b3e16665e1
server : enable -td and -tbd parameters (#15172)
|
5 months ago |
Judd
|
c24f4e2688
ggml : update `ggml_rope_multi` (#12665)
|
5 months ago |
Copilot
|
d8914fc47e
common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191)
|
5 months ago |
Aldehir Rojas
|
e885445bc1
server : filter out harmony thought messages (#15278)
|
5 months ago |
Ali Tariq
|
648ebcdb73
ci : Added CI with RISC-V RVV1.0 Hardware (#14439)
|
5 months ago |