Jason Ni
|
5ade3000bd
ggml: fix ggml_conv_1d_dw bug (ggml/1323)
|
5 months ago |
Georgi Gerganov
|
8b2483730f
tests : remove unused includes (ggml/0)
|
5 months ago |
kallewoof
|
810b9fc8b9
perplexity : provide a helpful hint for has_cpl case in split_equal error. (#15304)
|
5 months ago |
Sigbjørn Skjæret
|
4ebd0c125b
cuda : fix GGML_CUDA_GRAPHS=OFF (#15300)
|
5 months ago |
Jonathan Graehl
|
5cdb27e091
finetune: SGD optimizer, more CLI args (#13873)
|
5 months ago |
kallewoof
|
3ea913f1ce
perplexity: give more information about constraints on failure (#15303)
|
5 months ago |
uvos
|
29c8fbe4e0
HIP: bump requirement to rocm 6.1 (#15296)
|
5 months ago |
Bas Nijholt
|
1adc9812bd
fix(nix): remove non-functional llama-cpp cachix cache from flake.nix (#15295)
|
5 months ago |
Sigbjørn Skjæret
|
b3e16665e1
server : enable -td and -tbd parameters (#15172)
|
5 months ago |
Judd
|
c24f4e2688
ggml : update `ggml_rope_multi` (#12665)
|
5 months ago |
Copilot
|
d8914fc47e
common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191)
|
5 months ago |
Aldehir Rojas
|
e885445bc1
server : filter out harmony thought messages (#15278)
|
5 months ago |
Ali Tariq
|
648ebcdb73
ci : Added CI with RISC-V RVV1.0 Hardware (#14439)
|
5 months ago |
Sigbjørn Skjæret
|
07aa869a91
ci : add more python requirements to copilot-setup-steps (#15289)
|
5 months ago |
Georgi Gerganov
|
00f35d509e
ggml : repack block_iq4_nlx8 (#14904)
|
5 months ago |
Oliver Simons
|
6028bf7435
CUDA: Optimize `reduce_rows_f32` kernel, leading up to 25x perf improvement on kernel-level and 10% perf increase for Gemma3n (#15132)
|
5 months ago |
Sigbjørn Skjæret
|
bc5182272c
ci : add copilot-setup-steps.yml (#15214)
|
5 months ago |
Tak-RS
|
e71d48e326
ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others) (#15188)
|
5 months ago |
uvos
|
b0493156fa
HIP: disable sync warp shuffel operators from clr amd_warp_sync_functions.h (#15273)
|
5 months ago |
Romain Biessy
|
f4586ee598
sycl: Fix and disable more configurations of mul_mat (#15151)
|
5 months ago |
rmatif
|
60a7658810
opencl: allow mixed f16/f32 `add` (#15140)
|
5 months ago |
Aman Gupta
|
efe3a90996
CUDA cmake: add `-lineinfo` for easier debug (#15260)
|
5 months ago |
Chenguang Li
|
bbd57b7eaf
CANN: GGML_OP_CPY optimization (#15070)
|
5 months ago |
R0CKSTAR
|
25ff6f7659
musa: fix failures in test-backend-ops for mul_mat_id op (#15236)
|
5 months ago |
hipudding
|
be48528b06
CANN: Add broadcast for softmax and FA (#15208)
|
5 months ago |
rainred
|
cf9e5648a7
mtmd : Fix MinicpmV model converter and clip to avoid using hardcode. (#14750)
|
5 months ago |
Xuan-Son Nguyen
|
fba5c0d680
chat : hotfix gpt-oss jinja raising an exception (#15243)
|
5 months ago |
Xuan-Son Nguyen
|
53d0a12658
server : allow specifying reasoning_format in HTTP request (#15238)
|
5 months ago |
Zagaj
|
27093afe78
readme : update infra list (#15234)
|
5 months ago |
Georgi Gerganov
|
228f724d9c
kv-cache : fix seq_rm with seq_id == -1 (#15226)
|
5 months ago |