duduta
|
73460f6278
ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (#16805)
|
hai 2 meses |
JJJYmmm
|
d261223d24
model: add support for qwen3vl series (#16780)
|
hai 2 meses |
HimariO
|
ba1cb19cdd
llama : add Qwen2VL support + multimodal RoPE (#10361)
|
hai 1 ano |
Diego Devesa
|
9f40989351
ggml : move CPU backend to a separate file (#10144)
|
hai 1 ano |
Faisal Zaghloul
|
42c76d1358
Threadpool: take 2 (#8672)
|
hai 1 ano |
Clint Herron
|
07a3fc0608
Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258)
|
hai 1 ano |
Georgi Gerganov
|
2b3389677a
ggml : refactor rope norm/neox (#7634)
|
hai 1 ano |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
%!s(int64=2) %!d(string=hai) anos |