Johannes Gäßler
|
53ff6b9b9f
GGUF: C++ refactor, backend support, misc fixes (#11030)
|
1 year ago |
Georgi Gerganov
|
f66f582927
llama : refactor `src/llama.cpp` (#10902)
|
1 year ago |
Diego Devesa
|
cb13ef85a4
remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)
|
1 year ago |
Zhenwei Jin
|
76b37d1541
gguf-split : improve --split and --merge logic (#9619)
|
1 year ago |
slaren
|
e6deac31f7
gguf-split : add basic checks (#9499)
|
1 year ago |
Christian Zhou-Zheng
|
c00fad71e5
gguf-split : change binary multi-byte units to decimal (#7803)
|
1 year ago |
Xuan Son Nguyen
|
842500144e
gguf-split: add --no-tensor-first-split (#7072)
|
1 year ago |
Sigbjørn Skjæret
|
8800226d65
Fix --split-max-size (#6655)
|
1 year ago |
Xuan Son Nguyen
|
f7fc5f6c6f
split: allow --split-max-size option (#6343)
|
1 year ago |
Pierrick Hymbert
|
f482bb2e49
common: llama_load_model_from_url split support (#6192)
|
1 year ago |
Pierrick Hymbert
|
dba1af6129
llama_model_loader: support multiple split/shard GGUFs (#6187)
|
1 year ago |
DAN™
|
d8b009a945
Remove undeed header file. (#6158)
|
1 year ago |
Pierrick Hymbert
|
d0d5de42e5
gguf-split: split and merge gguf per batch of tensors (#6135)
|
1 year ago |