GGUF split Example

CLI to split / merge GGUF files.

Command line options:

  • --split: split a GGUF file into multiple GGUF files (default operation).
  • --split-max-tensors: maximum number of tensors in each split (default: 128).
  • --merge: merge multiple GGUF files into a single GGUF file.
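
To illustrate how --split-max-tensors bounds the number of output files, here is a minimal sketch in C++. It assumes tensors are assigned to splits sequentially, filling each split to the limit before starting the next; that fill order is an assumption, not something this README states. The helper names `split_count` and `split_sizes` are hypothetical and are not part of gguf-split.cpp.

```cpp
#include <algorithm>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical helper: number of output files needed when each split
// holds at most max_tensors tensors (ceiling division).
std::size_t split_count(std::size_t n_tensors, std::size_t max_tensors) {
    if (max_tensors == 0) {
        throw std::invalid_argument("max_tensors must be positive");
    }
    return (n_tensors + max_tensors - 1) / max_tensors;
}

// Hypothetical helper: tensor count per split, assuming splits are
// filled in order and only the last one may be partially full.
std::vector<std::size_t> split_sizes(std::size_t n_tensors, std::size_t max_tensors) {
    std::vector<std::size_t> sizes;
    const std::size_t n_splits = split_count(n_tensors, max_tensors);
    std::size_t remaining = n_tensors;
    for (std::size_t i = 0; i < n_splits; ++i) {
        const std::size_t take = std::min(remaining, max_tensors);
        sizes.push_back(take);
        remaining -= take;
    }
    return sizes;
}
```

Under these assumptions, a model with 291 tensors and the default limit of 128 gives `split_count(291, 128) == 3`, with splits of 128, 128 and 35 tensors.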