compilade
|
90083283ec
imatrix : use GGUF to store importance matrices (#9400)
|
6 months ago |
Georgi Gerganov
|
745aa5319b
llama : deprecate llama_kv_self_ API (#14030)
|
7 months ago |
Bartowski
|
efb8b47eda
imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389)
|
8 months ago |
Georgi Gerganov
|
51fb96b1ff
context : remove logits_all flag (#13284)
|
8 months ago |
Johannes Gäßler
|
3e959f0976
imatrix: fix oob writes if src1 is not contiguous (#13286)
|
8 months ago |
Diego Devesa
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
8 months ago |