Georgi Gerganov
|
254098a279
common : refactor common_sampler + grammar logic changes (#17937)
|
vor 1 Monat |
compilade
|
19f68fa5a4
imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076)
|
vor 5 Monaten |
compilade
|
d31192b4ee
imatrix : use GGUF by default (#14842)
|
vor 5 Monaten |
compilade
|
0a2f5496be
imatrix : fix 3d activation handling for hybrid and recurrent models (#14994)
|
vor 5 Monaten |
Ed Addario
|
d1aa0cc5d1
imatrix: add option to display importance score statistics for a given imatrix file (#12718)
|
vor 5 Monaten |
compilade
|
90083283ec
imatrix : use GGUF to store importance matrices (#9400)
|
vor 6 Monaten |
Georgi Gerganov
|
745aa5319b
llama : deprecate llama_kv_self_ API (#14030)
|
vor 7 Monaten |
Bartowski
|
efb8b47eda
imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389)
|
vor 8 Monaten |
Georgi Gerganov
|
51fb96b1ff
context : remove logits_all flag (#13284)
|
vor 8 Monaten |
Johannes Gäßler
|
3e959f0976
imatrix: fix oob writes if src1 is not contiguous (#13286)
|
vor 8 Monaten |
Diego Devesa
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
vor 8 Monaten |