Diego Devesa | 6c8b91500e | llama-bench : fix -ot with dl backends (#13563) | 8 months ago
Georgi Gerganov | b2838049cc | bench : handle decode errors (#13548) | 8 months ago
Diego Devesa | cf0a43bb64 | llama-bench : add defrag-thold, check for invalid ranges (#13487) | 8 months ago
Diego Devesa | 22cdab343b | llama-bench : accept ranges for integer parameters (#13410) | 8 months ago
David Huang | 7f323a589f | Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386) | 8 months ago
Diego Devesa | 1d36b3670b | llama : move end-user examples to tools directory (#13249) | 8 months ago