Commit History

Author SHA1 Message Date
  Diego Devesa 6c8b91500e llama-bench : fix -ot with dl backends (#13563) 8 months ago
  Georgi Gerganov b2838049cc bench : handle decode errors (#13548) 8 months ago
  Diego Devesa cf0a43bb64 llama-bench : add defrag-thold, check for invalid ranges (#13487) 8 months ago
  Diego Devesa 22cdab343b llama-bench : accept ranges for integer parameters (#13410) 8 months ago
  David Huang 7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386) 8 months ago
  Diego Devesa 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago