Commit History

Author SHA1 Message Date
  Vishal Singh 017761daf5 ggml-zendnn : add ZenDNN backend for AMD CPUs (#17690) 1 month ago
  Georgi Gerganov 7956bb4d7f bench : cache the llama_context state at computed depth (#16944) 2 months ago
  Gadflyii 3df2244df4 llama : add --no-host to disable host buffers (#16310) 3 months ago
  Radoslav Gerganov 898acba681 rpc : add support for multiple devices (#16276) 3 months ago
  ssweens be79d9fdd9 llama-bench: add --devices and --list-devices support (#16039) 3 months ago
  jacekpoplawski 8ff206097c llama-bench: add --n-cpu-moe support (#15952) 4 months ago
  Diego Devesa 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
  Johannes Gäßler e81b8e4b7f llama: use FA + max. GPU layers by default (#15434) 4 months ago
  Georgi Gerganov 9ebebef62f llama : remove KV cache defragmentation logic (#15473) 4 months ago
  Juk Armstrong 476aa3fd57 Fixed name `-override-tensors` to `-override-tensor` (#15129) 5 months ago
  R0CKSTAR 3025b621d1 llama-bench: rename DB table name from test to llama_bench (#15003) 5 months ago
  Radoslav Gerganov c556418b60 llama-bench : use local GPUs along with RPC servers (#14917) 5 months ago
  bashayer hijji fffcce535e llama-bench : add --no-warmup flag (#14224) (#14270) 7 months ago
  Georgi Gerganov 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
  Max Krasnyansky 053b1539c0 threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling (#12995) 7 months ago
  Georgi Gerganov e298d2fbd0 kv-cache : add SWA support (#13194) 8 months ago
  Diego Devesa 6c8b91500e llama-bench : fix -ot with dl backends (#13563) 8 months ago
  Georgi Gerganov b2838049cc bench : handle decode errors (#13548) 8 months ago
  Diego Devesa cf0a43bb64 llama-bench : add defrag-thold, check for invalid ranges (#13487) 8 months ago
  Diego Devesa 22cdab343b llama-bench : accept ranges for integer parameters (#13410) 8 months ago
  David Huang 7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386) 8 months ago
  Diego Devesa 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago