Commit History

Author SHA1 Message Date
  Xuan-Son Nguyen aa3b7a90b4 arg: add --cache-list argument to list cached models (#17073) 2 months ago
  Gadflyii 3df2244df4 llama : add --no-host to disable host buffers (#16310) 3 months ago
  Aaron Teo 624207e676 devops: add s390x & ppc64le CI (#15925) 3 months ago
  Douglas Hanley b5bd037832 llama : add support for qwen3 reranker (#15824) 3 months ago
  Uilian Ries 152729f884 common : add missing chrono header for common.cpp (#16211) 3 months ago
  Johannes Gäßler e81b8e4b7f llama: use FA + max. GPU layers by default (#15434) 4 months ago
  Sigbjørn Skjæret 84ab83cc0b model : jina-embeddings-v3 support (#13693) 4 months ago
  Georgi Gerganov 9ebebef62f llama : remove KV cache defragmentation logic (#15473) 4 months ago
  Jie Fu (傅杰) 2f3dbffb17 common : fix incorrect print of non-ascii characters in the logging (#15466) 4 months ago
  Jonathan Graehl 5cdb27e091 finetune: SGD optimizer, more CLI args (#13873) 5 months ago
  Diego Devesa d6818d06a6 llama : allow other bufts when overriding to CPU, add --no-repack option (#14990) 5 months ago
  compilade 90083283ec imatrix : use GGUF to store importance matrices (#9400) 6 months ago
  Georgi Gerganov 225e7a1438 llama : add high-throughput mode (#14363) 6 months ago
  Georgi Gerganov 6ffd4e9c44 server : pre-calculate EOG logit biases (#14721) 6 months ago
  Ruikai Peng dd6e6d0b6a vocab : prevent tokenizer overflow (#14301) 7 months ago
  fanyang 456af35eb7 build : suppress gcc15 compile warnings (#14261) 7 months ago
  Diego Devesa 6adc3c3ebc llama : add thread safety test (#14035) 7 months ago
  Georgi Gerganov d3e64b9f49 llama : rework embeddings logic (#14208) 7 months ago
  bandoti 2e89f76b7a common: fix issue with regex_escape routine on windows (#14133) 7 months ago
  Georgi Gerganov 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
  Max Krasnyansky 053b1539c0 threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling (#12995) 7 months ago
  Đinh Trọng Huy e0e3aa231d llama : add support for BertForSequenceClassification reranker (#13858) 7 months ago
  Percy Piper c508256db2 rpc : Fix build on OpenBSD (#13541) 7 months ago
  Georgi Gerganov a4090d1174 llama : remove llama_kv_cache_view API + remove deprecated (#13653) 8 months ago
  Georgi Gerganov e298d2fbd0 kv-cache : add SWA support (#13194) 8 months ago
  psocolovsky 1dfbf2cf3a common : add load_progress_callback (#13617) 8 months ago
  Olivier Chafik 3198405e98 `common`: add partial regex support (#12808) 8 months ago
  Johannes Gäßler 10d2af0eaa llama/ggml: add LLM training support (#10544) 8 months ago
  David Huang 7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386) 8 months ago
  Georgi Gerganov 51fb96b1ff context : remove logits_all flag (#13284) 8 months ago