Commit History

Author SHA1 Message Date
  Xuan-Son Nguyen e85e9d7637 server: (router) disable SSL on child process (#18141) 1 month ago
  Johannes Gäßler 8dcc3662a2 llama-fit-params: fix memory print (#18136) 1 month ago
  Kim S. d37fc93505 webui: fix chat header width when sidebar is closed (#17981) 1 month ago
  Shouyu 4470a0764a ggml-hexagon: gelu operation (#17921) 1 month ago
  Georgi Gerganov 4301e27319 common : restore grammar-based rejection sampling (#18137) 1 month ago
  Johannes Gäßler a2c199e479 common: clarify instructions for bug reports (#18134) 1 month ago
  HonestQiao 15dd67d869 model: fix GLM-ASR-Nano-2512 load error (#18130) (#18142) 1 month ago
  Xuan-Son Nguyen bde461de8c server: (router) allow child process to report status via stdout (#18110) 1 month ago
  Piotr Wilkin (ilintar) 8faa87db02 Extend run-org-model.py, add (a) batching (b) loading prompt from file (c) multimodal capacity (#18034) 1 month ago
  Johannes Gäßler 6f1f6a961a Github: ask for -v logs for params_fit [no ci] (#18128) 1 month ago
  Alberto Cabrera Pérez 669696e00d ggml-cpu: ARM64: repack version of q8_0 (dotprod and i8mm) (#18096) 1 month ago
  Tarek Dakhran 982060fadc model: fix LFM2_MOE missing tensors (#18132) 1 month ago
  Sigbjørn Skjæret 6853bee680 ci : clean up webui jobs (#18116) 1 month ago
  Pascal 487674fbb3 common: fix --override-kv to support comma-separated values (#18056) 1 month ago
  yulo acec774ef6 HIP: Refactor mma for RDNA and CDNA (#17990) 1 month ago
  Naco Siren 5c0d18881e llama.android : Rewrite Android binding (w/o cpu_features dep) (#17413) 1 month ago
  TrevorS 4b2a4778f8 arg: allow -kvu flag for llama-perplexity (#18117) 1 month ago
  Aadeshveer Singh 58062860af ggml : use WARP_SIZE/2 for argmax reduction offset (#18092) 1 month ago
  Yuri Khrustalev 2973a65ecb gguf-py : allow converting multi-tensor models from read-only locations (#18100) 1 month ago
  Johannes Gäßler d0794e89d9 llama-fit-params: force disable mlock (#18103) 1 month ago
  Johannes Gäßler 9dcac6cf9f llama-fit-params: lower ctx size for multi GPU (#18101) 1 month ago
  Johannes Gäßler 0e49a7b8b4 llama-fit-params: fix underflow for dense models (#18095) 1 month ago
  Johannes Gäßler 4164596c76 llama-fit-params: QoL impr. for prints/errors (#18089) 1 month ago
  Xuan-Son Nguyen ef83fb8601 model: fix LFM2 missing tensors (#18105) 1 month ago
  Johannes Gäßler ec98e20021 llama: fix early stop in params_fit if ctx is set (#18070) 1 month ago
  yifant-code 59977eba7b server: fix crash when batch > ubatch with embeddings (#17912) 1 month ago
  Daniel Bevenius 79dbae034a model-conversion : remove -fa option in model card template [no ci] (#18088) 1 month ago
  Xuan-Son Nguyen 7f2b2f3c77 arch: refactor LLM_TENSOR_NAMES (#18051) 1 month ago
  Xuan-Son Nguyen 7b1db3d3b7 arg: clarify auto kvu/np being set on server (#17997) 1 month ago
  Piotr Wilkin (ilintar) a5251ca11d Optimization: Qwen3 next autoregressive pass (#17996) 1 month ago