Commit Verlauf

Autor SHA1 Nachricht Datum
  Oleksandr Kuvshynov 408616adbd server : [easy] fix per round speculative decode logging (#18211) vor 4 Wochen
  Xuan-Son Nguyen 9e39a1e6a9 server: support load model on startup, support preset-only options (#18206) vor 4 Wochen
  Sigbjørn Skjæret 74e05131e9 ci : remove non-windows zip artifacts (#18201) vor 4 Wochen
  Sigbjørn Skjæret f74747d886 ci : only save ccache on master (#18207) vor 4 Wochen
  Alfred ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977) vor 4 Wochen
  Pascal 14931a826e arg: fix order to use short form before long form (#18196) vor 4 Wochen
  Julius Tischbein f99ef53d2a llama : Changing off_t to size_t for Windows (#18204) vor 4 Wochen
  Aman Gupta cc0a04343e server: friendlier error msg when ctx < input (#18174) vor 4 Wochen
  Xuan-Son Nguyen 98c1c7a7bf presets: refactor, allow cascade presets from different sources, add global section (#18169) vor 4 Wochen
  Aleksander Grygier acb73d8340 webui: Add editing attachments in user messages (#18147) vor 4 Wochen
  Daniel Bevenius 0a271d82b4 model-conversion : add verbose flag in run-org-model.py (#18194) vor 4 Wochen
  Naco Siren 52fc7fee8a android: fix missing screenshots for Android.md (#18156) vor 4 Wochen
  Jeff Bolz cdbada8d10 vulkan: Add perf logger mode with concurrency (#17944) vor 4 Wochen
  Xuan-Son Nguyen 8ea958d4d9 model : add ASR support for LFM2-Audio-1.5B (conformer) (#18106) vor 4 Wochen
  Pascal f9ec8858ed webui: display prompt processing stats (#18146) vor 4 Wochen
  Taimur Ahmad f716588e63 ggml-cpu: extend support for RVV floating-point kernels (#17318) vor 1 Monat
  Xuan-Son Nguyen 4d1316c440 arg: fix ASAN error on sampler_type_names empty (#18167) vor 1 Monat
  Sigbjørn Skjæret ec7b9329ae gguf-py : use copy-on-write mode for localtensor (#18162) vor 1 Monat
  yulo 54189c0d39 remove i_major_dual (#18157) vor 1 Monat
  Aleksander Grygier 9ce64aed7d webui: Fix selecting generated output issues during active streaming (#18091) vor 1 Monat
  Kim S. 900316da4e webui: fix chat screen shadow width (#18010) vor 1 Monat
  Johannes Gäßler 57c1e05643 llama: offload output layer to GPU first (#18148) vor 1 Monat
  Sigbjørn Skjæret 9cff4cc554 convert : sort and use file parts from model index if present (#18043) vor 1 Monat
  Julius Tischbein 4d4f4cacd1 llama : Async DirectIO model loading on Linux (#18012) vor 1 Monat
  Shouyu 0a0bba05e8 ggml-hexagon: swiglu_oai operation (#18114) vor 1 Monat
  Sigbjørn Skjæret 5166aaf868 convert : force patch_merger tensors to f16/f32 (#18124) vor 1 Monat
  Pascal 6ce3d85796 server: (webui) add --webui-config (#18028) vor 1 Monat
  Xuan-Son Nguyen e85e9d7637 server: (router) disable SSL on child process (#18141) vor 1 Monat
  Johannes Gäßler 8dcc3662a2 llama-fit-params: fix memory print (#18136) vor 1 Monat
  Kim S. d37fc93505 webui: fix chat header width when sidebar is closed (#17981) vor 1 Monat