Commit History

Author SHA1 Message Date
  Georgi Gerganov 2ddd3f2356 sync : ggml 4 months ago
  Georgi Gerganov 4d3d455d3c sync : whisper.cpp (ggml/1359) 4 months ago
  Daniel Bevenius c9b1c06467 ggml : remove -dev suffix from release version (ggml/1355) 4 months ago
  Daniel Bevenius b6ae75afb4 ggml : bump version to 0.9.3 (ggml/1353) 4 months ago
  Georgi Gerganov b6dff20e2f ggml : prepare for development of 0.9.2-dev 4 months ago
  Georgi Gerganov 2db78c75e4 ggml : bump version to 0.9.1 4 months ago
  Rafal Lewczuk 02463ab27b ggml-backend : add root cause in error message if loading backend library fails (#16172) 4 months ago
  Sigbjørn Skjæret adc76347d7 ggml : check cuda and metal argsort limits and add test (#16323) 4 months ago
  Aleksander Grygier 3a2bdcda0b Improve Mobile UI for dialogs and action dropdowns (#16222) 4 months ago
  Pascal 66bb7985c3 fix: preserved zero values in chat settings inputs and textareas by switching to nullish coalescing for field values and default placeholders (#16312) 4 months ago
  Vinkal 2f61c0f5bf llama-cli: prevent spurious assistant token (#16202) 4 months ago
  ddh0 3ffd0fae47 perplexity : show more kl-divergence data (#16321) 4 months ago
  Georgi Gerganov a4a0aa5ea2 ggml : fix dependencies for ggml_set_rows (#16318) 4 months ago
  Jeff Bolz 92cd103f62 vulkan: Fix validation failure in quantized flash attention (#16292) 4 months ago
  Sigbjørn Skjæret b887d2f341 ggml : fix GGML_F32_VEC_FMA argument order in ggml_vec_mad1_f32 (#16307) 4 months ago
  crat0z bd0af02fc9 common : fix reasoning before forced tool call via tool_choice = required (#16264) 4 months ago
  R0CKSTAR d9e0e7c819 ci : fix musa docker build (#16306) 4 months ago
  Aaron Teo 0124ac989f devops: switch to using ubuntu-22.04-s390x image (#16302) 4 months ago
  Imad Saddik 2811c65286 Fixed a few typos in the README of the LLaMA.cpp HTTP Server [no ci] (#16297) 4 months ago
  Jeff Bolz d8359f5fde vulkan: 64-bit im2col (#16135) 4 months ago
  Georgi Gerganov 6a2c6145a0 metal : extend mat-mat multiplication support (#16225) 4 months ago
  Georgi Gerganov 3b53634fe3 metal : fuse non-sequential nodes (#16102) 4 months ago
  Jeff Bolz 1384abf8b8 vulkan: handle mat_mul with A matrix > 4GB (#16176) 4 months ago
  Jeff Bolz e6d65fb02d vulkan: support arbitrary KV dimension in flash attention (#16160) 4 months ago
  Acly 8656f5de68 vulkan : make the vulkan.hpp dynamic dispatcher instance private (#16224) 4 months ago
  Aleksander Grygier 4807e8f96a Show message actions by default (#16289) 4 months ago
  Aman Gupta c0bfc57af4 CUDA: mul_mat_id for mmf for bs <= 64 for f16 and bs <= 32 for f32 (#16277) 4 months ago
  Johannes Gäßler 75a3a6c2cd CUDA: refactor and deduplicate vector FA kernels (#16208) 4 months ago
  Dmytro Minochkin 0499b29c6f vulkan: throw system error instead of SIGABRT during init on older devices (#16156) 4 months ago
  Adrien Gallouët 234e2ff8ed server : remove old LLAMA_SERVER_SSL (#16290) 4 months ago