Commit History

Author SHA1 Message Date
  Georgi Gerganov 13730c183b metal : cap threadgroups size of set_rows (#17146) 2 months ago
  Adrien Gallouët 967eb4b2bf ggml-cpu : inspect -march and -mcpu to found the CPU (#16333) 2 months ago
  Ruben Ortlam f117be185e vulkan: check glslc executable string (#17144) 2 months ago
  Ruben Ortlam 85234a4b3a vulkan: fix validation issue introduced by #16868 (#17145) 2 months ago
  Gabe Goodhart 0c74f32632 memory: Hybrid context shift (#17009) 2 months ago
  Georgi Gerganov c27efd2bd1 metal : enable tensor API for A19 (#17087) 2 months ago
  fj-y-saito df70bedda7 arm64: add i8mm route with SVE ggml_vec_dot_q4_K_q8_K and ggml_vec_dot_q6_K_… (#15277) 2 months ago
  Georgi Gerganov f914544b16 batched-bench : add "separate text gen" mode (#17103) 2 months ago
  Xuan-Son Nguyen 4b13a684c5 mtmd: fix patch_size initialized to random value in audio models (#17128) 2 months ago
  Georgi Gerganov 9898b57cbe editorconfig : ignore benches/ (#17140) 2 months ago
  Acly 1032256ec9 cuda/vulkan : bicubic interpolation (#17022) 2 months ago
  Georgi Gerganov 15274c0c50 benches : add eval results (#17139) 2 months ago
  Georgi Gerganov b8595b16e6 mtmd : fix embedding size for image input (#17123) 2 months ago
  Ruben Ortlam 392e09a608 vulkan: fix memory allocations (#17122) 2 months ago
  compilade 802cef44bf convert : parse safetensors directly (#15667) 2 months ago
  compilade 1c07c0c68c convert : handle compressed-tensors quant method (#17069) 2 months ago
  Georgi Gerganov cb1adf8851 server : handle failures to restore host cache (#17078) 2 months ago
  Georgi Gerganov ef1d826997 benches : add folder with benchmarks (#16931) 2 months ago
  Eric Curtin 86fde91e62 Switch to using Ubuntu 25.10 vulkan/mesa (#16497) 2 months ago
  Ruben Ortlam 7f3e9d339c vulkan: iGPU memory reporting fix (#17110) 2 months ago
  Ruben Ortlam 8a3519b708 vulkan: fix mmq out of bounds reads (#17108) 2 months ago
  Jeff Bolz 80a6cf6347 vulkan: fuse mul_mat_id + mul (#17095) 2 months ago
  Georgi Gerganov 0750a59903 metal : retain src and dst buffers during async ops (#17101) 2 months ago
  Xuan-Son Nguyen aa3b7a90b4 arg: add --cache-list argument to list cached models (#17073) 2 months ago
  chansikpark 333f2595a3 webui: fix keyboard shortcuts for new chat & edit chat title (#17007) 2 months ago
  Jeff Bolz 53d7d21e61 vulkan: Use spec constants for conv2d s/d/p and kernel W/H (#16978) 2 months ago
  Aidan eeee367de5 server: fix correct time_ms calculation in prompt_progress (#17093) 2 months ago
  Aman Gupta 64fe17fbb8 Revert "CUDA: add expert reduce kernel (#16857)" (#17100) 2 months ago
  Aman Gupta c1b187688d CUDA: skip fusion for repeating adds in bias (#17080) 2 months ago
  SavicStefan b8a5cfd11a vulkan: Increase BK to 32; use BK/4 for non-CM mul_mm.comp (#16636) 2 months ago