Commit History

Author SHA1 Message Date
  Xuan-Son Nguyen 0c2ee38ab7 convert : correct gemma 3n conversion (#14450) 6 months ago
  Georgi Gerganov a70c8a0c4b kv-cache : use ggml_set_rows (#14285) 6 months ago
  Georgi Gerganov 9067487c44 ggml : fix FA mask dim 2 and 3 (#14505) 6 months ago
  Georgi Gerganov d4cdd9c1c3 ggml : remove kompute backend (#14501) 6 months ago
  Aman Gupta 55c2646b45 CUDA: add dynamic shared mem to softmax, refactor general usage (#14497) 6 months ago
  Sigbjørn Skjæret e75ba4c043 gguf-py : add support for chat template jinja files (#14508) 6 months ago
  compilade 5d46babdc2 llama : initial Mamba-2 support (#9126) 6 months ago
  Georgi Gerganov e17991c466 sync : ggml 6 months ago
  Daniel Bevenius c46944aa25 ggml : add version function to get lib version (ggml/1286) 6 months ago
  Rotem Dan f3ed38d793 Set RPATH to "@loader_path" / "$ORIGIN" to ensure executables and dynamic libraries search for dependencies in their origin directory. (#14309) 6 months ago
  Aman Gupta 55a1c5a5fd CUDA: add softmax broadcast (#14475) 6 months ago
  Johannes Gäßler 12a81af45f CUDA: broadcasting for FlashAttention mask (#14500) 6 months ago
  Jeff Bolz 8875523eb3 vulkan: support softmax/FA batch and broadcast (#14449) 6 months ago
  Georgi Gerganov ec68e84c32 ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (#14435) 6 months ago
  zhouwg 307e79d33d opencl : fix possible buffer overflow in dump_tensor (#14490) 6 months ago
  Georgi Gerganov d7f5f4e578 simple-chat : fix context-exceeded condition (#14494) 6 months ago
  Eric Zhang c8a4e470f6 opencl : skip empty nodes on cgraph compute (#14491) 6 months ago
  lhez 603e43dc91 opencl : update upscale to support align corners (#14488) 6 months ago
  Sigbjørn Skjæret 611ba4b264 ci : add OpenCL to labeler workflow (#14496) 6 months ago
  Eric Zhang 85841e121d github : add OpenCL backend to issue templates (#14492) 6 months ago
  Björn Ganster 68b3cd6514 ggml : Callback before abort (#14481) 6 months ago
  Georgi Gerganov de56944147 ci : disable fast-math for Metal GHA CI (#14478) 6 months ago
  Grzegorz Grasza 1b2aaf28ac Add Vulkan images to docker.md (#14472) 6 months ago
  Chenguang Li 343b6e94b6 CANN: update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 (#14411) 6 months ago
  Jeff Bolz 6a746cf9c4 vulkan: Split large mul_mat_id to fit in shared memory (#14451) 6 months ago
  Sigbjørn Skjæret eff5e45443 add GELU_ERF (#14455) 6 months ago
  Georgi Gerganov a6a47958a1 ggml : remove trailing whitespace (#0) 6 months ago
  Georgi Gerganov f61c05d4b1 sync : ggml 6 months ago
  Acly 431b2c24f3 ggml-cpu : "align corners" for bilinear upscale/downscale (ggml/1285) 6 months ago
  Daniel Bevenius 497be7c01d ggml-quants : rename best_mad to best_error (ggml/1283) 7 months ago