Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Jeff Bolz | b9ce940177 | vulkan: Fuse rope+set_rows (#16769) | 3 months ago |
| Xuan-Son Nguyen | 3464bdac37 | llama: fix ASAN error with M-RoPE (#16848) | 3 months ago |
| Xuan-Son Nguyen | e3af5563bd | llama: store mrope data in KV cell (#16825) | 3 months ago |
| Jeff Bolz | 10fcc41290 | vulkan: Update topk_moe fusion to handle gpt's late softmax (#16656) | 3 months ago |
| Ruben Ortlam | bcf5bda6f5 | Vulkan MMQ Integer Dot Refactor and K-Quant support (#16536) | 3 months ago |
| Max Krasnyansky | 3eb2be1ca5 | Hexagon Op queue & dispatch optimizations (#16820) | 3 months ago |
| Aman Gupta | e41bcce8f0 | CUDA: use fastdiv in set-rows (#16834) | 3 months ago |
| Sigbjørn Skjæret | 144a4ce824 | vendor : sync minja (#16500) | 3 months ago |
| Jeff Bolz | f549b0007d | vulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffer_copy (#16793) | 3 months ago |
| Aman Gupta | 9a3ea685b9 | CUDA: Fix bug in topk-moe for gpt-oss (#16821) | 3 months ago |
| YaelLogic | 338074c383 | sycl: add RMS_NORM_BACK operation support (#16808) | 3 months ago |
| YaelGitAccount | 851553ea6b | cuda: add SET operation support (#16804) | 3 months ago |
| Georgi Gerganov | 85a7d8677b | memory : remove KV cache size padding (#16812) | 3 months ago |
| Georgi Gerganov | a8ca18b4b8 | llama-bench : clarify benchmarked parts of the computation (#16823) | 3 months ago |
| l3utterfly | 8284efc35c | initialise buffer.device in ggml_hexagon_session (#16816) | 3 months ago |
| Sam Malayek | 1c1409e131 | embedding: add raw option for --embd-output-format (#16541) | 3 months ago |
| Johannes Gäßler | 7a0e900e36 | llama: consistent ctx <-> buf order for KV cache (#16746) | 3 months ago |
| Aldehir Rojas | 280d97be96 | grammar : support array references in json schema (#16792) | 3 months ago |
| Chenguang Li | 3479efd112 | CANN: Improve device ID handling and aclnnArange checks (#16752) | 3 months ago |
| Aman Gupta | 463bbf20bf | CUDA: add unused vars to mmvf and mmvq (#16807) | 3 months ago |
| tamarPal | ad8d36beff | sycl: add SSM_CONV operation support (#16800) | 3 months ago |
| Yuri Khrustalev | c053e18a66 | chat: Add LFM2 tool handling (#16763) | 3 months ago |
| Xuan-Son Nguyen | e1ab084803 | mtmd : fix idefics3 preprocessing (#16806) | 3 months ago |
| Diego Devesa | 5a4ff43e7d | llama : disable pipeline parallelism if compute buffer allocation fails (#16748) | 3 months ago |
| Acly | 10640e31aa | ggml : fix interpolate with align-corners and ne=1 (#16700) | 3 months ago |
| Johannes Gäßler | 80d28f104c | HIP: fix AMDGPU_TARGETS, update documentation (#16803) | 3 months ago |
| Xuan-Son Nguyen | c55d53acec | model : add LightOnOCR-1B model (#16764) | 3 months ago |
| Johannes Gäßler | 945501f5ea | llama: fix leaked buffers for mmap + split files (#16765) | 3 months ago |
| Aman Gupta | 75cbdd3fce | test-backend-ops: print failed tests at the end (#16785) | 3 months ago |
| tamarPal | 2b9bd9bf4e | sycl: add ROLL operation support (#16665) | 3 months ago |