コミット履歴

作者 SHA1 メッセージ 日付
  rmatif 97669e4073 opencl: add attn sinks support for FA kernels (#15706) 4 ヶ月 前
  Chenguang Li 2f853687b3 CANN: Support eager execution mode under ACL graph compilation (#15712) 4 ヶ月 前
  hipudding ef2af57ddf CANN: Support ext_factor in rope (#15710) 4 ヶ月 前
  Johannes Gäßler 5d804a4938 ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722) 4 ヶ月 前
  Gilad S. d4d8dbe383 vulkan: use memory budget extension to read memory usage (#15545) 4 ヶ月 前
  Jeff Bolz 35a42edac8 vulkan: add missing clamps in new mul_mat_id paths (#15702) 4 ヶ月 前
  Ruben Ortlam fec7911f8f vulkan: disable large mmv subgroups on older Nvidia GPUs (#15717) 4 ヶ月 前
  s-goto-11 078ce23ea7 ggml: SVE support for exponential functions (#15145) 4 ヶ月 前
  Prashant Vithule a0c2b207c5 ggml: aarch64: Implement SVE F16 kernels for vector functions (#15115) 4 ヶ月 前
  Jie Fu (傅杰) 4b20d8b7e3 convert : remove redundant code (#15708) 4 ヶ月 前
  Ruben Ortlam 02c1813517 Vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants (#14903) 4 ヶ月 前
  Daniel Bevenius 77dee9de97 ggml : WebGPU add TRANSPOSE and RESHAPE to supported ops (#15695) 4 ヶ月 前
  Jie Fu (傅杰) 4795c91c32 docs : add Hunyuan to models section (#15707) 4 ヶ月 前
  Akarshan Biswas b66df9d9c9 CUDA: fix build error from ambiguous __half conversions in conv2d (#15690) 4 ヶ月 前
  hipudding b9382c3877 CANN: Optimize MUL_MAT_ID (#15658) 4 ヶ月 前
  hipudding 3dc7397a27 CANN: fix RoPE cache issue on multi-device (#15629) 4 ヶ月 前
  Georgi Gerganov e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 4 ヶ月 前
  Georgi Gerganov 0d161f021a server : enable /slots by default and make it secure (#15630) 4 ヶ月 前
  Georgi Gerganov 4efd5a8316 metal : fix checks for available FA kernels (#15700) 4 ヶ月 前
  Diego Devesa 274966226f llama : fix fattn reserve call n_seqs parameter (#15699) 4 ヶ月 前
  Diego Devesa 9777032dcc llama : separate compute buffer reserve from fattn check (#15696) 4 ヶ月 前
  Sigbjørn Skjæret 7d3c9f2b21 ci : explicitly set fa off or on (#15692) 4 ヶ月 前
  Jeff Bolz bbbf5ecccb vulkan: handle large sizes for get_rows (#15686) 4 ヶ月 前
  Jeff Bolz c37052ab4d vulkan: mul_mat_id coopmat2 optimizations (#15546) 4 ヶ月 前
  Daniel Bevenius 5c16b9c87d vulkan : remove unused portability_enumeration_ext variable (#15679) 4 ヶ月 前
  Jeff Bolz b97c9edc59 vulkan: Allow fallback to sysmem memory when vidmem is full (#15649) 4 ヶ月 前
  Jeff Bolz 94e82c7ead vulkan: clamp matmul and FA results to the max finite value (#15652) 4 ヶ月 前
  Charles Xu 4d74393bcc ggml: update kleidiai to v1.13.0 (#15663) 4 ヶ月 前
  Diego Devesa dd892555b0 Update build.md to remove MSVC arm64 notes (#15684) 4 ヶ月 前
  Johannes Gäßler e81b8e4b7f llama: use FA + max. GPU layers by default (#15434) 4 ヶ月 前