Commit History

Author SHA1 Message Date
  hipudding 5eae934883 CANN: Add RoPE contiguous check for 310I DUP device (#15735) 5 months ago
  xctan 05c0380f2a ggml-cpu : optimize RVV kernels (#15720) 5 months ago
  Daniel Bevenius 8c3fdf44ec model-conversion : add missing curl script [no ci] (#15761) 5 months ago
  hipudding f6da8cb86a CANN: Mask unsupported TRANSPOSE_1D operator (#15733) 5 months ago
  Chenguang Li 8a2234ea0c CANN: Fix type float_t to float (#15736) 5 months ago
  SnA1lGo 3de008208b fix: resolve unsigned int initialization warning for n_dims/size in gguf.cpp (#15754) 5 months ago
  Oliver Simons 69db8a52e6 chore: Update `.clang-format` to use `BinPackArguments=true` (#15744) 5 months ago
  Johannes Gäßler c466abe158 llama: -fa 1/0/-1 aliases for -fa on/off/auto (#15746) 5 months ago
  Ruben Ortlam 0a2a3841e8 vulkan: fix shaders gen when no integer dot is available (#15740) 5 months ago
  hipudding 9961d244f2 CANN: Resolve soft_max precision issue (#15730) 5 months ago
  Jeff Bolz 25f1045f07 vulkan: Fix macro parameter order for f32 matmul shaders (#15716) 5 months ago
  rmatif 97669e4073 opencl: add attn sinks support for FA kernels (#15706) 5 months ago
  Chenguang Li 2f853687b3 CANN: Support eager execution mode under ACL graph compilation (#15712) 5 months ago
  hipudding ef2af57ddf CANN: Support ext_factor in rope (#15710) 5 months ago
  Johannes Gäßler 5d804a4938 ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722) 5 months ago
  Gilad S. d4d8dbe383 vulkan: use memory budget extension to read memory usage (#15545) 5 months ago
  Jeff Bolz 35a42edac8 vulkan: add missing clamps in new mul_mat_id paths (#15702) 5 months ago
  Ruben Ortlam fec7911f8f vulkan: disable large mmv subgroups on older Nvidia GPUs (#15717) 5 months ago
  s-goto-11 078ce23ea7 ggml: SVE support for exponential functions (#15145) 5 months ago
  Prashant Vithule a0c2b207c5 ggml: aarch64: Implement SVE F16 kernels for vector functions (#15115) 5 months ago
  Jie Fu (傅杰) 4b20d8b7e3 convert : remove redundant code (#15708) 5 months ago
  Ruben Ortlam 02c1813517 Vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants (#14903) 5 months ago
  Daniel Bevenius 77dee9de97 ggml : WebGPU add TRANSPOSE and RESHAPE to supported ops (#15695) 5 months ago
  Jie Fu (傅杰) 4795c91c32 docs : add Hunyuan to models section (#15707) 5 months ago
  Akarshan Biswas b66df9d9c9 CUDA: fix build error from ambiguous __half conversions in conv2d (#15690) 5 months ago
  hipudding b9382c3877 CANN: Optimize MUL_MAT_ID (#15658) 5 months ago
  hipudding 3dc7397a27 CANN: fix RoPE cache issue on multi-device (#15629) 5 months ago
  Georgi Gerganov e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 5 months ago
  Georgi Gerganov 0d161f021a server : enable /slots by default and make it secure (#15630) 5 months ago
  Georgi Gerganov 4efd5a8316 metal : fix checks for available FA kernels (#15700) 5 months ago