Chenguang Li
|
239b60e898
CANN: fix acl_rstd allocation size in ggml_cann_rms_norm (#15760)
|
4 ماه پیش |
Ruben Ortlam
|
dff7551bfd
vulkan: fix mmv subgroup16 selection (#15775)
|
4 ماه پیش |
Jeff Bolz
|
0fce7a1248
vulkan: don't use std::string in load_shaders, to improve compile time (#15724)
|
4 ماه پیش |
Daniel Bevenius
|
8227695d7a
vulkan : update ggml_vk_instance_validation_ext_available (#15666)
|
4 ماه پیش |
Shin-myoung-serp
|
0014fb4add
ggml vulkan: add hardsigmoid and hardswish operations (#15762)
|
4 ماه پیش |
Oliver Simons
|
661ae31c9c
CUDA: Optimize `rms_norm_f32` kernel and its fused variants, giving 1-6% perf E2E (#15715)
|
4 ماه پیش |
Daniel Bevenius
|
407c23786d
model-conversion : fix pyright errors (#15770)
|
4 ماه پیش |
Georgi Gerganov
|
cdedb70a99
sampling : optimize dist sampler (#15704)
|
4 ماه پیش |
Daniel Bevenius
|
2c8dac72eb
llama : fix incorrect model type for Gemma 270M (#15764)
|
4 ماه پیش |
Daniel Bevenius
|
40a751ea9a
model-conversion : remove hardcoded /bin/bash shebangs [no ci] (#15765)
|
4 ماه پیش |
hipudding
|
5eae934883
CANN: Add RoPE contiguous check for 310I DUP device (#15735)
|
4 ماه پیش |
xctan
|
05c0380f2a
ggml-cpu : optimize RVV kernels (#15720)
|
4 ماه پیش |
Daniel Bevenius
|
8c3fdf44ec
model-conversion : add missing curl script [no ci] (#15761)
|
4 ماه پیش |
hipudding
|
f6da8cb86a
CANN: Mask unsupported TRANSPOSE_1D operator (#15733)
|
4 ماه پیش |
Chenguang Li
|
8a2234ea0c
CANN: Fix type float_t to float (#15736)
|
4 ماه پیش |
SnA1lGo
|
3de008208b
fix: resolve unsigned int initialization warning for n_dims/size in gguf.cpp (#15754)
|
4 ماه پیش |
Oliver Simons
|
69db8a52e6
chore: Update `.clang-format` to use `BinPackArguments=true` (#15744)
|
4 ماه پیش |
Johannes Gäßler
|
c466abe158
llama: -fa 1/0/-1 aliases for -fa on/off/auto (#15746)
|
4 ماه پیش |
Ruben Ortlam
|
0a2a3841e8
vulkan: fix shaders gen when no integer dot is available (#15740)
|
4 ماه پیش |
hipudding
|
9961d244f2
CANN: Resolve soft_max precision issue (#15730)
|
4 ماه پیش |
Jeff Bolz
|
25f1045f07
vulkan: Fix macro parameter order for f32 matmul shaders (#15716)
|
4 ماه پیش |
rmatif
|
97669e4073
opencl: add attn sinks support for FA kernels (#15706)
|
4 ماه پیش |
Chenguang Li
|
2f853687b3
CANN: Support eager execution mode under ACL graph compilation (#15712)
|
4 ماه پیش |
hipudding
|
ef2af57ddf
CANN: Support ext_factor in rope (#15710)
|
4 ماه پیش |
Johannes Gäßler
|
5d804a4938
ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722)
|
4 ماه پیش |
Gilad S.
|
d4d8dbe383
vulkan: use memory budget extension to read memory usage (#15545)
|
4 ماه پیش |
Jeff Bolz
|
35a42edac8
vulkan: add missing clamps in new mul_mat_id paths (#15702)
|
4 ماه پیش |
Ruben Ortlam
|
fec7911f8f
vulkan: disable large mmv subgroups on older Nvidia GPUs (#15717)
|
4 ماه پیش |
s-goto-11
|
078ce23ea7
ggml: SVE support for exponential functions (#15145)
|
4 ماه پیش |
Prashant Vithule
|
a0c2b207c5
ggml: aarch64: Implement SVE F16 kernels for vector functions (#15115)
|
4 ماه پیش |