Rotem Dan
|
f3ed38d793
Set RPATH to "@loader_path" / "$ORIGIN" to ensure executables and dynamic libraries search for dependencies in their origin directory. (#14309)
|
6 months ago |
Aman Gupta
|
55a1c5a5fd
CUDA: add softmax broadcast (#14475)
|
6 months ago |
Johannes Gäßler
|
12a81af45f
CUDA: broadcasting for FlashAttention mask (#14500)
|
6 months ago |
Jeff Bolz
|
8875523eb3
vulkan: support softmax/FA batch and broadcast (#14449)
|
6 months ago |
Georgi Gerganov
|
ec68e84c32
ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (#14435)
|
6 months ago |
zhouwg
|
307e79d33d
opencl : fix possible buffer overflow in dump_tensor (#14490)
|
6 months ago |
Georgi Gerganov
|
d7f5f4e578
simple-chat : fix context-exceeded condition (#14494)
|
6 months ago |
Eric Zhang
|
c8a4e470f6
opencl : skip empty nodes on cgraph compute (#14491)
|
6 months ago |
lhez
|
603e43dc91
opencl : update upscale to support align corners (#14488)
|
6 months ago |
Sigbjørn Skjæret
|
611ba4b264
ci : add OpenCL to labeler workflow (#14496)
|
6 months ago |
Eric Zhang
|
85841e121d
github : add OpenCL backend to issue templates (#14492)
|
6 months ago |
Björn Ganster
|
68b3cd6514
ggml : Callback before abort (#14481)
|
6 months ago |
Georgi Gerganov
|
de56944147
ci : disable fast-math for Metal GHA CI (#14478)
|
6 months ago |
Grzegorz Grasza
|
1b2aaf28ac
Add Vulkan images to docker.md (#14472)
|
6 months ago |
Chenguang Li
|
343b6e94b6
CANN: update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 (#14411)
|
6 months ago |
Jeff Bolz
|
6a746cf9c4
vulkan: Split large mul_mat_id to fit in shared memory (#14451)
|
6 months ago |
Sigbjørn Skjæret
|
eff5e45443
add GELU_ERF (#14455)
|
6 months ago |
Georgi Gerganov
|
a6a47958a1
ggml : remove trailing whitespace (#0)
|
6 months ago |
Georgi Gerganov
|
f61c05d4b1
sync : ggml
|
6 months ago |
Acly
|
431b2c24f3
ggml-cpu : "align corners" for bilinear upscale/downscale (ggml/1285)
|
6 months ago |
Daniel Bevenius
|
497be7c01d
ggml-quants : rename best_mad to best_error (ggml/1283)
|
7 months ago |
lhez
|
79b33b2317
opencl : add GEGLU, REGLU, SWIGLU (#14456)
|
6 months ago |
Aman Gupta
|
0a5a3b5cdf
Add Conv2d for CPU (#14388)
|
6 months ago |
Georgi Gerganov
|
745f11fed0
memory : correctly handle failure in apply() (#14438)
|
6 months ago |
Georgi Gerganov
|
5dd942de59
metal : disable fast-math for some cpy kernels (#14460)
|
6 months ago |
Romain Biessy
|
a7417f5594
ggml-cpu: sycl: Re-enable exp f16 (#14462)
|
6 months ago |
Diego Devesa
|
eb3fa2913e
test-backend-ops : disable llama test (#14461)
|
6 months ago |
xiaobing318
|
c839a2da1a
cmake : Remove redundant include path in CMakeLists.txt (#14452)
|
6 months ago |
Vedran Miletić
|
e9b6350e61
scripts : make the shell scripts cross-platform (#14341)
|
6 months ago |
matteo
|
caf5681fcb
server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196)
|
6 months ago |