Jared Van Bortel
|
15f5d96037
build : fix build info generation and cleanup Makefile (#3920)
|
2 vuotta sitten |
shibe2
|
465219b914
CLBlast: Add outer loops over src0 for broadcasting in mulmat
|
2 vuotta sitten |
shibe2
|
1117d06607
opencl : fix element-wise multiplication (#3656)
|
2 vuotta sitten |
shibe2
|
40e5ce054f
CLBlast: Fix temporary buffer size for f16 conversion (wsize)
|
2 vuotta sitten |
shibe2
|
1e0e873c37
CLBlast: Fix matrix-vector multiplication (#3544)
|
2 vuotta sitten |
shibe2
|
e2583cbc29
CLBlast: Fix handling of on-device tensor data
|
2 vuotta sitten |
shibe2
|
665018c749
CLBlast: Add broadcast support for matrix multiplication (#3402)
|
2 vuotta sitten |
shibe2
|
36b904e200
ggml-opencl.cpp: Make private functions static (#3300)
|
2 vuotta sitten |
slaren
|
bd33e5ab92
ggml-opencl : store GPU buffer in ggml_tensor::extra (#2994)
|
2 vuotta sitten |
Wentai Zhang
|
6460f758db
opencl : fix a bug in ggml_cl_pool_malloc() for ggml_cl_mul_mat_f32() (#2955)
|
2 vuotta sitten |
Howard Su
|
481f793acc
Fix opencl by wrap #if-else-endif with \n (#2086)
|
2 vuotta sitten |
Govlzkoy
|
14a2cc71f6
[ggml] fix index for ne03 value in ggml_cl_mul_f32 (#2088)
|
2 vuotta sitten |
LostRuins
|
96a712ca1b
Porting the improved K-Quant CUDA kernels to OpenCL (#1966)
|
2 vuotta sitten |
Howard Su
|
3d59ec5935
ggml : fix warnings under MSVC (#1908)
|
2 vuotta sitten |
0cc4m
|
d411968e99
opencl : support k-quants (#1836)
|
2 vuotta sitten |
Howard Su
|
58970a4c39
Leverage mmap for offloading tensors to GPU (#1597)
|
2 vuotta sitten |
Robert Sung-wook Shin
|
98ed165574
OpenCL: Add release memory (#1741)
|
2 vuotta sitten |
Johannes Gäßler
|
17366df842
Multi GPU support, CUDA refactor, CUDA scratch buffer (#1703)
|
2 vuotta sitten |
LostRuins
|
d5b111f53d
Clblast fixes + enhancements to save VRAM and offload more layers (#1675)
|
2 vuotta sitten |
0cc4m
|
dcb2ed4826
OpenCL: Fix duplication of layers in VRAM and RAM, add GPU mul kernel (#1653)
|
2 vuotta sitten |
Howard Su
|
bb051d9723
opencl : no need to allocate cl_mem on heap (#1612)
|
2 vuotta sitten |
Howard Su
|
ca74884f66
opencl : use strstr to check if fp16 supported (#1611)
|
2 vuotta sitten |
Maarten ter Huurne
|
7d873811f3
Fix handling of "invalid property" when creating OpenCL command queue (#1565)
|
2 vuotta sitten |
0cc4m
|
2e6cd4b025
OpenCL Token Generation Acceleration (#1459)
|
2 vuotta sitten |