haopeng | 42ae10bbcd | add cmake rvv support (#10411) | 1 year ago
Georgi Gerganov | 9fe0fb0626 | sync : ggml | 1 year ago
Plamen Minev | 611fabd792 | metal : fix offset integer overflows in im2col (ggml/1015) | 1 year ago
PAB | 12b0ad953a | metal : add `GGML_UNARY_OP_ELU` kernel (ggml/1018) | 1 year ago
蕭澧邦 | 342397dc7e | cmake: force MSVC compiler charset to utf-8 (#9989) | 1 year ago
bandoti | 2a11b6b094 | Add required ggml-base and backend libs to cmake pkg (#10407) | 1 year ago
Diego Devesa | 3ee6382d48 | cuda : fix CUDA_FLAGS not being applied (#10403) | 1 year ago
Georgi Gerganov | 8e752a777b | llama : add check for KV cache shifts (#10401) | 1 year ago
Shane A | a88ad007de | llama : add OLMo November 2024 support (#10394) | 1 year ago
Romain Biessy | 2a1507c162 | sycl : Add option to set the SYCL architecture for all targets (#10266) | 1 year ago
Jeff Bolz | b3e585988f | vulkan: Optimize soft_max (#10301) | 1 year ago
Alberto Cabrera Pérez | 557924f222 | sycl: Revert MUL_MAT_OP support changes (#10385) | 1 year ago
Diego Devesa | d3481e6316 | cuda : only use native when supported by cmake (#10389) | 1 year ago
bandoti | 531cb1c233 | Skip searching root path for cross-compile builds (#10383) | 1 year ago
Jeff Bolz | f139d2ea61 | vulkan: remove use of null initializer (#10372) | 1 year ago
Georgi Gerganov | 2eb76b2a5e | flake.lock: Update (#10346) | 1 year ago
0cc4m | 9b75f03cd2 | Vulkan: Fix device info output format specifiers (#10366) | 1 year ago
Johannes Gäßler | 75207b3a88 | docker: use GGML_NATIVE=OFF (#10368) | 1 year ago
Johannes Gäßler | 76e9e58b78 | CUDA: fix MMV kernel being used for FP16 src1 (#10357) | 1 year ago
Johannes Gäßler | ce2e59ba10 | CMake: fix typo in comment [no ci] (#10360) | 1 year ago
Diego Devesa | be5caccef9 | llama : only use default buffer types for the KV cache (#10358) | 1 year ago
Georgi Gerganov | 20a780c7b6 | gitignore : ignore local run scripts [no ci] | 1 year ago
Georgi Gerganov | cf32a9b93a | metal : refactor kernel args into structs (#10238) | 1 year ago
FirstTimeEZ | a43178299c | ggml : fix undefined reference to 'getcpu' (#10354) | 1 year ago
Johannes Gäßler | c3ea58aca4 | CUDA: remove DMMV, consolidate F16 mult mat vec (#10318) | 1 year ago
Johannes Gäßler | 467576b6cc | CMake: default to -arch=native for CUDA build (#10320) | 1 year ago
Diego Devesa | eda7e1d4f5 | ggml : fix possible buffer use after free in sched reserve (#9930) | 1 year ago
Georgi Gerganov | 24203e9dd7 | ggml : inttypes.h -> cinttypes (#0) | 1 year ago
Georgi Gerganov | 5d9e59979c | ggml : adapt AMX to tensor->grad removal (#0) | 1 year ago
Georgi Gerganov | a4200cafad | make : add ggml-opt (#0) | 1 year ago