matteo | afbb4c1322 | ggml-cuda: Adding support for unified memory (#8035) | 1 year ago
Alex O'Connell | b7a08fd5e0 | Build: Only include execinfo.h on linux systems that support it (#8783) | 1 year ago
slaren | 7a11eb3a26 | cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (#8800) | 1 year ago
wangshuai09 | c8a0090922 | cann: support q8_0 for Ascend backend (#8805) | 1 year ago
Igor Okulist | afbbcf3c04 | server : update llama-server embedding flag documentation (#8779) | 1 year ago
Clint Herron | ed9d2854c9 | Build: Fix potential race condition (#8781) | 1 year ago
pculliton | 398ede5efe | Adding Gemma 2 2B configs (#8784) | 1 year ago
Borislav Stanimirov | 44d28ddd5c | cmake : fix use of external ggml (#8787) | 1 year ago
Someone | 268c566006 | nix: cuda: rely on propagatedBuildInputs (#8772) | 1 year ago
Brian | 7e72aa74fd | py: add_array() will not add to kv store if value is an empty array (#8774) | 1 year ago
l3utterfly | 7c27a19b2e | added android implementation of ggml_print_backtrace_symbols (#8751) | 1 year ago
Georgi Gerganov | 140074bb86 | flake.lock: Update (#8729) | 1 year ago
wangshuai09 | 6e2b6000e5 | cann: update cmake (#8765) | 1 year ago
zhentaoyu | c887d8b017 | [SYCL] Add `TIMESTEP_EMBEDDING` OP (#8707) | 1 year ago
CarterLi999 | 75af08c475 | ggml: bugfix: fix the inactive elements is agnostic for risc-v vector (#8748) | 1 year ago
R0CKSTAR | 439b3fc75a | cuda : organize vendor-specific headers into vendors directory (#8746) | 1 year ago
Meng, Hengyu | 0832de7236 | [SYCL] add conv support (#8688) | 1 year ago
Johannes Gäßler | 6eeaeba126 | cmake: use 1 more thread for non-ggml in CI (#8740) | 1 year ago
Austin | 4730faca61 | chore : Fix vulkan related compiler warnings, add help text, improve CLI options (#8477) | 1 year ago
compilade | 4c676c85e5 | llama : refactor session file management (#8699) | 1 year ago
R0CKSTAR | e54c35e4fb | feat: Support Moore Threads GPU (#8383) | 1 year ago
Georgi Gerganov | 5e2727fe03 | scripts : sync vulkan-shaders (#0) | 1 year ago
Georgi Gerganov | 56f20aa25d | scripts : sync ggml-aarch64 sources | 1 year ago
Georgi Gerganov | 345c8c0c87 | ggml : add missing semicolon (#0) | 1 year ago
Georgi Gerganov | ae7985cd7b | sync : ggml | 1 year ago
Mahesh Madhav | a05ca93697 | ggml : loop tiling optimizations for scalar path (ggml/898) | 1 year ago
Ivan Filipov | 9f77d899b7 | ggml: add support for float16 input tensors in pooling operations (ggml/895) | 1 year ago
Tony Wasserka | 203b7f1531 | vulkan : initialize vk_buffer_struct members to VK_NULL_HANDLE (ggml/893) | 1 year ago
Borislav Stanimirov | d2b851bfa1 | cmake : only enable GGML_NATIVE and x86 flags if not crosscompiling (ggml/885) | 1 year ago
Daniel Bevenius | c12b6e8ee7 | ggml : remove unnecessary UNUSED macro call (ggml/880) | 1 year ago