Dan Johansson
|
b2e89a3274
Arm AArch64: Documentation updates (#9321)
|
1 year ago |
Markus Tavenrath
|
daa9623ab0
Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early. (#9118)
|
1 year ago |
Georgi Gerganov
|
e079bffb66
cuda : fix FA Q src index (1 -> 0) (#9374)
|
1 year ago |
Xuan Son Nguyen
|
3f7ccfd649
common : bring back missing args, add env var duplication check (#9375)
|
1 year ago |
slaren
|
a249843d89
common : restore --n-gpu-layers (#9371)
|
1 year ago |
slaren
|
19f4a7b296
llama : refactor samplers internal implementation (#9370)
|
1 year ago |
Neo Zhang Jianyu
|
2a358fb0c4
[SYCL] add check malloc result on device (#9346)
|
1 year ago |
slaren
|
eae597182c
llama : sanitize tokens in the upper bound (#9359)
|
1 year ago |
Xuan Son Nguyen
|
00b02bb249
imatrix : fix arg parser for imatrix (#9366)
|
1 year ago |
Georgi Gerganov
|
a876861455
metal : update support condition for im2col + fix warning (#0)
|
1 year ago |
Georgi Gerganov
|
385decbd63
sync : ggml
|
1 year ago |
Georgi Gerganov
|
60a3107ccd
scripts : option to increase git patch context
|
1 year ago |
Salvatore Mesoraca
|
406c1a32a1
vulkan: add dryrun support to sin and cos ops (ggml/947)
|
1 year ago |
Salvatore Mesoraca
|
9cb9260861
vulkan: correctly report support for OP_CONT (ggml/946)
|
1 year ago |
Johannes Gäßler
|
202084d31d
tests: add gradient tests for all backends (ggml/932)
|
1 year ago |
Johannes Gäßler
|
dbbebcab33
ggml: fix ggml_graph_cpy undefined behavior (ggml/943)
|
1 year ago |
Georgi Gerganov
|
ba1cf846ed
cann : fix doxy (ggml/0)
|
1 year ago |
Mengqing Cao
|
d2d3200b38
cann : add Ascend NPU support (whisper/2336)
|
1 year ago |
Georgi Gerganov
|
51d964a4ef
cuda : mark BF16 CONT as unsupported
|
1 year ago |
Salvatore Mesoraca
|
efe6a83e30
ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)
|
1 year ago |
Kevin Gibbons
|
fbb7fcffbc
llama : set attrs of mislabelled EOT/EOM tokens (#9348)
|
1 year ago |
Georgi Gerganov
|
a5b5d9a101
llama.android : fix build (#9350)
|
1 year ago |
Georgi Gerganov
|
f12295b8a9
llama : fix empty ring buffer push (#9358)
|
1 year ago |
Georgi Gerganov
|
faf69d4237
llama : sanitize invalid tokens (#9357)
|
1 year ago |
Eve
|
e536426ded
llamafile : disable sgemm for batch-size 1 (#9330)
|
1 year ago |
Xuan Son Nguyen
|
1b9ae5189c
common : refactor arg parser (#9308)
|
1 year ago |
slaren
|
e32d0816ed
ggml : always check bounds on get_rows operations (#9354)
|
1 year ago |
Georgi Gerganov
|
df270ef745
llama : refactor sampling v2 (#9294)
|
1 year ago |
Xuan Son Nguyen
|
947538acb8
ggml : fix missing `cpu_set_t` on emscripten (#9336)
|
1 year ago |
slaren
|
6c89eb0b47
ci : disable rocm image creation (#9340)
|
1 year ago |