Prashant Vithule
|
5fac4d5764
ggml : vector length agnostic SVE support (#9290)
|
1 gadu atpakaļ |
slaren
|
5fb5e24811
llama : minor sampling refactor (2) (#9386)
|
1 gadu atpakaļ |
Georgi Gerganov
|
38ca6f644b
readme : update hot topics
|
1 gadu atpakaļ |
Johannes Gäßler
|
8e6e2fbe14
CUDA: fix variable name conflict for Windows build (#9382)
|
1 gadu atpakaļ |
Antonis Makropoulos
|
5ed087573e
readme : add LLMUnity to UI projects (#9381)
|
1 gadu atpakaļ |
Radoslav Gerganov
|
54f376d0b9
rpc : update README [no ci] (#9320)
|
1 gadu atpakaļ |
Dan Johansson
|
b2e89a3274
Arm AArch64: Documentation updates (#9321)
|
1 gadu atpakaļ |
Markus Tavenrath
|
daa9623ab0
Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early. (#9118)
|
1 gadu atpakaļ |
Georgi Gerganov
|
e079bffb66
cuda : fix FA Q src index (1 -> 0) (#9374)
|
1 gadu atpakaļ |
Xuan Son Nguyen
|
3f7ccfd649
common : bring back missing args, add env var duplication check (#9375)
|
1 gadu atpakaļ |
slaren
|
a249843d89
common : restore --n-gpu-layers (#9371)
|
1 gadu atpakaļ |
slaren
|
19f4a7b296
llama : refactor samplers internal implementation (#9370)
|
1 gadu atpakaļ |
Neo Zhang Jianyu
|
2a358fb0c4
[SYCL] add check malloc result on device (#9346)
|
1 gadu atpakaļ |
slaren
|
eae597182c
llama : sanitize tokens in the upper bound (#9359)
|
1 gadu atpakaļ |
Xuan Son Nguyen
|
00b02bb249
imatrix : fix arg parser for imatrix (#9366)
|
1 gadu atpakaļ |
Georgi Gerganov
|
a876861455
metal : update support condition for im2col + fix warning (#0)
|
1 gadu atpakaļ |
Georgi Gerganov
|
385decbd63
sync : ggml
|
1 gadu atpakaļ |
Georgi Gerganov
|
60a3107ccd
scripts : option to increase git patch context
|
1 gadu atpakaļ |
Salvatore Mesoraca
|
406c1a32a1
vulkan: add dryrun support to sin and cos ops (ggml/947)
|
1 gadu atpakaļ |
Salvatore Mesoraca
|
9cb9260861
vulkan: correctly report support for OP_CONT (ggml/946)
|
1 gadu atpakaļ |
Johannes Gäßler
|
202084d31d
tests: add gradient tests for all backends (ggml/932)
|
1 gadu atpakaļ |
Johannes Gäßler
|
dbbebcab33
ggml: fix ggml_graph_cpy undefined behavior (ggml/943)
|
1 gadu atpakaļ |
Georgi Gerganov
|
ba1cf846ed
cann : fix doxy (ggml/0)
|
1 gadu atpakaļ |
Mengqing Cao
|
d2d3200b38
cann : add Ascend NPU support (whisper/2336)
|
1 gadu atpakaļ |
Georgi Gerganov
|
51d964a4ef
cuda : mark BF16 CONT as unsupported
|
1 gadu atpakaļ |
Salvatore Mesoraca
|
efe6a83e30
ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934)
|
1 gadu atpakaļ |
Kevin Gibbons
|
fbb7fcffbc
llama : set attrs of mislabelled EOT/EOM tokens (#9348)
|
1 gadu atpakaļ |
Georgi Gerganov
|
a5b5d9a101
llama.android : fix build (#9350)
|
1 gadu atpakaļ |
Georgi Gerganov
|
f12295b8a9
llama : fix empty ring buffer push (#9358)
|
1 gadu atpakaļ |
Georgi Gerganov
|
faf69d4237
llama : sanitize invalid tokens (#9357)
|
1 gadu atpakaļ |