muggle-stack
|
342c728d03
ggml : fix SpaceMit IME array out-of-bounds in task assignment (#16629)
|
3 months ago |
Pascal
|
ababae7e1e
webui: reorganize settings layout (#16607)
|
3 months ago |
Jeff Bolz
|
b19491599d
vulkan: fix debug build (add_rms_len/data not found) (#16624)
|
3 months ago |
Ilia Ilmer
|
9ad4f1931e
metal : add `CONV_TRANSPOSE_2D` (#16542)
|
3 months ago |
Olivier Chafik
|
79967ec596
grammar : use int64_t to avoid int overflows in int schema to grammar conversion logic (#16626)
|
3 months ago |
GittyBurstein
|
ceff6bb253
SYCL SET operator optimized for F32 tensors (#16350)
|
3 months ago |
Xuan-Son Nguyen
|
1bb4f43380
mtmd : support home-cooked Mistral Small Omni (#14928)
|
3 months ago |
Pascal
|
683fa6ba4e
fix: added a normalization step for MathJax-style \[\] and \(\) delimiters (#16599)
|
3 months ago |
GittyBurstein
|
b22572e97d
sycl : add ARANGE operator (#16362)
|
3 months ago |
Chenguang Li
|
7a50cf388a
CANN: format code using .clang-format (#15863)
|
3 months ago |
takasurazeem
|
6f5d924637
common : Update the docs on -t --threads (#16236)
|
3 months ago |
takuya kodama
|
adc9b60f19
ggml-cpu: replace putenv with setenv for const-correctness (#16573)
|
3 months ago |
yael-works
|
ee50ee1ead
SYCL: Add GGML_OP_MEAN operator support (#16009)
|
3 months ago |
Aleksei Nikiforov
|
7adc79c032
gguf-py : add support for endian conversion of BF16 data (#16594)
|
3 months ago |
safranowith
|
466c1911ab
cpu : add FLOOR, CEIL, ROUND and TRUNC unary operators (#16083)
|
3 months ago |
lhez
|
0cb7a0683b
opencl: add q8_0 mm support (#16469)
|
3 months ago |
lhez
|
d93f8439b0
opencl: fix FA for f32 (#16584)
|
3 months ago |
Aleksander Grygier
|
f9fb33f263
Add server-driven parameter defaults and syncing (#16515)
|
3 months ago |
Sam/Samuel
|
f4ce81c45e
metal: optimise `GGML_OP_SUM` (#16559)
|
3 months ago |
Georgi Gerganov
|
17304cbcc1
server : fix img token logs (#16595)
|
3 months ago |
Xuan-Son Nguyen
|
3e3cb19f64
llama-quant: add support for mmproj (#16592)
|
3 months ago |
Julius Tischbein
|
5acd455460
CUDA: Changing the CUDA scheduling strategy to spin (#16585)
|
3 months ago |
Georgi Gerganov
|
554fd578a5
server : fix mtmd checkpoints (#16591)
|
3 months ago |
Georgi Gerganov
|
fa882fd2b1
metal : avoid using Metal's gpuAddress property (#16576)
|
3 months ago |
SavicStefan
|
ffa059034c
vulkan: Add ACC_TYPE_VEC2 implementation (#16203)
|
3 months ago |
Aman Gupta
|
120bf7046d
CUDA + openCL: fix bug in accessing rms_norm->src while doing fusion (#16577)
|
3 months ago |
Jeff Bolz
|
4258e0cfe7
vulkan: Support FA with K/V in F32 (#16543)
|
3 months ago |
Jeff Bolz
|
7ea15bb64c
vulkan: Improve build time for MSVC (#16545)
|
3 months ago |
Johannes Gäßler
|
9c7185dd28
CUDA: enable FA for FP32 KV cache (#16546)
|
3 months ago |
Aman Gupta
|
1ee9d0b415
CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557)
|
3 months ago |