Sigbjørn Skjæret
|
bc4e1128f7
llama : deci : support ffn-free with attention (#13296)
|
8 месяцев назад |
Ycros
|
39e73ae0d6
common : Add a warning when we can't match samplers from a string or char. (#13330)
|
8 месяцев назад |
R0CKSTAR
|
1f73301b63
cuda : remove nrows_x in mul_mat_q_process_tile (#13325)
|
8 месяцев назад |
Georgi Gerganov
|
4773d7a02f
examples : remove infill (#13283)
|
8 месяцев назад |
piDack
|
6c7fd67b64
llama : support tie embedding for chatglm models (#13328)
|
8 месяцев назад |
Johannes Gäßler
|
141a908a59
CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF (#13135)
|
8 месяцев назад |
Xuan-Son Nguyen
|
32916a4907
clip : refactor graph builder (#13321)
|
8 месяцев назад |
DocShotgun
|
ffc727203a
sampling : make top_n_sigma no-op at <=0 or a single candidate (#13345)
|
8 месяцев назад |
oobabooga
|
91a86a6f35
sampling : don't consider -infinity values in top_n_sigma (#13344)
|
8 месяцев назад |
Diego Devesa
|
f4ed10b69c
cmake : remove arm64 msvc presets (#13342)
|
8 месяцев назад |
Akarshan Biswas
|
1e333d5bba
SYCL: Disable reorder optimize by default and stop setting tensor extras when optimize is disabled (#13254)
|
8 месяцев назад |
Xuan-Son Nguyen
|
2f54e348ad
llama : fix build_ffn without gate (#13336)
|
8 месяцев назад |
Johannes Gäßler
|
2356fb1d53
CUDA: fix bad asserts for partial offload (#13337)
|
8 месяцев назад |
Sigbjørn Skjæret
|
764b85627b
convert : qwen2/3moe : set yarn metadata if present (#13331)
|
8 месяцев назад |
Johannes Gäßler
|
15a28ec8c7
CUDA: fix --split-mode row for MMQ (#13323)
|
8 месяцев назад |
compilade
|
a7366faa5b
gguf-py : avoid requiring pyside6 for other scripts (#13036)
|
8 месяцев назад |
Johannes Gäßler
|
9070365020
CUDA: fix logic for clearing padding with -ngl 0 (#13320)
|
8 месяцев назад |
oobabooga
|
233461f812
sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (#13264)
|
8 месяцев назад |
igardev
|
b34c859146
server : Webui - change setText command from parent window to also send the message. (#13309)
|
8 месяцев назад |
Xuan-Son Nguyen
|
9b61acf060
mtmd : rename llava directory to mtmd (#13311)
|
8 месяцев назад |
Xuan-Son Nguyen
|
5215b91e93
clip : fix confused naming ffn_up and ffn_down (#13290)
|
8 месяцев назад |
Sigbjørn Skjæret
|
ae803bfc3d
convert : bailingmoe : set yarn metadata if present (#13312)
|
8 месяцев назад |
Akarshan Biswas
|
66645a5285
SYCL: Disable mul_mat kernels for noncontiguous tensor b (#13308)
|
8 месяцев назад |
Xuan-Son Nguyen
|
27aa259532
mtmd : add C public API (#13184)
|
8 месяцев назад |
Diego Devesa
|
9fdfcdaedd
rpc : use backend registry, support dl backends (#13304)
|
8 месяцев назад |
Aaron Teo
|
6eb7d25c70
ggml : activate s390x simd for Q3_K (#13301)
|
8 месяцев назад |
Diego Devesa
|
86bd60d3fe
llava/mtmd : fixes to fully support dl backends (#13303)
|
8 месяцев назад |
Diego Devesa
|
9f2da5871f
llama : build windows releases with dl backends (#13220)
|
8 месяцев назад |
Johannes Gäßler
|
93c4e23905
CUDA: fix race condition in MMQ stream-k fixup (#13299)
|
8 месяцев назад |
Johannes Gäßler
|
8afbd96818
CUDA: fix race condition in MMQ ids_dst (#13294)
|
8 месяцев назад |