Xuan-Son Nguyen
|
f08c4c0d8d
mtmd : clean up clip_n_output_tokens (#15391)
|
5 months ago |
Georgi Gerganov
|
6d7f1117e3
codeowners : remove mmv.*
|
5 months ago |
Georgi Gerganov
|
60212f1ead
sync : ggml
|
5 months ago |
Georgi Gerganov
|
f0c541d315
scripts : update sync scripts
|
5 months ago |
Sigbjørn Skjæret
|
baa9255a45
llama : merge conts and reshapes and remove unnecessary cont (#15380)
|
5 months ago |
Georgi Gerganov
|
3007baf201
readme : update hot topics (#15397)
|
5 months ago |
davidef
|
d1d8241600
server : fix incoming tasks not process in order (#15395)
|
5 months ago |
Dobri Danchev
|
618575c582
Fix broken build: require updated pip to support --break-system-packages (#15357)
|
5 months ago |
compilade
|
f44f793172
ggml-quants : fix make_qp_quants NANs and IQ1 assertion errors (#15379)
|
5 months ago |
Jeff Bolz
|
ae532eac2c
vulkan: disable spirv-opt for bfloat16 shaders (#15352)
|
5 months ago |
Oleksandr Kuvshynov
|
e5155e6986
server : export max observed n_past value (#15361)
|
5 months ago |
Jeff Bolz
|
21c17b5bef
vulkan: Use larger workgroups for mul_mat_vec when M is small (#15355)
|
5 months ago |
Dong Won Kim
|
19f4decae0
vulkan: support sqrt (#15370)
|
5 months ago |
Sigbjørn Skjæret
|
4d196981d4
convert : force patch_embd weights to F16 or F32 to avoid broken GGUFs (#15367)
|
5 months ago |
Sigbjørn Skjæret
|
b143fbc87a
ci : fix hang in windows-hip build/release (#15365)
|
5 months ago |
Jeff Bolz
|
de5627910d
vulkan: Optimize argsort (#15354)
|
5 months ago |
Tarek Dakhran
|
65349f26f2
model : support vision LiquidAI LFM2-VL family (#15347)
|
5 months ago |
Jeff Bolz
|
1fe00296f5
vulkan: fuse adds (#15252)
|
5 months ago |
Jeff Bolz
|
de2192794f
vulkan: Support mul_mat_id with f32 accumulators (#15337)
|
5 months ago |
Jeff Bolz
|
2e2b22ba66
vulkan: Add missing bounds checking to scalar/coopmat1 mul_mat_id (#15334)
|
5 months ago |
rmatif
|
912ff8c119
OpenCL: add initial FA support (#14987)
|
5 months ago |
Daniel Bevenius
|
5e6229a840
common : fix double bos, use common_chat_templates for add_bos and add_eos (#15326)
|
5 months ago |
lhez
|
e2c1bfff53
opencl: add initial mxfp4 support via mv (#15270)
|
5 months ago |
Georgi Gerganov
|
5edf1592fd
vulkan : fix out-of-bounds access in argmax kernel (#15342)
|
5 months ago |
Georgi Gerganov
|
db3010bd23
vulkan : fix compile warnings on macos (#15340)
|
5 months ago |
Aaron Teo
|
ff27f80a74
ggml: initial IBM zDNN backend (#14975)
|
5 months ago |
Sigbjørn Skjæret
|
d3248d9b65
ci : fix ios-xcode-build (#15324)
|
5 months ago |
Diego Devesa
|
7aeee88cfe
ci : move ccache action to ggml-org fork (#15328)
|
5 months ago |
Johannes Gäßler
|
b07791aa1d
test-opt: fix backend support check (#15317)
|
5 months ago |
Johannes Gäßler
|
4227c9be42
CUDA: fix negative KV_max values in FA (#15321)
|
5 months ago |