Neo Zhang Jianyu
|
4aced7a631
[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai (#17826)
|
1 kuukausi sitten |
piDack
|
745fa0e78b
model : add glm-asr support (#17901)
|
1 kuukausi sitten |
Xuan-Son Nguyen
|
52392291b2
preset: handle negated arg, reverse the meaning if needed (#18041)
|
1 kuukausi sitten |
Sigbjørn Skjæret
|
5c8a717128
convert : refactor rope scaling handling (#18013)
|
1 kuukausi sitten |
Haowei Wu
|
37f5a1093b
mtmd: enhance image resizing in llava_uhd (#18014)
|
1 kuukausi sitten |
Ruben Ortlam
|
9e6649ecf2
vulkan: fix mul_mat_vec_iq1_s formatting (#18026)
|
1 kuukausi sitten |
Xuan-Son Nguyen
|
0759b09c90
graph: add f_attn_temp_offset (#18025)
|
1 kuukausi sitten |
Georgi Gerganov
|
254098a279
common : refactor common_sampler + grammar logic changes (#17937)
|
1 kuukausi sitten |
Jeff Bolz
|
3238b1400c
vulkan: Fix data race/hang in scalar/cm1 flash attention (#17887)
|
1 kuukausi sitten |
lovedheart
|
4722671641
vulkan: improve mul_mat_vec_iq1_s speed (#17874)
|
1 kuukausi sitten |
Eve
|
d15d177f43
vulkan: faster q6_k matmul (#17813)
|
1 kuukausi sitten |
Georgi Gerganov
|
77ad8542bd
model-conversion : cast logits to float32 (#18009)
|
1 kuukausi sitten |
Georgi Gerganov
|
609a2d0268
models : fix YaRN regression + consolidate logic (#18006)
|
1 kuukausi sitten |
Georgi Gerganov
|
a63cbafbbc
ggml : arm repack fix build
|
1 kuukausi sitten |
Georgi Gerganov
|
0e59224990
sync : ggml
|
1 kuukausi sitten |
Georgi Gerganov
|
71fdcf0616
ggml : arm repack fix build (whisper/0)
|
1 kuukausi sitten |
Congcong Cai
|
615655aafe
cmake : set `CMAKE_RUNTIME_OUTPUT_DIRECTORY` for non standalone build (ggml/1394)
|
1 kuukausi sitten |
Xuan-Son Nguyen
|
c00ff929dc
scripts: add script to compare logprobs of llama.cpp against other frameworks (#17947)
|
1 kuukausi sitten |
Sergey Fedorov
|
4ed2bae50d
server-models.cpp: add missing <filesystem> (#18000)
|
1 kuukausi sitten |
Jeff Bolz
|
5266379bca
llama_context: synchronize before reallocating output buffer (#17974)
|
1 kuukausi sitten |
Xuan-Son Nguyen
|
4d5ae24c0a
arg: fix common_params_parse not accepting negated arg (#17991)
|
1 kuukausi sitten |
Gustavo Rocha Dias
|
66ba51252e
cmake: correct scope - link ws2_32 for MinGW/w64devkit builds in cpp-httplib (#17972)
|
1 kuukausi sitten |
Jeff Bolz
|
36255a2268
vulkan: support get_rows for i32 (#17941)
|
1 kuukausi sitten |
Jeff Bolz
|
3229a23fa6
vulkan: support GGML_OP_DIAG (#17893)
|
1 kuukausi sitten |
Jeff Bolz
|
303f8615e9
vulkan: Multi-pass softmax for large number of cols (#17892)
|
1 kuukausi sitten |
Georgi Gerganov
|
3c6391e748
speculative-simple : free batch on exit (#17985)
|
1 kuukausi sitten |
Sigbjørn Skjæret
|
8e4d678528
common : skip model validation when --completion-bash is requested (#17975)
|
1 kuukausi sitten |
Jeff Bolz
|
07a10c1090
vulkan: Allow non-pow2 n_experts in topk_moe (#17872)
|
1 kuukausi sitten |
Sigbjørn Skjæret
|
2bc94e7928
add llama-completion to completion-bash executables (#17976)
|
1 kuukausi sitten |
Daniel Bevenius
|
fd1085ffb7
model-conversion : use CONVERTED_MODEL value for converted model [no ci] (#17984)
|
1 kuukausi sitten |