Daniel Bevenius
|
2995341730
llama : add support for NVIDIA Nemotron 3 Nano (#18058)
|
1 month ago |
Darius Lukas
|
40d9c394f4
Webui: Disable attachment button and model selector button when prompt textbox is disabled. (#17925)
|
1 month ago |
Sigbjørn Skjæret
|
d6a1e18c65
convert : move rope_parameters to TextModel class (#18061)
|
1 month ago |
Shouyu
|
c45f89d551
ggml-hexagon: mm for mtmd (#17894)
|
1 month ago |
HelloKS
|
9d52f17ae3
model : add KORMo model (#18032)
|
1 month ago |
ssweens
|
4529c660c8
kv-cache: Fix state restore fragmented cache (#17982)
|
1 month ago |
Pascal
|
0f4f35e7be
Fix unreadable user markdown colors and truncate long texts in deletion dialogs (#17555)
|
1 month ago |
Jeremy Demeule
|
165caaf5fb
metal: use shared buffers on eGPU (#17866)
|
1 month ago |
Xuan-Son Nguyen
|
96a181a933
mtmd: refactor audio preprocessing (#17978)
|
1 month ago |
Andrew Aladjev
|
4a4f7e6550
cli: fixed dead links to tools/main for cli and completion, fixed code owners (#17993)
|
1 month ago |
Thomas Jarosch
|
e73d548659
webui: add "delete all conversations" button to import/export tab (#17444)
|
1 month ago |
Johannes Gäßler
|
b1f3a6e5db
llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)
|
1 month ago |
Neo Zhang Jianyu
|
4aced7a631
[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai (#17826)
|
1 month ago |
piDack
|
745fa0e78b
model : add glm-asr support (#17901)
|
1 month ago |
Xuan-Son Nguyen
|
52392291b2
preset: handle negated arg, reverse the meaning if needed (#18041)
|
1 month ago |
Sigbjørn Skjæret
|
5c8a717128
convert : refactor rope scaling handling (#18013)
|
1 month ago |
Haowei Wu
|
37f5a1093b
mtmd: enhance image resizing in llava_uhd (#18014)
|
1 month ago |
Ruben Ortlam
|
9e6649ecf2
vulkan: fix mul_mat_vec_iq1_s formatting (#18026)
|
1 month ago |
Xuan-Son Nguyen
|
0759b09c90
graph: add f_attn_temp_offset (#18025)
|
1 month ago |
Georgi Gerganov
|
254098a279
common : refactor common_sampler + grammar logic changes (#17937)
|
1 month ago |
Jeff Bolz
|
3238b1400c
vulkan: Fix data race/hang in scalar/cm1 flash attention (#17887)
|
1 month ago |
lovedheart
|
4722671641
vulkan: improve mul_mat_vec_iq1_s speed (#17874)
|
1 month ago |
Eve
|
d15d177f43
vulkan: faster q6_k matmul (#17813)
|
1 month ago |
Georgi Gerganov
|
77ad8542bd
model-conversion : cast logits to float32 (#18009)
|
1 month ago |
Georgi Gerganov
|
609a2d0268
models : fix YaRN regression + consolidate logic (#18006)
|
1 month ago |
Georgi Gerganov
|
a63cbafbbc
ggml : arm repack fix build
|
1 month ago |
Georgi Gerganov
|
0e59224990
sync : ggml
|
1 month ago |
Georgi Gerganov
|
71fdcf0616
ggml : arm repack fix build (whisper/0)
|
1 month ago |
Congcong Cai
|
615655aafe
cmake : set `CMAKE_RUNTIME_OUTPUT_DIRECTORY` for non standalone build (ggml/1394)
|
1 month ago |
Xuan-Son Nguyen
|
c00ff929dc
scripts: add script to compare logprobs of llama.cpp against other frameworks (#17947)
|
1 month ago |