Sigbjørn Skjæret
|
d2a4ef05c6
vocab : add ByteDance-Seed/Seed-Coder (#13423)
|
8 months ago |
Xuan-Son Nguyen
|
15e6125a39
mtmd : add hard limit on image resolution for qwen2vl / qwen2.5vl (#13434)
|
8 months ago |
Xuan-Son Nguyen
|
3b24d26c22
server : update docs (#13432)
|
8 months ago |
Sigbjørn Skjæret
|
43dfd741a5
llguidance : set tokenizer slices to default (#13424)
|
8 months ago |
Thammachart Chinvarapon
|
b064a51a4e
ci: free_disk_space flag enabled for intel variant (#13426)
|
8 months ago |
Xuan-Son Nguyen
|
053367d149
mtmd : support InternVL 2.5 and 3 (#13422)
|
8 months ago |
Johannes Gäßler
|
d8919424f1
CUDA: fix FlashAttention on Turing (#13415)
|
8 months ago |
Xuan-Son Nguyen
|
7fef11766c
arg : add env var to control mmproj (#13416)
|
8 months ago |
Jeff Bolz
|
dc1d2adfc0
vulkan: scalar flash attention implementation (#13324)
|
8 months ago |
Helton Reis
|
7c28a74e07
chore(llguidance): use tagged version that does not break the build (#13413)
|
8 months ago |
Xuan-Son Nguyen
|
33eff40240
server : vision support via libmtmd (#12898)
|
8 months ago |
Alberto Cabrera Pérez
|
17512a94d6
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs (#12858)
|
8 months ago |
Georgi Gerganov
|
611aa914ef
metal : optimize MoE for large batches (#13388)
|
8 months ago |
Johannes Gäßler
|
0cf6725e9f
CUDA: FA support for Deepseek (Ampere or newer) (#13306)
|
8 months ago |
Diego Devesa
|
27ebfcacba
llama : do not crash if there is no CPU backend (#13395)
|
8 months ago |
Johannes Gäßler
|
5c86c9ed3e
CUDA: fix crash on large batch size for MoE models (#13384)
|
8 months ago |
Bartowski
|
efb8b47eda
imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389)
|
8 months ago |
R0CKSTAR
|
0527771dd8
llama-run: add support for downloading models from ModelScope (#13370)
|
8 months ago |
Xuan-Son Nguyen
|
2189fd3b63
mtmd : fix batch_view for m-rope (#13397)
|
8 months ago |
Xuan-Son Nguyen
|
3f96aeff39
llama : one-off chat template fix for Mistral-Small-2503 (#13398)
|
8 months ago |
Radoslav Gerganov
|
b486ba05bf
rpc : add rpc_msg_set_tensor_hash_req (#13353)
|
8 months ago |
Jeff Bolz
|
02115dcd9a
vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)
|
8 months ago |
Xuan-Son Nguyen
|
d9c4accaff
server : (webui) rename has_multimodal --> modalities (#13393)
|
8 months ago |
Diego Devesa
|
15e03282bb
ci : limit write permission to only the release step + fixes (#13392)
|
8 months ago |
Matt Clayton
|
f05a6d71a0
mtmd : Expose helper_decode_image_chunk (#13366)
|
8 months ago |
Xuan-Son Nguyen
|
ee01d71e58
server : (webui) fix a very small misalignment (#13387)
|
8 months ago |
Xuan-Son Nguyen
|
8c83449cb7
server : (webui) revamp the input area, plus many small UI improvements (#13365)
|
8 months ago |
Sigbjørn Skjæret
|
1a844be132
convert : support rope_scaling type and rope_type (#13349)
|
8 months ago |
welix
|
0ccc121354
mtmd : fix the calculation of n_tokens for smolvlm (#13381)
|
8 months ago |
Georgi Gerganov
|
6562e5a4d6
context : allow cache-less context for embeddings (#13108)
|
8 months ago |