Xuan-Son Nguyen
|
92ecdcc06a
mtmd : add vision support for llama 4 (#13282)
|
8 ヶ月 前 |
Alberto Cabrera Pérez
|
f71f40a284
ci : upgraded oneAPI version in SYCL workflows and dockerfile (#13532)
|
8 ヶ月 前 |
Georgi Gerganov
|
d30cb5a7fa
sync : ggml
|
8 ヶ月 前 |
Johannes Gäßler
|
6c35981a64
mnist: fix segmentation fault (ggml/1227)
|
8 ヶ月 前 |
Diego Devesa
|
8b5e19aea6
ggml : fix apple OS check in ggml_print_backtrace (ggml/1229)
|
8 ヶ月 前 |
Daniel Tang
|
60aea028b5
ggml : Fix missing backtrace on Linux (ggml/1228)
|
8 ヶ月 前 |
Nick
|
9c55e5c5c2
fix: check model pointer validity before use (#13631)
|
8 ヶ月 前 |
Chenguang Li
|
33d7aed4a8
CANN: Support MOE Model MUL_MAT_ID (#13042)
|
8 ヶ月 前 |
Isaac McFadyen
|
6a2bc8bfb7
server : added --no-prefill-assistant flag (#13608)
|
8 ヶ月 前 |
Gilad S.
|
e3a7cf6c5b
cmake: use the current build config for vulkan-shaders-gen (#13595)
|
8 ヶ月 前 |
Georgi Gerganov
|
518329b2d4
parallel : add option for non-shared and larger prompts (#13598)
|
8 ヶ月 前 |
Jeff Bolz
|
2f5a4e1e09
vulkan: move common FA code to flash_attn_base.comp (#13556)
|
8 ヶ月 前 |
Jeff Bolz
|
4f41ee11d6
vulkan: use scalar FA rather than coopmat2 when N==1 (#13554)
|
8 ヶ月 前 |
Z
|
3e0be1cace
llguidance : official v0.7.20 release (no actual changes) [noci] (#13594)
|
8 ヶ月 前 |
Xuan-Son Nguyen
|
6aa892ec2a
server : do not return error out of context (with ctx shift disabled) (#13577)
|
8 ヶ月 前 |
Xuan-Son Nguyen
|
aea9f8b4e7
webui : improve accessibility for visually impaired people (#13551)
|
8 ヶ月 前 |
Xuan-Son Nguyen
|
06c1e4abc1
readme : add list of dependencies and their license (#13591)
|
8 ヶ月 前 |
Diego Devesa
|
415e40a357
releases : use arm version of curl for arm releases (#13592)
|
8 ヶ月 前 |
Georgi Gerganov
|
654a67794f
metal : add FA-vec kernel for head size 64 (#13583)
|
8 ヶ月 前 |
Diego Devesa
|
5364ae4ba5
llama : print hint when loading a model when no backends are loaded (#13589)
|
8 ヶ月 前 |
Sigbjørn Skjæret
|
7c07ac244d
ci : add ppc64el to build-linux-cross (#13575)
|
8 ヶ月 前 |
Łukasz Ślusarczyk
|
0a338ed013
sycl : fixed compilation warnings (#13582)
|
8 ヶ月 前 |
Olivier Chafik
|
bc098c3cf0
minja: sync (qwen3) (#13573)
|
8 ヶ月 前 |
Diego Devesa
|
c6a2c9e741
gguf : use ggml log system (#13571)
|
8 ヶ月 前 |
Daniel Tang
|
07ad2b6db3
gguf-py : fix disconnect-before-connect in editor-gui (#13569)
|
8 ヶ月 前 |
Xuan-Son Nguyen
|
c531edfa34
convert : fix conversion for llama 4 (#13567)
|
8 ヶ月 前 |
Atharva Dubey
|
02cdd2d8b0
sycl: simplify bin_bcast_kernel (#13383)
|
8 ヶ月 前 |
Svetlozar Georgiev
|
64bb51cf90
sycl: reordered Q4_K MMVQ (#13109)
|
8 ヶ月 前 |
Łukasz Ślusarczyk
|
9c404ed54c
sycl: use oneDNN for matrices multiplication (#12972)
|
8 ヶ月 前 |
Diego Devesa
|
6c8b91500e
llama-bench : fix -ot with dl backends (#13563)
|
8 ヶ月 前 |