Pascal | b1846f1c8e | webui: add rehype plugin to restore HTML in Markdown table cells (#17477) | 1 month ago
Jeff Bolz | d414db02d3 | vulkan: Use fewer rows for scalar FA when HS is not a multiple of 16 (#17455) | 1 month ago
Aaron Teo | 877566d512 | llama: introduce support for model-embedded sampling parameters (#17120) | 1 month ago
Jeff Bolz | 3d07caa99b | vulkan: more FA details in vk_perf_logger (#17443) | 1 month ago
Daniel Bevenius | 134e6940ca | llama : skip output reordering for single token batches (#17466) | 1 month ago
Jiacheng (Jason) Chen | 0543f928a3 | HIP: WMMA-MMQ kernels for RDNA 4 (#17156) | 1 month ago
Sigbjørn Skjæret | b61de2b2df | convert : allow quantizing lora again (#17453) | 1 month ago
Xuan-Son Nguyen | b8372eecd9 | server: split server.cpp code into server/common/task/queue (#17362) | 1 month ago
Daniel Bevenius | 6ab8eacddf | examples : add -kvu to batched usage example [no ci] (#17469) | 1 month ago
Georgi Gerganov | 2d50b9d8cb | sync : ggml | 1 month ago
Daniel Bevenius | 697edfeead | ggml : remove dirty flag from version string (ggml/1391) | 1 month ago
Alberto Cabrera Pérez | dbb852b549 | ggml-cpu: arm64: q4_K repack gemm and gemv implementations (i8mm) (#16739) | 1 month ago
ixgbe | 5f55c385cb | ggml: add RISC-V cpu-feats (#17461) | 1 month ago
william pan | 4902eebe33 | models : Added support for RND1 Diffusion Language Model (#17433) | 2 months ago
Max Krasnyansky | 923ae3c619 | hexagon: add support for ROPE_NEOX (#17458) | 2 months ago
Raul Torres | 01ad35e6d6 | CANN: Define `cann_graph_update_required` before macro (#17434) | 2 months ago
M. Mediouni | fcb013847c | ggml-hexagon: Initial Hexagon v68/v69 support (#17394) | 2 months ago
nullname | d5bc1ad110 | ggml-hexagon: add `hex_supported_buffer` for better buffer supported check (#17212) | 2 months ago
Pascal | 0c7220db56 | webui: minor settings reorganization and add disable autoscroll option (#17452) | 2 months ago
Sigbjørn Skjæret | 96ac5a2329 | cuda : support non-contiguous i32 to i32 copy (#17326) | 2 months ago
Eric Curtin | bc809e9c53 | vulkan: Update docker image to Ubuntu 26.04 to enable glslc features (#17439) | 2 months ago
Jeff Bolz | 54d83bbe85 | vulkan: remove a couple unnecessary switches (#17419) | 2 months ago
Adrien Gallouët | 4949ac0f18 | ci : switch to BoringSSL on Server workflow (#17441) | 2 months ago
Masato Nakasaka | 3f3a4fb9c3 | Revive MUL_MAT_ID to perf testing (#17397) | 2 months ago
yulo | 028f93ef98 | HIP: RDNA4 tensor core support for MMF (#17077) | 2 months ago
lhez | 8e9ddba610 | opencl: refine condition for kqv mm (#17392) | 2 months ago
ubergarm | 23bc779a6e | model : detect GigaChat3-10-A1.8B as deepseek lite (#17420) | 2 months ago
Adrien Gallouët | 28175f857d | cmake : add option to build and link BoringSSL (#17205) | 2 months ago
Adrien Gallouët | 9cc4080441 | ci : start using OpenSSL (#17235) | 2 months ago
Jeff Bolz | f1ffbba68e | vulkan: disable async for older Intel devices (#17369) | 2 months ago