hksdpc255
|
636fc17a37
Fix Kimi-K2 tool-call parsing issues (#17376)
|
1 月之前 |
Jay Zenith
|
51e0c2d917
cuda : add FILL op support (#17851)
|
1 月之前 |
Xuan-Son Nguyen
|
37a4f63244
server : add development documentation (#17760)
|
1 月之前 |
Georgi Gerganov
|
2bc96931d2
server : make cache_reuse configurable per request (#17858)
|
1 月之前 |
wsbagnsv1
|
5814b4dce1
cuda: optimize SOLVE_TRI using registers and FMAF (#17703)
|
1 月之前 |
ixgbe
|
79d61896d3
ggml-cpu: add ggml_thread_cpu_relax with Zihintpause support (#17784)
|
1 月之前 |
Xuan-Son Nguyen
|
4d3726278b
model: add llama 4 scaling for mistral-large (deepseek arch) (#17744)
|
1 月之前 |
lovedheart
|
08f9d3cc1d
Vulkan: improve mul_mat_vec_iq1_m (#16907)
|
1 月之前 |
Sigbjørn Skjæret
|
0a540f9abd
ci : add windows-cuda 13.1 release (#17839)
|
1 月之前 |
Sigbjørn Skjæret
|
22577583a3
common : change --color to accept on/off/auto, default to auto (#17827)
|
1 月之前 |
Law Po Ying
|
d9e03db1e7
sycl: add missing BF16 conversion support for Intel oneAPI (#17780)
|
1 月之前 |
Jeff Bolz
|
db97837385
vulkan: perf_logger improvements (#17672)
|
1 月之前 |
Vishal Singh
|
017761daf5
ggml-zendnn : add ZenDNN backend for AMD CPUs (#17690)
|
1 月之前 |
Xuan-Son Nguyen
|
c42712b056
server: support multiple generations from one prompt (OAI "n" option) (#17775)
|
1 月之前 |
Phylliida Dev
|
09c7c50e64
ggml : add circular tiling support to pad, for Vulkan, CUDA, and CPU (used for making seamless textures) (#16985)
|
1 月之前 |
Johannes Gäßler
|
f334b79494
HIP: fix RDNA3 FP16/BF16 matrix multiplication (#17817)
|
1 月之前 |
Aleksander Grygier
|
a28e3c7567
webui: Stop generation from chat sidebar (#17806)
|
1 月之前 |
Aleksander Grygier
|
e31b5c55c3
webui: Fix context available value in Multi-model Router mode (#17804)
|
1 月之前 |
Aleksander Grygier
|
21f24f27a9
webui: Per-conversation system message with UI displaying, edition & branching (#17275)
|
1 月之前 |
Sky
|
7b43f55753
ggml : improve error handling for search path existence checks (#17653)
|
1 月之前 |
Daniel Bevenius
|
444f00b0ec
llama : remove quantization sanity check (#17788)
|
1 月之前 |
Jeff Bolz
|
2960eb2975
vulkan: Use one row per workgroup for f32 mmv (#17711)
|
1 月之前 |
Xuan-Son Nguyen
|
dbc15a7967
convert: support Mistral 3 Large MoE (#17730)
|
1 月之前 |
Jeff Bolz
|
c6c5e85979
vulkan: support solve_tri with larger N/K values (#17781)
|
1 月之前 |
Georgi Gerganov
|
8e5f4987b1
contrib : stale PRs (#17803)
|
1 月之前 |
Georgi Gerganov
|
8ce774a102
metal : fix build(#17799)
|
1 月之前 |
Masato Nakasaka
|
67788f6846
vulkan: Replace deprecated VK_EXT_validation_features (#17637)
|
1 月之前 |
Masato Nakasaka
|
d8c0a7b085
vulkan: Fix mismatch in TOPK_MOE unit test (#17541)
|
1 月之前 |
Jeff Bolz
|
933414c0b6
vulkan: add more num_blocks instantiations in rms_norm (#17701)
|
1 月之前 |
Jeff Bolz
|
a0f3897d53
vulkan: fix top_k bug when there are ties in the input (#17659)
|
1 月之前 |