R0CKSTAR
|
bb115d2bf7
musa: override warp_size of musa device to 32 (#12445)
|
10 mesiacov pred |
Xuan-Son Nguyen
|
29fff308c7
llama : support converting Mistral Small text-only (#12450)
|
10 mesiacov pred |
Georgi Gerganov
|
c6af2161b2
speculative : fix seg fault in certain cases (#12454)
|
10 mesiacov pred |
Xuan-Son Nguyen
|
99aa304fb9
llama : add support for EXAONE tied word embeddings (#12451)
|
10 mesiacov pred |
Georgi Gerganov
|
8551c44d84
context : always use non-causal attention for encoder graphs (#12447)
|
10 mesiacov pred |
Łukasz Ślusarczyk
|
35cae5ba05
SYCL: using graphs is configurable by environment variable and compile option (#12371)
|
10 mesiacov pred |
Georgi Gerganov
|
810e0af3f5
server : fix warmup draft cache type (#12446)
|
10 mesiacov pred |
Prajwal B Mehendarkar
|
eba92d64c3
cmake : fix PowerPC build (#12241)
|
10 mesiacov pred |
fj-y-saito
|
d9a14523bb
ggml : add SVE support for q6_K_q8_K (#12361)
|
10 mesiacov pred |
0cc4m
|
fd123cfead
Vulkan: Default to 1GB allocations instead of 4GB to avoid fragmentation and driver issues (#12434)
|
10 mesiacov pred |
Łukasz Ślusarczyk
|
a53f7f7b88
fixed compilation warnings in ggml-sycl (#12424)
|
10 mesiacov pred |
Molly Sophia
|
7dfad387e3
llama: Add support for RWKV v7 architecture (#12412)
|
10 mesiacov pred |
Sigbjørn Skjæret
|
60c902926c
docs : bring llama-cli conversation/template docs up-to-date (#12426)
|
10 mesiacov pred |
Gaurav Garg
|
b1b132efcb
cuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394)
|
10 mesiacov pred |
Guus Waals
|
01e8f2138b
ggml-vulkan: remove unused find_program(glslc) (#12416)
|
10 mesiacov pred |
Jeff Bolz
|
484a8ab513
vulkan: Add N/2 and N/4 optimized paths in coopmat2 shader (#12312)
|
10 mesiacov pred |
Daniele
|
cf2270e4d3
vulkan: subgroup size tuning (#12087)
|
10 mesiacov pred |
Jeff Bolz
|
f07690c930
vulkan: use fp32 in coopmat2 q4_k dequant function (#12309)
|
10 mesiacov pred |
Jeff Bolz
|
891c63956d
vulkan: Pad N dimension of B matrix for coopmat2 perf, to avoid bounds checking (#12273)
|
10 mesiacov pred |
Jeff Bolz
|
2f21123c1d
vulkan: Adjust coopmat2 tile sizes and selection heuristic (#12258)
|
10 mesiacov pred |
Christian Kastner
|
374101fd74
cmake : enable building llama.cpp using system libggml (#12321)
|
10 mesiacov pred |
Akarshan Biswas
|
b3c9a65673
SYCL: set extras only on GGML_TYPE_Q4_0 (#12366)
|
10 mesiacov pred |
Sigbjørn Skjæret
|
8ba95dca20
llama : fix OLMo-2-0325-32B-Instruct K-norm size (#12400)
|
10 mesiacov pred |
Georgi Gerganov
|
dc079cfdff
context : fix init of n_outputs (#12397)
|
10 mesiacov pred |
Daniel Bevenius
|
7b61bcc87c
ci : add --symlinks to xcframework zip command (#12409)
|
10 mesiacov pred |
marcoStocchi
|
f4c3dd5daa
llama-tts : add '-o' option (#12398)
|
10 mesiacov pred |
aubreyli
|
3d35d87b41
SYCL: Delete redundant plus sign and space (#12391)
|
10 mesiacov pred |
fairydreaming
|
b19bd064c0
SYCL : support non-contiguous tensors in binary ops (add, sub, etc) (#12399)
|
10 mesiacov pred |
Chenguang Li
|
92a391327e
[CANN]MUL_MAT optimization (#12382)
|
10 mesiacov pred |
Eric Curtin
|
9f2250ba72
Add CLI arg to llama-run to adjust the number of threads used (#12370)
|
10 mesiacov pred |