Max Krasnyansky
|
c273d75375
hexagon: various Op fixes (#17135)
|
2 months ago |
Eve
|
7d019cff74
disable rms norm mul rope for chips with no fp16 rte (#17134)
|
2 months ago |
sudhiarm
|
3fe36c3238
ci: add Arm-hosted Graviton4 runner (#17021)
|
2 months ago |
Xuan-Son Nguyen
|
1d45b4228f
vendor: split httplib to cpp/h files (#17150)
|
2 months ago |
ixgbe
|
ca4844062b
ggml-cpu : add RISC-V RVV (Zvfh) optimization for FP16 to FP32 conversion (#17161)
|
2 months ago |
duduta
|
73460f6278
ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (#16805)
|
2 months ago |
Charles Xu
|
8c583242ad
kleidiai: add optimized per-channel kernels for Q8_0 (#16993)
|
2 months ago |
Mike Abbott
|
4a5b8aff40
cmake : add version to all shared object files (#17091)
|
2 months ago |
Nicolas B. Pierron
|
d2d626938a
Install rpc-server when GGML_RPC is ON. (#17149)
|
2 months ago |
levkropp
|
2fc392ce35
convert : register UMT5Model architecture for T5 conversion (#17160)
|
2 months ago |
lhez
|
ece0f5c177
opencl: add fastdiv and use it in set_rows, ported from cuda (#17090)
|
2 months ago |
Sigbjørn Skjæret
|
7bef684118
models : move build_inp_out_ids outside loop (#17151)
|
2 months ago |
Max Krasnyansky
|
395e286bc9
cpu: skip NOPs to avoid barriers (#17133)
|
2 months ago |
Georgi Gerganov
|
13730c183b
metal : cap threadgroups size of set_rows (#17146)
|
2 months ago |
Adrien Gallouët
|
967eb4b2bf
ggml-cpu : inspect -march and -mcpu to found the CPU (#16333)
|
2 months ago |
Ruben Ortlam
|
f117be185e
vulkan: check glslc executable string (#17144)
|
2 months ago |
Ruben Ortlam
|
85234a4b3a
vulkan: fix validation issue introduced by #16868 (#17145)
|
2 months ago |
Gabe Goodhart
|
0c74f32632
memory: Hybrid context shift (#17009)
|
2 months ago |
Georgi Gerganov
|
c27efd2bd1
metal : enable tensor API for A19 (#17087)
|
2 months ago |
fj-y-saito
|
df70bedda7
arm64: add i8mm route with SVE ggml_vec_dot_q4_K_q8_K and ggml_vec_dot_q6_K_… (#15277)
|
2 months ago |
Georgi Gerganov
|
f914544b16
batched-bench : add "separate text gen" mode (#17103)
|
2 months ago |
Xuan-Son Nguyen
|
4b13a684c5
mtmd: fix patch_size initialized to random value in audio models (#17128)
|
2 months ago |
Georgi Gerganov
|
9898b57cbe
editorconfig : ignore benches/ (#17140)
|
2 months ago |
Acly
|
1032256ec9
cuda/vulkan : bicubic interpolation (#17022)
|
2 months ago |
Georgi Gerganov
|
15274c0c50
benches : add eval results (#17139)
|
2 months ago |
Georgi Gerganov
|
b8595b16e6
mtmd : fix embedding size for image input (#17123)
|
2 months ago |
Ruben Ortlam
|
392e09a608
vulkan: fix memory allocations (#17122)
|
2 months ago |
compilade
|
802cef44bf
convert : parse safetensors directly (#15667)
|
2 months ago |
compilade
|
1c07c0c68c
convert : handle compressed-tensors quant method (#17069)
|
2 months ago |
Georgi Gerganov
|
cb1adf8851
server : handle failures to restore host cache (#17078)
|
2 months ago |