| amx | 69ffd89163 ggml-amx : fix ggml_amx_init() on generic Linux (#16049) | 4 months ago |
| arch | 85e72271ba ggml-cpu : fix typo in gemm comments [no ci] (#16189) | 4 months ago |
| cmake | ae8de6d50a ggml : build backends as libraries (#10256) | 1 year ago |
| kleidiai | 2b3efea9a4 kleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed (#15614) | 4 months ago |
| llamafile | a6a58d6478 llamafile: PowerPC Sgemm Optimization (#15558) | 5 months ago |
| CMakeLists.txt | 24a6734daf ggml-cpu : add check for ARM MATMUL_INT8/i8mm support (#15922) | 4 months ago |
| arch-fallback.h | ad5c975c2d ggml-cpu: Support Q5_0 and Q5_1 on s390x (#15486) | 5 months ago |
| binary-ops.cpp | a62d7fa7a9 cpu: de-duplicate some of the operators and refactor (ggml/1144) | 10 months ago |
| binary-ops.h | a62d7fa7a9 cpu: de-duplicate some of the operators and refactor (ggml/1144) | 10 months ago |
| common.h | 0dd58b6877 ggml : refactor forward_dup for cpu backend (#16062) | 4 months ago |
| ggml-cpu-impl.h | d36e61c580 ggml-cpu: clean up s390x SIMD (#15855) | 4 months ago |
| ggml-cpu.c | 4e29084ba4 ggml-cpu: Respect cpumask settings (#16164) | 4 months ago |
| ggml-cpu.cpp | c0b45097c3 rename optimize_graph to graph_optimize (#16082) | 4 months ago |
| hbm.cpp | f470bc36be ggml-cpu : split arch-specific implementations (#13892) | 7 months ago |
| hbm.h | f470bc36be ggml-cpu : split arch-specific implementations (#13892) | 7 months ago |
| ops.cpp | 3ecb2f671a ggml : implement set_rows with i32 index (#16159) | 4 months ago |
| ops.h | 0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669) | 4 months ago |
| quants.c | fd1234cb46 llama : add gpt-oss (#15091) | 5 months ago |
| quants.h | fd1234cb46 llama : add gpt-oss (#15091) | 5 months ago |
| repack.cpp | 00f35d509e ggml : repack block_iq4_nlx8 (#14904) | 5 months ago |
| repack.h | 00f35d509e ggml : repack block_iq4_nlx8 (#14904) | 5 months ago |
| simd-mappings.h | 186415d595 ggml-cpu: drop support for nnpa intrinsics (#15821) | 4 months ago |
| traits.cpp | 0d8831543c ggml : fix fallback to CPU for ununsupported ops (#15118) | 5 months ago |
| traits.h | 0d8831543c ggml : fix fallback to CPU for ununsupported ops (#15118) | 5 months ago |
| unary-ops.cpp | a62d7fa7a9 cpu: de-duplicate some of the operators and refactor (ggml/1144) | 10 months ago |
| unary-ops.h | a62d7fa7a9 cpu: de-duplicate some of the operators and refactor (ggml/1144) | 10 months ago |
| vec.cpp | 05c0380f2a ggml-cpu : optimize RVV kernels (#15720) | 4 months ago |
| vec.h | 05c0380f2a ggml-cpu : optimize RVV kernels (#15720) | 4 months ago |