| .. |
|
ggml-blas
|
cd6983d56d
ggml : fix field name when new ggml_backend (#14944)
|
5 months ago |
|
ggml-cann
|
c1c354e44c
CANN: Refactor ND to NZ workspace to be per-device (#15763)
|
4 months ago |
|
ggml-cpu
|
c4df49a42d
kleidiai: generalize compute_forward_kv_cache to compute_forward_fp16 (#15817)
|
4 months ago |
|
ggml-cuda
|
5143fa895e
CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802)
|
4 months ago |
|
ggml-hip
|
29c8fbe4e0
HIP: bump requirement to rocm 6.1 (#15296)
|
5 months ago |
|
ggml-metal
|
856ed0947f
metal : Add template specialization for mul_mm_id w/ ne20 == 10 (#15799)
|
4 months ago |
|
ggml-musa
|
7a6e91ad26
CUDA: replace GGML_CUDA_F16 with CUDA arch checks (#15433)
|
5 months ago |
|
ggml-opencl
|
0a1b3982cd
ggml: add ops for WAN video model (cuda && cpu) (#15669)
|
4 months ago |
|
ggml-rpc
|
e71d48e326
ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others) (#15188)
|
5 months ago |
|
ggml-sycl
|
0a1b3982cd
ggml: add ops for WAN video model (cuda && cpu) (#15669)
|
4 months ago |
|
ggml-vulkan
|
0a1b3982cd
ggml: add ops for WAN video model (cuda && cpu) (#15669)
|
4 months ago |
|
ggml-webgpu
|
77dee9de97
ggml : WebGPU add TRANSPOSE and RESHAPE to supported ops (#15695)
|
4 months ago |
|
ggml-zdnn
|
ff27f80a74
ggml: initial IBM zDNN backend (#14975)
|
5 months ago |
|
CMakeLists.txt
|
ff27f80a74
ggml: initial IBM zDNN backend (#14975)
|
5 months ago |
|
ggml-alloc.c
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-backend-impl.h
|
70680c48e5
ggml : upgrade init_tensor API to return a ggml_status (#11854)
|
10 months ago |
|
ggml-backend-reg.cpp
|
ff27f80a74
ggml: initial IBM zDNN backend (#14975)
|
5 months ago |
|
ggml-backend.cpp
|
5d804a4938
ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722)
|
4 months ago |
|
ggml-common.h
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-impl.h
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-opt.cpp
|
5cdb27e091
finetune: SGD optimizer, more CLI args (#13873)
|
5 months ago |
|
ggml-quants.c
|
f44f793172
ggml-quants : fix make_qp_quants NANs and IQ1 assertion errors (#15379)
|
5 months ago |
|
ggml-quants.h
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-threading.cpp
|
ae8de6d50a
ggml : build backends as libraries (#10256)
|
1 year ago |
|
ggml-threading.h
|
cb13ef85a4
remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)
|
1 year ago |
|
ggml.c
|
0a1b3982cd
ggml: add ops for WAN video model (cuda && cpu) (#15669)
|
4 months ago |
|
ggml.cpp
|
fedf034a98
ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)
|
7 months ago |
|
gguf.cpp
|
a81283820a
gguf: gguf_writer refactor (#15691)
|
4 months ago |