| .. |
|
ggml-blas
|
e68aa10d8f
vulkan: sort graph to allow more parallel execution (#15850)
|
4 months ago |
|
ggml-cann
|
62c3b645c5
CANN: Remove print (#16044)
|
4 months ago |
|
ggml-cpu
|
666fc0583d
Parity on delta!
|
3 months ago |
|
ggml-cuda
|
c959b676be
CUDA: fix FA occupancy, optimize tile kernel (#15982)
|
4 months ago |
|
ggml-hip
|
29c8fbe4e0
HIP: bump requirement to rocm 6.1 (#15296)
|
5 months ago |
|
ggml-metal
|
b213fce89b
metal : improve F32, F16 and BF16 mat-vec multiplication (#16057)
|
4 months ago |
|
ggml-musa
|
7a6e91ad26
CUDA: replace GGML_CUDA_F16 with CUDA arch checks (#15433)
|
5 months ago |
|
ggml-opencl
|
3913f8730e
ggml : fix padding in timestep embedding kernels (#15932)
|
4 months ago |
|
ggml-rpc
|
e68aa10d8f
vulkan: sort graph to allow more parallel execution (#15850)
|
4 months ago |
|
ggml-sycl
|
3913f8730e
ggml : fix padding in timestep embedding kernels (#15932)
|
4 months ago |
|
ggml-vulkan
|
cb5bb6cc05
vulkan: automatically remove unsupported devices (#15976)
|
4 months ago |
|
ggml-webgpu
|
d304f459d8
GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators (#16018)
|
4 months ago |
|
ggml-zdnn
|
6380d6a3e7
ggml-zdnn: rm user mapped buffers (#15965)
|
4 months ago |
|
CMakeLists.txt
|
43eb7a7757
Now that eval's running move delta net stuff back to llama-model, add cbs
|
4 months ago |
|
ggml-alloc.c
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-backend-impl.h
|
360d6533db
ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)
|
4 months ago |
|
ggml-backend-reg.cpp
|
360d6533db
ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)
|
4 months ago |
|
ggml-backend.cpp
|
e68aa10d8f
vulkan: sort graph to allow more parallel execution (#15850)
|
4 months ago |
|
ggml-common.h
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-impl.h
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-opt.cpp
|
5cdb27e091
finetune: SGD optimizer, more CLI args (#13873)
|
5 months ago |
|
ggml-quants.c
|
f44f793172
ggml-quants : fix make_qp_quants NANs and IQ1 assertion errors (#15379)
|
5 months ago |
|
ggml-quants.h
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
ggml-threading.cpp
|
ae8de6d50a
ggml : build backends as libraries (#10256)
|
1 year ago |
|
ggml-threading.h
|
cb13ef85a4
remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)
|
1 year ago |
|
ggml.c
|
c87e8d550c
Tensor preparation for delta_net complete
|
3 months ago |
|
ggml.cpp
|
fedf034a98
ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)
|
7 months ago |
|
gguf.cpp
|
a81283820a
gguf: gguf_writer refactor (#15691)
|
4 months ago |