|
ggml-amx
|
60ce97c9d8
add amx kernel for gemm (#8998)
|
1 year ago |
|
ggml-cann
|
904837e0cb
cann: fix crash when llama-bench is running on multiple cann devices (#9627)
|
1 year ago |
|
ggml-cuda
|
8c60a8a462
increase cuda_cpy block size (ggml/996)
|
1 year ago |
|
ggml-sycl
|
1db8c84fc6
fix mul_mat_vec_q and *_vec_q error (#9939)
|
1 year ago |
|
kompute @ 4565194ed7
|
f3f65429c4
llama : reorganize source code + improve CMake (#8006)
|
1 year ago |
|
kompute-shaders
|
06943a69f6
ggml : move rope type enum to ggml.h (#8949)
|
1 year ago |
|
llamafile
|
2f8bd2b901
llamafile : extend sgemm.cpp support for Q5_0 models (#10010)
|
1 year ago |
|
vulkan-shaders
|
544f409b4b
vulkan : argsort barriers must be under uniform control flow (ggml/951)
|
1 year ago |
|
CMakeLists.txt
|
60ce97c9d8
add amx kernel for gemm (#8998)
|
1 year ago |
|
ggml-aarch64.c
|
6a0f779484
ggml : add run-time detection of neon, i8mm and sve (#9331)
|
1 year ago |
|
ggml-aarch64.h
|
370b1f7e7a
ggml : minor naming changes (#8433)
|
1 year ago |
|
ggml-alloc.c
|
cd60b88bf7
ggml-alloc : remove buffer_id from leaf_alloc (ggml/987)
|
1 year ago |
|
ggml-amx.cpp
|
60ce97c9d8
add amx kernel for gemm (#8998)
|
1 year ago |
|
ggml-backend-impl.h
|
6374743747
ggml : add backend registry / device interfaces to BLAS backend (#9752)
|
1 year ago |
|
ggml-backend.cpp
|
6b8447352d
[CANN] Adapt to dynamically loadable backends mechanism (#9970)
|
1 year ago |
|
ggml-blas.cpp
|
96776405a1
ggml : move more prints to the ggml log system (#9839)
|
1 year ago |
|
ggml-cann.cpp
|
6b8447352d
[CANN] Adapt to dynamically loadable backends mechanism (#9970)
|
1 year ago |
|
ggml-common.h
|
9bc6db28d0
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
1 year ago |
|
ggml-cpu-impl.h
|
23e0d70bac
ggml : move common CPU backend impl to new header (#9509)
|
1 year ago |
|
ggml-cuda.cu
|
524afeec9d
musa: workaround for Guilty Lockup in cleaning src0 (#10042)
|
1 year ago |
|
ggml-impl.h
|
73afe681aa
fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875)
|
1 year ago |
|
ggml-kompute.cpp
|
c83ad6d01e
ggml-backend : add device and backend reg interfaces (#9707)
|
1 year ago |
|
ggml-metal.m
|
668750357e
metal : support permuted matrix multiplicaions (#10033)
|
1 year ago |
|
ggml-metal.metal
|
668750357e
metal : support permuted matrix multiplicaions (#10033)
|
1 year ago |
|
ggml-quants.c
|
6a0f779484
ggml : add run-time detection of neon, i8mm and sve (#9331)
|
1 year ago |
|
ggml-quants.h
|
6a0f779484
ggml : add run-time detection of neon, i8mm and sve (#9331)
|
1 year ago |
|
ggml-rpc.cpp
|
d5ebd79c76
rpc : pack only RPC structs (#9959)
|
1 year ago |
|
ggml-sycl.cpp
|
87421a23e8
[SYCL] Add SYCL Backend registry, device and Event Interfaces (#9705)
|
1 year ago |
|
ggml-vulkan.cpp
|
f010b77a37
vulkan : add backend registry / device interfaces (#9721)
|
1 year ago |
|
ggml.c
|
c39665f589
CUDA: fix MMQ for non-contiguous src0, add tests (#10021)
|
1 year ago |