| .. |
|
ggml-amx
|
60ce97c9d8
add amx kernel for gemm (#8998)
|
1 year ago |
|
ggml-cann
|
904837e0cb
cann: fix crash when llama-bench is running on multiple cann devices (#9627)
|
1 year ago |
|
ggml-cuda
|
13dca2a54a
Vectorize load instructions in dmmv f16 CUDA kernel (#9816)
|
1 year ago |
|
ggml-sycl
|
1db8c84fc6
fix mul_mat_vec_q and *_vec_q error (#9939)
|
1 year ago |
|
kompute @ 4565194ed7
|
f3f65429c4
llama : reorganize source code + improve CMake (#8006)
|
1 year ago |
|
kompute-shaders
|
06943a69f6
ggml : move rope type enum to ggml.h (#8949)
|
1 year ago |
|
llamafile
|
23e0d70bac
ggml : move common CPU backend impl to new header (#9509)
|
1 year ago |
|
vulkan-shaders
|
544f409b4b
vulkan : argsort barriers must be under uniform control flow (ggml/951)
|
1 year ago |
|
CMakeLists.txt
|
60ce97c9d8
add amx kernel for gemm (#8998)
|
1 year ago |
|
ggml-aarch64.c
|
6a0f779484
ggml : add run-time detection of neon, i8mm and sve (#9331)
|
1 year ago |
|
ggml-aarch64.h
|
370b1f7e7a
ggml : minor naming changes (#8433)
|
1 year ago |
|
ggml-alloc.c
|
cd60b88bf7
ggml-alloc : remove buffer_id from leaf_alloc (ggml/987)
|
1 year ago |
|
ggml-amx.cpp
|
60ce97c9d8
add amx kernel for gemm (#8998)
|
1 year ago |
|
ggml-backend-impl.h
|
6374743747
ggml : add backend registry / device interfaces to BLAS backend (#9752)
|
1 year ago |
|
ggml-backend.cpp
|
87421a23e8
[SYCL] Add SYCL Backend registry, device and Event Interfaces (#9705)
|
1 year ago |
|
ggml-blas.cpp
|
96776405a1
ggml : move more prints to the ggml log system (#9839)
|
1 year ago |
|
ggml-cann.cpp
|
becfd387f6
[CANN] Fix cann compilation error (#9891)
|
1 year ago |
|
ggml-common.h
|
9bc6db28d0
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
1 year ago |
|
ggml-cpu-impl.h
|
23e0d70bac
ggml : move common CPU backend impl to new header (#9509)
|
1 year ago |
|
ggml-cuda.cu
|
96776405a1
ggml : move more prints to the ggml log system (#9839)
|
1 year ago |
|
ggml-impl.h
|
73afe681aa
fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875)
|
1 year ago |
|
ggml-kompute.cpp
|
c83ad6d01e
ggml-backend : add device and backend reg interfaces (#9707)
|
1 year ago |
|
ggml-metal.m
|
d5ac8cf2f2
ggml : add metal backend registry / device (#9713)
|
1 year ago |
|
ggml-metal.metal
|
bf9c1013ac
metal : use F32 prec for K*Q in vec FA (#9595)
|
1 year ago |
|
ggml-quants.c
|
6a0f779484
ggml : add run-time detection of neon, i8mm and sve (#9331)
|
1 year ago |
|
ggml-quants.h
|
6a0f779484
ggml : add run-time detection of neon, i8mm and sve (#9331)
|
1 year ago |
|
ggml-rpc.cpp
|
d5ebd79c76
rpc : pack only RPC structs (#9959)
|
1 year ago |
|
ggml-sycl.cpp
|
87421a23e8
[SYCL] Add SYCL Backend registry, device and Event Interfaces (#9705)
|
1 year ago |
|
ggml-vulkan.cpp
|
f010b77a37
vulkan : add backend registry / device interfaces (#9721)
|
1 year ago |
|
ggml.c
|
f594bc80ba
ggml : add asserts for type conversion in fattn kernels (#9971)
|
1 year ago |