| .. |
|
ggml-blas
|
5931c1f233
ggml : add support for dynamic loading of backends (#10469)
|
1 год назад |
|
ggml-cann
|
7a395f67a7
CANN: Add support for async operator submission (#12864)
|
9 месяцев назад |
|
ggml-cpu
|
13b0a04597
whisper: remove MSVC warnings pragmas (whisper/3090)
|
8 месяцев назад |
|
ggml-cuda
|
d8919424f1
CUDA: fix FlashAttention on Turing (#13415)
|
8 месяцев назад |
|
ggml-hip
|
84778e9770
CUDA/HIP: Share the same unified memory allocation logic. (#12934)
|
9 месяцев назад |
|
ggml-kompute
|
ba1cb19cdd
llama : add Qwen2VL support + multimodal RoPE (#10361)
|
1 год назад |
|
ggml-metal
|
611aa914ef
metal : optimize MoE for large batches (#13388)
|
8 месяцев назад |
|
ggml-musa
|
b1b132efcb
cuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394)
|
10 месяцев назад |
|
ggml-opencl
|
12b17501e6
opencl: fix incorrect local_size index in profiling log (#12868)
|
9 месяцев назад |
|
ggml-rpc
|
b486ba05bf
rpc : add rpc_msg_set_tensor_hash_req (#13353)
|
8 месяцев назад |
|
ggml-sycl
|
17512a94d6
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs (#12858)
|
8 месяцев назад |
|
ggml-vulkan
|
dc1d2adfc0
vulkan: scalar flash attention implementation (#13324)
|
8 месяцев назад |
|
CMakeLists.txt
|
bba9d945c1
cmake : removed stdc++fs (whisper/3097)
|
8 месяцев назад |
|
ggml-alloc.c
|
f057808ffa
ggml: Don't assert fail when tensor data changes (#13222)
|
8 месяцев назад |
|
ggml-backend-impl.h
|
70680c48e5
ggml : upgrade init_tensor API to return a ggml_status (#11854)
|
10 месяцев назад |
|
ggml-backend-reg.cpp
|
ba7654380a
ggml-backend : fix backend search path (#12330)
|
10 месяцев назад |
|
ggml-backend.cpp
|
9070365020
CUDA: fix logic for clearing padding with -ngl 0 (#13320)
|
8 месяцев назад |
|
ggml-common.h
|
492d7f1ff7
musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNINGS=ON` in ci and update doc (#12611)
|
9 месяцев назад |
|
ggml-impl.h
|
cb79c2e7fa
ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187)
|
9 месяцев назад |
|
ggml-opt.cpp
|
02e4eaf22f
ggml-opt: fix data corruption (ggml/1022)
|
1 год назад |
|
ggml-quants.c
|
13b0a04597
whisper: remove MSVC warnings pragmas (whisper/3090)
|
8 месяцев назад |
|
ggml-quants.h
|
ae8de6d50a
ggml : build backends as libraries (#10256)
|
1 год назад |
|
ggml-threading.cpp
|
ae8de6d50a
ggml : build backends as libraries (#10256)
|
1 год назад |
|
ggml-threading.h
|
cb13ef85a4
remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)
|
1 год назад |
|
ggml.c
|
611aa914ef
metal : optimize MoE for large batches (#13388)
|
8 месяцев назад |
|
gguf.cpp
|
a6f32f0b34
Fix clang warning in gguf_check_reserved_keys (#12686)
|
9 месяцев назад |