cturan/llama.cpp @ 2f54e348ad2999c4e31b8777592247622b20420f

Johannes Gäßler 2356fb1d53 CUDA: fix bad asserts for partial offload (#13337)	8 months ago
ggml-blas	5931c1f233 ggml : add support for dynamic loading of backends (#10469)	1 year ago
ggml-cann	7a395f67a7 CANN: Add support for async operator submission (#12864)	9 months ago
ggml-cpu	9fdfcdaedd rpc : use backend registry, support dl backends (#13304)	8 months ago
ggml-cuda	2356fb1d53 CUDA: fix bad asserts for partial offload (#13337)	8 months ago
ggml-hip	84778e9770 CUDA/HIP: Share the same unified memory allocation logic. (#12934)	9 months ago
ggml-kompute	ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361)	1 year ago
ggml-metal	7604a7d6b8 metal : fix floating-point range of attention scores in FA kernels (#13090)	9 months ago
ggml-musa	b1b132efcb cuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394)	10 months ago
ggml-opencl	12b17501e6 opencl: fix incorrect local_size index in profiling log (#12868)	9 months ago
ggml-rpc	9fdfcdaedd rpc : use backend registry, support dl backends (#13304)	8 months ago
ggml-sycl	66645a5285 SYCL: Disable mul_mat kernels for noncontiguous tensor b (#13308)	8 months ago
ggml-vulkan	8ae5ebcf85 vulkan: Additional type support for unary, binary, and copy (#13266)	8 months ago
CMakeLists.txt	1d735c0b4f ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (#12871)	9 months ago
ggml-alloc.c	f057808ffa ggml: Don't assert fail when tensor data changes (#13222)	8 months ago
ggml-backend-impl.h	70680c48e5 ggml : upgrade init_tensor API to return a ggml_status (#11854)	10 months ago
ggml-backend-reg.cpp	ba7654380a ggml-backend : fix backend search path (#12330)	10 months ago
ggml-backend.cpp	9070365020 CUDA: fix logic for clearing padding with -ngl 0 (#13320)	8 months ago
ggml-common.h	492d7f1ff7 musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNINGS=ON` in ci and update doc (#12611)	9 months ago
ggml-impl.h	cb79c2e7fa ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187)	9 months ago
ggml-opt.cpp	02e4eaf22f ggml-opt: fix data corruption (ggml/1022)	1 year ago
ggml-quants.c	5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150)	10 months ago
ggml-quants.h	ae8de6d50a ggml : build backends as libraries (#10256)	1 year ago
ggml-threading.cpp	ae8de6d50a ggml : build backends as libraries (#10256)	1 year ago
ggml-threading.h	cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)	1 year ago
ggml.c	2356fb1d53 CUDA: fix bad asserts for partial offload (#13337)	8 months ago
gguf.cpp	a6f32f0b34 Fix clang warning in gguf_check_reserved_keys (#12686)	9 months ago