cturan/llama.cpp @ 360d6533db39e11577afe9b0aece20c6b5ddaf1f

Diego Devesa 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)		4 months ago
..
ggml-blas	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 months ago
ggml-cann	c0389dba43 CANN: Disable acl_graph for prefill stage (#15933)	4 months ago
ggml-cpu	24a6734daf ggml-cpu : add check for ARM MATMUL_INT8/i8mm support (#15922)	4 months ago
ggml-cuda	360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)	4 months ago
ggml-hip	29c8fbe4e0 HIP: bump requirement to rocm 6.1 (#15296)	5 months ago
ggml-metal	0f0a3c2851 metal : make the backend async (#15906)	4 months ago
ggml-musa	7a6e91ad26 CUDA: replace GGML_CUDA_F16 with CUDA arch checks (#15433)	5 months ago
ggml-opencl	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 months ago
ggml-rpc	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 months ago
ggml-sycl	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 months ago
ggml-vulkan	360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)	4 months ago
ggml-webgpu	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 months ago
ggml-zdnn	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 months ago
CMakeLists.txt	ff27f80a74 ggml: initial IBM zDNN backend (#14975)	5 months ago
ggml-alloc.c	fd1234cb46 llama : add gpt-oss (#15091)	5 months ago
ggml-backend-impl.h	360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)	4 months ago
ggml-backend-reg.cpp	360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)	4 months ago
ggml-backend.cpp	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 months ago
ggml-common.h	fd1234cb46 llama : add gpt-oss (#15091)	5 months ago
ggml-impl.h	fd1234cb46 llama : add gpt-oss (#15091)	5 months ago
ggml-opt.cpp	5cdb27e091 finetune: SGD optimizer, more CLI args (#13873)	5 months ago
ggml-quants.c	f44f793172 ggml-quants : fix make_qp_quants NANs and IQ1 assertion errors (#15379)	5 months ago
ggml-quants.h	fd1234cb46 llama : add gpt-oss (#15091)	5 months ago
ggml-threading.cpp	ae8de6d50a ggml : build backends as libraries (#10256)	1 year ago
ggml-threading.h	cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)	1 year ago
ggml.c	b0d52998b9 cuda : fix supports_op condition for get_rows when number of blocks is too large (#15868)	4 months ago
ggml.cpp	fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)	7 months ago
gguf.cpp	a81283820a gguf: gguf_writer refactor (#15691)	4 months ago