cturan/llama.cpp @ 68ee98ae181a5c83a5cc6261daeee69a1f588c15

Prajwal B Mehendarkar	6d69ab3f26	cmake : Dont define XOPENSOURCE on AIX (#16481)	3 months ago
ggml-blas	4d3d455d3c sync : whisper.cpp (ggml/1359)	3 months ago
ggml-cann	aa4711d369 CANN: Improve ACL graph matching (#16166)	3 months ago
ggml-cpu	1deee0f8d4 cpu : optimize the ggml NORM operation (#15953)	3 months ago
ggml-cuda	9d0882840e Disable CUDA host buffers on integrated GPUs (#16308)	3 months ago
ggml-hip	e95fec640f HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0 (#16221)	3 months ago
ggml-metal	b2c08c9ec4 metal : mark FA blocks (#16372)	3 months ago
ggml-musa	91a2a56556 musa: update compile flags (#16265)	3 months ago
ggml-opencl	7c156df414 opencl: support pad_ext (#15888)	3 months ago
ggml-rpc	f39283960b rpc : check src buffer when copying tensor (#16421)	3 months ago
ggml-sycl	b260213755 [SYCL] refactor soft_max, add soft_max_back (#16472)	3 months ago
ggml-vulkan	86df2c9ae4 vulkan: use a more appropriate amount of threads when generating shaders (#16418)	3 months ago
ggml-webgpu	74b8fc17f9 ggml webgpu: profiling, CI updates, reworking of command submission (#16452)	3 months ago
ggml-zdnn	264f1b5187 zdnn: refactor codebase + add docs (#16178)	3 months ago
CMakeLists.txt	6d69ab3f26 cmake : Dont define XOPENSOURCE on AIX (#16481)	3 months ago
ggml-alloc.c	638d330246 ggml : fix graph reallocation with multiple chunks (#16396)	3 months ago
ggml-backend-impl.h	898acba681 rpc : add support for multiple devices (#16276)	3 months ago
ggml-backend-reg.cpp	02463ab27b ggml-backend : add root cause in error message if loading backend library fails (#16172)	3 months ago
ggml-backend.cpp	e789095502 llama: print memory breakdown on exit (#15860)	3 months ago
ggml-common.h	fd1234cb46 llama : add gpt-oss (#15091)	5 months ago
ggml-impl.h	34fcc5a4ac model : Apertus model implementation (#15852)	3 months ago
ggml-opt.cpp	5cdb27e091 finetune: SGD optimizer, more CLI args (#13873)	5 months ago
ggml-quants.c	f6b4af3d04 ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (#15928)	3 months ago
ggml-quants.h	fd1234cb46 llama : add gpt-oss (#15091)	5 months ago
ggml-threading.cpp	ae8de6d50a ggml : build backends as libraries (#10256)	1 year ago
ggml-threading.h	cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)	1 year ago
ggml.c	ef07a40906 ggml webgpu: add support for soft_max, optimize rms_norm (#16357)	3 months ago
ggml.cpp	fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)	7 months ago
gguf.cpp	a81283820a gguf: gguf_writer refactor (#15691)	4 months ago