cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
zrm	b853d45601 ggml : add NUMA support (#1556)	%!s(int64=2) %!d(string=hai) anos
kiltyj	9d0693bce3 metal : use shared buffers between CPU and GPU (#1696)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	affc76edfd cuda : loading models directly into VRAM, norm calculation on GPU, broadcasting for ggml_mul (#1483)	%!s(int64=2) %!d(string=hai) anos
Maxime	503db28849 llama : fix name shadowing and C4146 (#1526)	%!s(int64=2) %!d(string=hai) anos
Ivan Stepanov	34d9f22f44 Wrap exceptions in std::exception to verbose output on exception. (#1316)	%!s(int64=2) %!d(string=hai) anos
xloem	ea3a0ad6b6 llama : update stubs for systems without mmap and mlock (#1266)	%!s(int64=2) %!d(string=hai) anos
slaren	b925f1f1b0 cuBLAS: fall back to pageable memory if pinned alloc fails (#1233)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	84ca9c2ecf examples : fix save-load-state + rename llama-util.h	%!s(int64=2) %!d(string=hai) anos