cturan/llama.cpp @ b2d80e105a59b54822edf7ce7f3ed5f317e96e21

réplica de https://github.com/cturan/llama.cpp

Georgi Gerganov bcc0eb4591 llama : per-layer KV cache + quantum K cache (#4309)		%!s(int64=2) %!d(string=hai) anos
..
CMakeLists.txt	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	%!s(int64=2) %!d(string=hai) anos
quantize-stats.cpp	bcc0eb4591 llama : per-layer KV cache + quantum K cache (#4309)	%!s(int64=2) %!d(string=hai) anos