cturan/llama.cpp

réplica de https://github.com/cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
compilade	90083283ec imatrix : use GGUF to store importance matrices (#9400)	hai 6 meses
Ed Addario	fa4a9f2a1c quantize : handle user-defined pruning of whole layers (blocks) (#13037)	hai 6 meses
Ed Addario	e5c834f718 quantize : improve tensor-type pattern matching (#13033)	hai 8 meses
Diego Devesa	1d36b3670b llama : move end-user examples to tools directory (#13249)	hai 8 meses