cturan/llama.cpp

mirror de https://github.com/cturan/llama.cpp

Autor	SHA1 Mensagem	Data
compilade	90083283ec imatrix : use GGUF to store importance matrices (#9400)	há 6 meses atrás
Ed Addario	fa4a9f2a1c quantize : handle user-defined pruning of whole layers (blocks) (#13037)	há 7 meses atrás
Ed Addario	e5c834f718 quantize : improve tensor-type pattern matching (#13033)	há 8 meses atrás
Diego Devesa	1d36b3670b llama : move end-user examples to tools directory (#13249)	há 8 meses atrás