cturan/llama.cpp

mirror of https://github.com/cturan/llama.cpp

Author	SHA1	Message	Date
Kawrakow	44ca159faf	1.5 bit: we can do even better (#5999)	1 year ago
Kawrakow	be858f6205	Better 1.5 bit quantization (#5971)	1 year ago
Georgi Gerganov	bf47a5eefc	ggml : remove __constant__ specifier for CUDA tables (#5940)	1 year ago
Georgi Gerganov	8a3012a4ad	ggml : add ggml-common.h to deduplicate shared code (#5940)	1 year ago