cturan/llama.cpp

Author	SHA1 Message	Date
Johannes Gäßler	02e4eaf22f ggml-opt: fix data corruption (ggml/1022)	1 year ago
Johannes Gäßler	8a43e940ab ggml: new optimization interface (ggml/988)	1 year ago
Diego Devesa	ae8de6d50a ggml : build backends as libraries (#10256)	1 year ago
Diego Devesa	9f40989351 ggml : move CPU backend to a separate file (#10144)	1 year ago
Gilad S.	73afe681aa fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875)	1 year ago
bandoti	d6fe7abf04 ggml: unify backend logging mechanism (#9709)	1 year ago
slaren	23e0d70bac ggml : move common CPU backend impl to new header (#9509)	1 year ago
Georgi Gerganov	d6a04f872d ggml : hide ggml_object, ggml_cgraph, ggml_hash_set (#9408)	1 year ago
compilade	9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	1 year ago
jdomke	76614f352e ggml : reading the runtime sve config of the cpu (#8709)	1 year ago
Sigbjørn Skjæret	b72c20b85c Fix conversion of unnormalized BF16->BF16 weights (#7843)	1 year ago
slaren	2b1f616b20 ggml : reduce hash table reset cost (#8698)	1 year ago
Dibakar Gope	0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780)	1 year ago
Georgi Gerganov	f3f65429c4 llama : reorganize source code + improve CMake (#8006)	1 year ago