cturan/llama.cpp

Author	SHA1 Message	Date
Johannes Gäßler	202084d31d tests: add gradient tests for all backends (ggml/932)	1 year ago
Johannes Gäßler	dbbebcab33 ggml: fix ggml_graph_cpy undefined behavior (ggml/943)	1 year ago
compilade	9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	1 year ago
Molly Sophia	8f1d81a0b6 llama : support RWKV v6 models (#8980)	1 year ago
Faisal Zaghloul	42c76d1358 Threadpool: take 2 (#8672)	1 year ago
Georgi Gerganov	231cff5f6f sync : ggml	1 year ago
Johannes Gäßler	e11bd856d5 CPU/CUDA: Gemma 2 FlashAttention support (#8542)	1 year ago
compilade	a1631e53f6 llama : simplify Mamba with advanced batch splits (#8526)	1 year ago
Daniel Bevenius	06943a69f6 ggml : move rope type enum to ggml.h (#8949)	1 year ago
Molly Sophia	2d5dd7bb3f ggml : add epsilon as a parameter for group_norm (#8818)	1 year ago
Daniel Bevenius	655858ace0 ggml : move c parameter comment to ggml_rope_ext (ggml/901)	1 year ago
Sigbjørn Skjæret	b72c20b85c Fix conversion of unnormalized BF16->BF16 weights (#7843)	1 year ago
slaren	2b1f616b20 ggml : reduce hash table reset cost (#8698)	1 year ago
Georgi Gerganov	eddcb5238b ggml : add and use ggml_cpu_has_llamafile() (#8664)	1 year ago
hipudding	1bdd8ae19f [CANN] Add Ascend NPU backend (#6035)	1 year ago
Georgi Gerganov	370b1f7e7a ggml : minor naming changes (#8433)	1 year ago
Dibakar Gope	0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780)	1 year ago
Georgi Gerganov	f3f65429c4 llama : reorganize source code + improve CMake (#8006)	1 year ago