cturan/llama.cpp

Author	SHA1 Message	Date
Johannes Gäßler	10d2af0eaa llama/ggml: add LLM training support (#10544)	9 months ago
Georgi Gerganov	611aa914ef metal : optimize MoE for large batches (#13388)	9 months ago
Johannes Gäßler	2356fb1d53 CUDA: fix bad asserts for partial offload (#13337)	9 months ago
SXX	77d5e9a76a ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107)	9 months ago
Georgi Gerganov	87616f0680 ggml : fix trailing whitespaces (#0)	9 months ago
Acly	c6e8cc28c1 ggml : Depthwise 2D convolution (ggml/1152)	9 months ago
Diego Devesa	fe92821ea9 ggml : add bilinear upscale support (ggml/1185)	10 months ago
Diego Devesa	459895c326 ggml : add more generic custom op, remove deprecated custom ops (ggml/1183)	10 months ago
Diego Devesa	e0e912f49b llama : add option to override model tensor buffers (#11397)	10 months ago
Georgi Gerganov	b4ae50810e metal : improve FA + improve MoE (#12612)	10 months ago
Molly Sophia	7dfad387e3 llama: Add support for RWKV v7 architecture (#12412)	10 months ago
vmobilis	d6ae2fa061 ggml : ggml_compute_forward_concat() for arbitrary tensor type (ggml/1118)	11 months ago
mgroeber9110	5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150)	11 months ago
Aaron Teo	af7747c95a ggml-cpu: Support s390x SIMD Instruction Set (#12019)	11 months ago
Maxim Evtush	7b891bdc86 fix: typos in documentation files (#11791)	1 year ago
William Tambellini	1a0e87d291 ggml : add option to not print stack on abort (ggml/1081)	1 year ago
Johannes Gäßler	8137b4bb2b CPU/CUDA: fix (GQA) mul mat back, add CUDA support (#11380)	1 year ago
Johannes Gäßler	9c8dcefe17 CUDA: backwards pass for misc. ops, add tests (#11257)	1 year ago
Johannes Gäßler	432df2d5f9 RoPE: fix back, CUDA support for back + noncont. (#11240)	1 year ago
Molly Sophia	ee7136c6d1 llama: add support for QRWKV6 model architecture (#11001)	1 year ago
Johannes Gäßler	53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030)	1 year ago
Georgi Gerganov	0bf2d10c55 tts : add OuteTTS support (#10784)	1 year ago
Johannes Gäßler	081b29bd2a tests: add tests for GGUF (#10830)	1 year ago
Daniel Bevenius	3919da8e33 ggml : add check for grad_accs (ggml/1046)	1 year ago
HimariO	ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361)	1 year ago
Djip007	19d8762ab6 ggml : refactor online repacking (#10446)	1 year ago
PAB	c2082d93a8 ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034)	1 year ago
Shupei Fan	c202cef168 ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541)	1 year ago
Diego Devesa	5931c1f233 ggml : add support for dynamic loading of backends (#10469)	1 year ago
Diego Devesa	a5e47592b6 cuda : optimize argmax (#10441)	1 year ago
