Johannes Gäßler | 10d2af0eaa | llama/ggml: add LLM training support (#10544) | 9 months ago
Georgi Gerganov | 611aa914ef | metal : optimize MoE for large batches (#13388) | 9 months ago
Johannes Gäßler | 2356fb1d53 | CUDA: fix bad asserts for partial offload (#13337) | 9 months ago
SXX | 77d5e9a76a | ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107) | 9 months ago
Georgi Gerganov | 87616f0680 | ggml : fix trailing whitespaces (#0) | 9 months ago
Acly | c6e8cc28c1 | ggml : Depthwise 2D convolution (ggml/1152) | 9 months ago
Diego Devesa | fe92821ea9 | ggml : add bilinear upscale support (ggml/1185) | 10 months ago
Diego Devesa | 459895c326 | ggml : add more generic custom op, remove deprecated custom ops (ggml/1183) | 10 months ago
Diego Devesa | e0e912f49b | llama : add option to override model tensor buffers (#11397) | 10 months ago
Georgi Gerganov | b4ae50810e | metal : improve FA + improve MoE (#12612) | 10 months ago
Molly Sophia | 7dfad387e3 | llama: Add support for RWKV v7 architecture (#12412) | 10 months ago
vmobilis | d6ae2fa061 | ggml : ggml_compute_forward_concat() for arbitrary tensor type (ggml/1118) | 11 months ago
mgroeber9110 | 5bbe6a9fe9 | ggml : portability fixes for VS 2017 (#12150) | 11 months ago
Aaron Teo | af7747c95a | ggml-cpu: Support s390x SIMD Instruction Set (#12019) | 11 months ago
Maxim Evtush | 7b891bdc86 | fix: typos in documentation files (#11791) | 1 year ago
William Tambellini | 1a0e87d291 | ggml : add option to not print stack on abort (ggml/1081) | 1 year ago
Johannes Gäßler | 8137b4bb2b | CPU/CUDA: fix (GQA) mul mat back, add CUDA support (#11380) | 1 year ago
Johannes Gäßler | 9c8dcefe17 | CUDA: backwards pass for misc. ops, add tests (#11257) | 1 year ago
Johannes Gäßler | 432df2d5f9 | RoPE: fix back, CUDA support for back + noncont. (#11240) | 1 year ago
Molly Sophia | ee7136c6d1 | llama: add support for QRWKV6 model architecture (#11001) | 1 year ago
Johannes Gäßler | 53ff6b9b9f | GGUF: C++ refactor, backend support, misc fixes (#11030) | 1 year ago
Georgi Gerganov | 0bf2d10c55 | tts : add OuteTTS support (#10784) | 1 year ago
Johannes Gäßler | 081b29bd2a | tests: add tests for GGUF (#10830) | 1 year ago
Daniel Bevenius | 3919da8e33 | ggml : add check for grad_accs (ggml/1046) | 1 year ago
HimariO | ba1cb19cdd | llama : add Qwen2VL support + multimodal RoPE (#10361) | 1 year ago
Djip007 | 19d8762ab6 | ggml : refactor online repacking (#10446) | 1 year ago
PAB | c2082d93a8 | ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034) | 1 year ago
Shupei Fan | c202cef168 | ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541) | 1 year ago
Diego Devesa | 5931c1f233 | ggml : add support for dynamic loading of backends (#10469) | 1 year ago
Diego Devesa | a5e47592b6 | cuda : optimize argmax (#10441) | 1 year ago