Georgi Gerganov
|
947d3ad27d
ci : add GG_BUILD_EXTRA_TESTS_0 env (#7098)
|
1 éve |
Georgi Gerganov
|
9c67c2773d
ggml : add Flash Attention (#5021)
|
1 éve |
Georgi Gerganov
|
d2c898f746
ci : tmp disable gguf-split (#6983)
|
1 éve |
Georgi Gerganov
|
853d06ffe2
ci : tmp disable slow tests
|
1 éve |
Georgi Gerganov
|
aa750c1ede
tests : minor bash stuff (#6902)
|
1 éve |
Sigbjørn Skjæret
|
8800226d65
Fix --split-max-size (#6655)
|
1 éve |
Georgi Gerganov
|
cfde806eb9
ci : fix BGE wget (#6383)
|
1 éve |
slaren
|
280345968d
cuda : rename build flag to LLAMA_CUDA (#6299)
|
1 éve |
Neo Zhang Jianyu
|
8ced9f7e32
add wait() to make code stable (#5895)
|
1 éve |
Georgi Gerganov
|
87c91c0766
ci : reduce 3b ppl chunks to 1 to avoid timeout (#5771)
|
1 éve |
Georgi Gerganov
|
b1de96824b
ci : fix wikitext url + compile warnings (#5569)
|
1 éve |
Ananta Bastola
|
6e4e973b26
ci : add an option to fail on compile warning (#3952)
|
1 éve |
Georgi Gerganov
|
594845aab1
ci : fix BERT model download and convert
|
1 éve |
Georgi Gerganov
|
49cc1f7d67
bert : add tests + fix quantization (#5475)
|
1 éve |
Abhilash Majumder
|
0f648573dd
ggml : add unified SYCL backend for Intel GPUs (#2690)
|
2 éve |
crasm
|
413e7b0559
ci : add model tests + script wrapper (#4586)
|
2 éve |
Georgi Gerganov
|
38566680cd
ggml : add IQ2 to test-backend-ops + refactoring (#4990)
|
2 éve |
Georgi Gerganov
|
ba69bbc84c
imatrix : offload to GPU support (#4957)
|
2 éve |
Georgi Gerganov
|
c918fe8dca
metal : create autorelease pool during library build (#4970)
|
2 éve |
Georgi Gerganov
|
58ba655af0
metal : enable shader debugging (cmake option) (#4705)
|
2 éve |
Georgi Gerganov
|
1142013da4
save-load-state : fix example + add ci test (#3655)
|
2 éve |
Georgi Gerganov
|
1a8c8795d6
ci : check if there is enough VRAM (#3596)
|
2 éve |
slaren
|
789c8c945a
ci : add LoRA test to CI (#2650)
|
2 éve |
Georgi Gerganov
|
5439a0ab57
ci : pip install gguf in editable mode (#2782)
|
2 éve |
Cebtenzzre
|
7c2227a197
chmod : make scripts executable (#2675)
|
2 éve |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 éve |
Georgi Gerganov
|
dd6c67d3cb
ci : fix args
|
2 éve |
Georgi Gerganov
|
5d500e8ccf
ci : add 7B CUDA tests (#2319)
|
2 éve |
Georgi Gerganov
|
4c013bb738
ci : fix MNT realpath usage (#2250)
|
2 éve |
Georgi Gerganov
|
d01bccde9f
ci : integrate with ggml-org/ci (#2250)
|
2 éve |