Sigbjørn Skjæret
|
8800226d65
Fix --split-max-size (#6655)
|
1 year ago |
Georgi Gerganov
|
cfde806eb9
ci : fix BGE wget (#6383)
|
1 year ago |
slaren
|
280345968d
cuda : rename build flag to LLAMA_CUDA (#6299)
|
1 year ago |
Neo Zhang Jianyu
|
8ced9f7e32
add wait() to make code stable (#5895)
|
1 year ago |
Georgi Gerganov
|
87c91c0766
ci : reduce 3b ppl chunks to 1 to avoid timeout (#5771)
|
1 year ago |
Georgi Gerganov
|
b1de96824b
ci : fix wikitext url + compile warnings (#5569)
|
1 year ago |
Ananta Bastola
|
6e4e973b26
ci : add an option to fail on compile warning (#3952)
|
1 year ago |
Georgi Gerganov
|
594845aab1
ci : fix BERT model download and convert
|
1 year ago |
Georgi Gerganov
|
49cc1f7d67
bert : add tests + fix quantization (#5475)
|
1 year ago |
Abhilash Majumder
|
0f648573dd
ggml : add unified SYCL backend for Intel GPUs (#2690)
|
2 years ago |
crasm
|
413e7b0559
ci : add model tests + script wrapper (#4586)
|
2 years ago |
Georgi Gerganov
|
38566680cd
ggml : add IQ2 to test-backend-ops + refactoring (#4990)
|
2 years ago |
Georgi Gerganov
|
ba69bbc84c
imatrix : offload to GPU support (#4957)
|
2 years ago |
Georgi Gerganov
|
c918fe8dca
metal : create autorelease pool during library build (#4970)
|
2 years ago |
Georgi Gerganov
|
58ba655af0
metal : enable shader debugging (cmake option) (#4705)
|
2 years ago |
Georgi Gerganov
|
1142013da4
save-load-state : fix example + add ci test (#3655)
|
2 years ago |
Georgi Gerganov
|
1a8c8795d6
ci : check if there is enough VRAM (#3596)
|
2 years ago |
slaren
|
789c8c945a
ci : add LoRA test to CI (#2650)
|
2 years ago |
Georgi Gerganov
|
5439a0ab57
ci : pip install gguf in editable mode (#2782)
|
2 years ago |
Cebtenzzre
|
7c2227a197
chmod : make scripts executable (#2675)
|
2 years ago |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 years ago |
Georgi Gerganov
|
dd6c67d3cb
ci : fix args
|
2 years ago |
Georgi Gerganov
|
5d500e8ccf
ci : add 7B CUDA tests (#2319)
|
2 years ago |
Georgi Gerganov
|
4c013bb738
ci : fix MNT realpath usage (#2250)
|
2 years ago |
Georgi Gerganov
|
d01bccde9f
ci : integrate with ggml-org/ci (#2250)
|
2 years ago |