Georgi Gerganov
|
e235b267a2
py : switch to snake_case (#8305)
|
1 سال پیش |
ditsuke
|
821922916f
fix: Update script paths in CI scripts
|
1 سال پیش |
Georgi Gerganov
|
f3f65429c4
llama : reorganize source code + improve CMake (#8006)
|
1 سال پیش |
Olivier Chafik
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 سال پیش |
Galunid
|
9c4c9cc83f
Move convert.py to examples/convert-legacy-llama.py (#7430)
|
1 سال پیش |
Georgi Gerganov
|
55ac3b7aea
ci : use Pythia models instead of OpenLlama (#7470)
|
1 سال پیش |
Georgi Gerganov
|
e84b71c2c6
ggml : drop support for QK_K=64 (#7473)
|
1 سال پیش |
slaren
|
b228aba91a
remove convert-lora-to-ggml.py (#7204)
|
1 سال پیش |
Georgi Gerganov
|
947d3ad27d
ci : add GG_BUILD_EXTRA_TESTS_0 env (#7098)
|
1 سال پیش |
Georgi Gerganov
|
9c67c2773d
ggml : add Flash Attention (#5021)
|
1 سال پیش |
Georgi Gerganov
|
d2c898f746
ci : tmp disable gguf-split (#6983)
|
1 سال پیش |
Georgi Gerganov
|
853d06ffe2
ci : tmp disable slow tests
|
1 سال پیش |
Georgi Gerganov
|
aa750c1ede
tests : minor bash stuff (#6902)
|
1 سال پیش |
Sigbjørn Skjæret
|
8800226d65
Fix --split-max-size (#6655)
|
1 سال پیش |
Georgi Gerganov
|
cfde806eb9
ci : fix BGE wget (#6383)
|
1 سال پیش |
slaren
|
280345968d
cuda : rename build flag to LLAMA_CUDA (#6299)
|
1 سال پیش |
Neo Zhang Jianyu
|
8ced9f7e32
add wait() to make code stable (#5895)
|
1 سال پیش |
Georgi Gerganov
|
87c91c0766
ci : reduce 3b ppl chunks to 1 to avoid timeout (#5771)
|
1 سال پیش |
Georgi Gerganov
|
b1de96824b
ci : fix wikitext url + compile warnings (#5569)
|
1 سال پیش |
Ananta Bastola
|
6e4e973b26
ci : add an option to fail on compile warning (#3952)
|
1 سال پیش |
Georgi Gerganov
|
594845aab1
ci : fix BERT model download and convert
|
1 سال پیش |
Georgi Gerganov
|
49cc1f7d67
bert : add tests + fix quantization (#5475)
|
1 سال پیش |
Abhilash Majumder
|
0f648573dd
ggml : add unified SYCL backend for Intel GPUs (#2690)
|
1 سال پیش |
crasm
|
413e7b0559
ci : add model tests + script wrapper (#4586)
|
2 سال پیش |
Georgi Gerganov
|
38566680cd
ggml : add IQ2 to test-backend-ops + refactoring (#4990)
|
2 سال پیش |
Georgi Gerganov
|
ba69bbc84c
imatrix : offload to GPU support (#4957)
|
2 سال پیش |
Georgi Gerganov
|
c918fe8dca
metal : create autorelease pool during library build (#4970)
|
2 سال پیش |
Georgi Gerganov
|
58ba655af0
metal : enable shader debugging (cmake option) (#4705)
|
2 سال پیش |
Georgi Gerganov
|
1142013da4
save-load-state : fix example + add ci test (#3655)
|
2 سال پیش |
Georgi Gerganov
|
1a8c8795d6
ci : check if there is enough VRAM (#3596)
|
2 سال پیش |