Xuan Son Nguyen
|
7bb36ccf91
gguf : enforce that tensor names are unique (#6905)
|
1 year ago |
Neo Zhang
|
ce023f6f2f
add device version in device list (#6959)
|
1 year ago |
github-actions[bot]
|
6e472f58e4
flake.lock: Update
|
1 year ago |
mgroeber9110
|
4dba7e8114
Replace "alternative" boolean operator in conditional compilation directive (#6949)
|
1 year ago |
Pierrick Hymbert
|
b7368332e2
ci: server: tests python env on github container ubuntu latest / fix n_predict (#6935)
|
1 year ago |
agray3
|
928e0b7013
Reset schedule earlier to allow overlap with ggml graph computation on device (#6933)
|
1 year ago |
Pierrick Hymbert
|
0c4d489e29
quantize: add imatrix and dataset metadata in GGUF (#6658)
|
1 year ago |
slaren
|
017e6999b5
add basic tensor data validation function (#6884)
|
1 year ago |
slaren
|
e2764cd7ca
gguf : fix mismatch between alloc and free functions (#6929)
|
1 year ago |
Justine Tunney
|
4b1c3c98b4
llamafile : use 64-bit integers in sgemm (#6928)
|
1 year ago |
Pierrick Hymbert
|
bbe3c6e761
ci: server: fix python installation (#6925)
|
1 year ago |
Pierrick Hymbert
|
7f5ff558ee
server: stop generation at `n_ctx_train` if `n_predict` is not set (#6638)
|
1 year ago |
Pierrick Hymbert
|
9e4e077ec5
ci: server: fix python installation (#6922)
|
1 year ago |
Georgi Gerganov
|
83b72cb086
Merge pull request from GHSA-p5mv-gjc5-mwqv
|
1 year ago |
Pierrick Hymbert
|
d4a9afc100
ci: server: fix python installation (#6918)
|
1 year ago |
Pierrick Hymbert
|
7d641c26ac
ci: fix concurrency for pull_request_target (#6917)
|
1 year ago |
Pierrick Hymbert
|
5790c8dac1
bench: server add stop word for PHI-2 (#6916)
|
1 year ago |
vik
|
46e12c4692
llava : add support for moondream vision language model (#6899)
|
1 year ago |
Georgi Gerganov
|
dba497e0c1
cmake : restore LLAMA_LLAMAFILE_DEFAULT
|
1 year ago |
Georgi Gerganov
|
fa0b4ad252
cmake : remove obsolete ANDROID check
|
1 year ago |
slaren
|
d6e1d44f16
llama : synchronize before get/set session data (#6911)
|
1 year ago |
Georgi Gerganov
|
853d06ffe2
ci : tmp disable slow tests
|
1 year ago |
BarfingLemurs
|
3fe0596c18
readme : update model list (#6908)
|
1 year ago |
slaren
|
0ead1f1072
llama : check that all the tensor data is in the model file (#6885)
|
1 year ago |
Georgi Gerganov
|
51543729ff
ggml : fix redefinition of vaddvq_f32 for 32-bit ARM (#6906)
|
1 year ago |
Daniel Bevenius
|
4ab99d8d47
clip : rename lerp function to avoid conflict (#6894)
|
1 year ago |
Georgi Gerganov
|
54770413c4
ggml : fix MIN / MAX macros (#6904)
|
1 year ago |
Georgi Gerganov
|
aa750c1ede
tests : minor bash stuff (#6902)
|
1 year ago |
jiez
|
1966eb2615
quantize : add '--keep-split' to quantize model into shards (#6688)
|
1 year ago |
Johannes Gäßler
|
784e11dea1
README: add graphic for matrix multiplication (#6881)
|
1 year ago |