Pierrick Hymbert
|
e75c6279d1
server : enhanced health endpoint (#5548)
|
1 year ago |
Pierrick Hymbert
|
36376abe05
server : --n-predict option document and cap to max value (#5549)
|
1 year ago |
Daniel Hiltgen
|
66c1968f7a
server : graceful server shutdown (#5244)
|
1 year ago |
Georgi Gerganov
|
1dcc3fde00
common : fix ub (#5530)
|
1 year ago |
Herman Semenov
|
5d3de51f97
ggml, common, examples, tests : fixed type arguments in printf (#5528)
|
1 year ago |
Daniel Bevenius
|
fc0c8d286a
llava : update surgery script to not remove tensors (#5536)
|
1 year ago |
Kawrakow
|
bd2d4e393b
1.5 bit quantization (#5453)
|
1 year ago |
github-actions[bot]
|
c8e0d7efeb
flake.lock: Update
|
1 year ago |
Georgi Gerganov
|
8f1be0d42f
ggml : add ALiBi support for ggml_soft_max_ext (#5488)
|
1 year ago |
Ananta Bastola
|
6e4e973b26
ci : add an option to fail on compile warning (#3952)
|
1 year ago |
clibdev
|
d250c9d61d
gitignore : update for CLion IDE (#5544)
|
1 year ago |
Georgi Gerganov
|
5bf2b94dd4
cmake : fix VULKAN and ROCm builds (#5525)
|
1 year ago |
Georgi Gerganov
|
d2819d5577
scripts : add helpers script for bench comparing commits (#5521)
|
1 year ago |
Herman Semenov
|
4cb0727698
llava : removed excess free(NULL) operation (#5531)
|
1 year ago |
Herman Semenov
|
65085c713e
llama : minor fixed return int value (#5529)
|
1 year ago |
Alexey Parfenov
|
6dcc02d244
server : add "samplers" param to control the samplers order (#5494)
|
1 year ago |
Rőczey Barnabás
|
5f5808ca7b
server : fix system prompt cli (#5516)
|
1 year ago |
bmwl
|
f486f6e1e5
ggml : add numa options (#5377)
|
1 year ago |
Daniel Bevenius
|
60ed04cf82
llava : fix clip-model-is-vision flag in README.md (#5509)
|
1 year ago |
Georgi Gerganov
|
594845aab1
ci : fix BERT model download and convert
|
1 year ago |
Douglas Hanley
|
4524290e87
Use correct type of pooling for embedding models (#5500)
|
1 year ago |
Georgi Gerganov
|
c06e45d729
clip : fix wrong loop condition
|
1 year ago |
slaren
|
9060a1e9df
cuda : print message when initialization fails (#5512)
|
1 year ago |
Georgi Gerganov
|
9350a1cf21
scripts : add hf.sh helper script (#5501)
|
1 year ago |
Michaël de Vries
|
73122473ff
fix(gguf-py): special tokens are no longer skipped when add_<token>_token is set to false (#5487)
|
1 year ago |
Elbios
|
0d4177126b
llava : fix memory management bug (#5491)
|
1 year ago |
John
|
7930a8a6e8
llaba : hotfix for llava-1.6 image number (#5495)
|
1 year ago |
Neuman Vong
|
704359e299
vulkan: Find optimal memory type but with fallback (#5381)
|
1 year ago |
Rune
|
594fca3fef
readme : fix typo (#5490)
|
1 year ago |
John
|
ccbb277f46
llava : update README.md (#5489)
|
1 year ago |