github-actions[bot]
|
c8e0d7efeb
flake.lock: Update
|
1 year ago |
Georgi Gerganov
|
8f1be0d42f
ggml : add ALiBi support for ggml_soft_max_ext (#5488)
|
1 year ago |
Ananta Bastola
|
6e4e973b26
ci : add an option to fail on compile warning (#3952)
|
1 year ago |
clibdev
|
d250c9d61d
gitignore : update for CLion IDE (#5544)
|
1 year ago |
Georgi Gerganov
|
5bf2b94dd4
cmake : fix VULKAN and ROCm builds (#5525)
|
1 year ago |
Georgi Gerganov
|
d2819d5577
scripts : add helpers script for bench comparing commits (#5521)
|
1 year ago |
Herman Semenov
|
4cb0727698
llava : removed excess free(NULL) operation (#5531)
|
1 year ago |
Herman Semenov
|
65085c713e
llama : minor fixed return int value (#5529)
|
1 year ago |
Alexey Parfenov
|
6dcc02d244
server : add "samplers" param to control the samplers order (#5494)
|
1 year ago |
Rőczey Barnabás
|
5f5808ca7b
server : fix system prompt cli (#5516)
|
1 year ago |
bmwl
|
f486f6e1e5
ggml : add numa options (#5377)
|
1 year ago |
Daniel Bevenius
|
60ed04cf82
llava : fix clip-model-is-vision flag in README.md (#5509)
|
1 year ago |
Georgi Gerganov
|
594845aab1
ci : fix BERT model download and convert
|
1 year ago |
Douglas Hanley
|
4524290e87
Use correct type of pooling for embedding models (#5500)
|
1 year ago |
Georgi Gerganov
|
c06e45d729
clip : fix wrong loop condition
|
1 year ago |
slaren
|
9060a1e9df
cuda : print message when initialization fails (#5512)
|
1 year ago |
Georgi Gerganov
|
9350a1cf21
scripts : add hf.sh helper script (#5501)
|
1 year ago |
Michaël de Vries
|
73122473ff
fix(gguf-py): special tokens are no longer skipped when add_<token>_token is set to false (#5487)
|
1 year ago |
Elbios
|
0d4177126b
llava : fix memory management bug (#5491)
|
1 year ago |
John
|
7930a8a6e8
llaba : hotfix for llava-1.6 image number (#5495)
|
1 year ago |
Neuman Vong
|
704359e299
vulkan: Find optimal memory type but with fallback (#5381)
|
1 year ago |
Rune
|
594fca3fef
readme : fix typo (#5490)
|
1 year ago |
John
|
ccbb277f46
llava : update README.md (#5489)
|
1 year ago |
Michael Podvitskiy
|
8084d55440
cmake : ARM intrinsics detection for MSVC (#5401)
|
1 year ago |
John
|
aa23412989
llava : support v1.6 (#5267)
|
1 year ago |
AT
|
f5ca054855
Early return for zero size calls to get_tensor. (#5482)
|
1 year ago |
John
|
6c00a06692
gguf : add python reader example (#5216)
|
1 year ago |
Jared Van Bortel
|
ea9c8e1143
llama : add support for Nomic Embed (#5468)
|
1 year ago |
Aarni Koskela
|
c4e6dd59e4
llama : allow raw byte in SPM vocabs; don't crash on nl 404 (#5478)
|
1 year ago |
Aarni Koskela
|
037259be68
llama : make load error reporting more granular (#5477)
|
1 year ago |