Georgi Gerganov
|
c06e45d729
clip : fix wrong loop condition
|
1 жил өмнө |
slaren
|
9060a1e9df
cuda : print message when initialization fails (#5512)
|
1 жил өмнө |
Georgi Gerganov
|
9350a1cf21
scripts : add hf.sh helper script (#5501)
|
1 жил өмнө |
Michaël de Vries
|
73122473ff
fix(gguf-py): special tokens are no longer skipped when add_<token>_token is set to false (#5487)
|
1 жил өмнө |
Elbios
|
0d4177126b
llava : fix memory management bug (#5491)
|
1 жил өмнө |
John
|
7930a8a6e8
llaba : hotfix for llava-1.6 image number (#5495)
|
1 жил өмнө |
Neuman Vong
|
704359e299
vulkan: Find optimal memory type but with fallback (#5381)
|
1 жил өмнө |
Rune
|
594fca3fef
readme : fix typo (#5490)
|
1 жил өмнө |
John
|
ccbb277f46
llava : update README.md (#5489)
|
1 жил өмнө |
Michael Podvitskiy
|
8084d55440
cmake : ARM intrinsics detection for MSVC (#5401)
|
1 жил өмнө |
John
|
aa23412989
llava : support v1.6 (#5267)
|
1 жил өмнө |
AT
|
f5ca054855
Early return for zero size calls to get_tensor. (#5482)
|
1 жил өмнө |
John
|
6c00a06692
gguf : add python reader example (#5216)
|
1 жил өмнө |
Jared Van Bortel
|
ea9c8e1143
llama : add support for Nomic Embed (#5468)
|
1 жил өмнө |
Aarni Koskela
|
c4e6dd59e4
llama : allow raw byte in SPM vocabs; don't crash on nl 404 (#5478)
|
1 жил өмнө |
Aarni Koskela
|
037259be68
llama : make load error reporting more granular (#5477)
|
1 жил өмнө |
Daniel Bevenius
|
263978904c
finetune : rename feed-forward tensors (w1/w2/w3) (#4839)
|
1 жил өмнө |
Georgi Gerganov
|
cf45252a7c
tests : multi-thread the tokenizer tests (#5474)
|
1 жил өмнө |
Douglas Hanley
|
03bf161eb6
llama : support batched embeddings (#5466)
|
1 жил өмнө |
Johannes Gäßler
|
ad014bba97
make: add error message for bad CUDA version (#5444)
|
1 жил өмнө |
Georgi Gerganov
|
49cc1f7d67
bert : add tests + fix quantization (#5475)
|
1 жил өмнө |
Georgi Gerganov
|
99b8b43d7b
tests : disable moe test (#5473)
|
1 жил өмнө |
Kawrakow
|
895407f31b
ggml-quants : fix compiler warnings (shadow variable) (#5472)
|
1 жил өмнө |
Georgi Gerganov
|
099afc6274
llama : fix quantization when tensors are missing (#5423)
|
1 жил өмнө |
Georgi Gerganov
|
df334a1125
swift : package no longer use ggml dependency (#5465)
|
1 жил өмнө |
Lee
|
dbd8828eb0
py : fix persimmon `n_rot` conversion (#5460)
|
1 жил өмнө |
Abhilash Majumder
|
43fe07c1a4
ggml-sycl: Replace 3d ops with macro (#5458)
|
1 жил өмнө |
Daniel Bevenius
|
4a46d2b792
llava : remove prog parameter from ArgumentParser (#5457)
|
1 жил өмнө |
Georgi Gerganov
|
3b169441df
sync : ggml (#5452)
|
1 жил өмнө |
Johannes Gäßler
|
3bdc4cd0f5
CUDA: mul_mat_vec_q tiling, refactor mul mat logic (#5434)
|
1 жил өмнө |