cturan/llama.cpp

Эзэн	SHA1 Мессеж	Огноо
Georgi Gerganov	c06e45d729 clip : fix wrong loop condition	1 жил өмнө
slaren	9060a1e9df cuda : print message when initialization fails (#5512)	1 жил өмнө
Georgi Gerganov	9350a1cf21 scripts : add hf.sh helper script (#5501)	1 жил өмнө
Michaël de Vries	73122473ff fix(gguf-py): special tokens are no longer skipped when add_<token>_token is set to false (#5487)	1 жил өмнө
Elbios	0d4177126b llava : fix memory management bug (#5491)	1 жил өмнө
John	7930a8a6e8 llaba : hotfix for llava-1.6 image number (#5495)	1 жил өмнө
Neuman Vong	704359e299 vulkan: Find optimal memory type but with fallback (#5381)	1 жил өмнө
Rune	594fca3fef readme : fix typo (#5490)	1 жил өмнө
John	ccbb277f46 llava : update README.md (#5489)	1 жил өмнө
Michael Podvitskiy	8084d55440 cmake : ARM intrinsics detection for MSVC (#5401)	1 жил өмнө
John	aa23412989 llava : support v1.6 (#5267)	1 жил өмнө
AT	f5ca054855 Early return for zero size calls to get_tensor. (#5482)	1 жил өмнө
John	6c00a06692 gguf : add python reader example (#5216)	1 жил өмнө
Jared Van Bortel	ea9c8e1143 llama : add support for Nomic Embed (#5468)	1 жил өмнө
Aarni Koskela	c4e6dd59e4 llama : allow raw byte in SPM vocabs; don't crash on nl 404 (#5478)	1 жил өмнө
Aarni Koskela	037259be68 llama : make load error reporting more granular (#5477)	1 жил өмнө
Daniel Bevenius	263978904c finetune : rename feed-forward tensors (w1/w2/w3) (#4839)	1 жил өмнө
Georgi Gerganov	cf45252a7c tests : multi-thread the tokenizer tests (#5474)	1 жил өмнө
Douglas Hanley	03bf161eb6 llama : support batched embeddings (#5466)	1 жил өмнө
Johannes Gäßler	ad014bba97 make: add error message for bad CUDA version (#5444)	1 жил өмнө
Georgi Gerganov	49cc1f7d67 bert : add tests + fix quantization (#5475)	1 жил өмнө
Georgi Gerganov	99b8b43d7b tests : disable moe test (#5473)	1 жил өмнө
Kawrakow	895407f31b ggml-quants : fix compiler warnings (shadow variable) (#5472)	1 жил өмнө
Georgi Gerganov	099afc6274 llama : fix quantization when tensors are missing (#5423)	1 жил өмнө
Georgi Gerganov	df334a1125 swift : package no longer use ggml dependency (#5465)	1 жил өмнө
Lee	dbd8828eb0 py : fix persimmon `n_rot` conversion (#5460)	1 жил өмнө
Abhilash Majumder	43fe07c1a4 ggml-sycl: Replace 3d ops with macro (#5458)	1 жил өмнө
Daniel Bevenius	4a46d2b792 llava : remove prog parameter from ArgumentParser (#5457)	1 жил өмнө
Georgi Gerganov	3b169441df sync : ggml (#5452)	1 жил өмнө
Johannes Gäßler	3bdc4cd0f5 CUDA: mul_mat_vec_q tiling, refactor mul mat logic (#5434)	1 жил өмнө

Шинэ Хуучин

Коммит түүх Хайх

Коммит түүх