cturan/llama.cpp

Szerző	SHA1 Üzenet	Dátum
Someone Serge	a5c088d8c6 flake.nix: rocm not yet supported on aarch64, so hide the output	2 éve
Someone Serge	1e3900ebac flake.nix: expose full scope in legacyPackages	2 éve
Georgi Gerganov	e39106c055 ggml : add ggml_vdotq_s32 alias (#4715)	2 éve
Georgi Gerganov	9fbda719de clip : refactor + bug fixes (#4696)	2 éve
Johannes Gäßler	39d8bc71ed CUDA: fixed tensor cores not being used on RDNA3 (#4697)	2 éve
automaticcat	24a447e20a ggml : add ggml_cpu_has_avx_vnni() (#4589)	2 éve
Johannes Gäßler	a20f3c7465 CUDA: fix tensor core logic for Pascal and HIP (#4682)	2 éve
Georgi Gerganov	0235b9b571 clip : use ggml_backend_buffer_is_host (#4205)	2 éve
Steward Garcia	ce18d727a4 clip : enable gpu backend (#4205)	2 éve
hydai	91bb39cec7 cuda: fix vmm oom issue on NVIDIA AGX Orin (#4687)	2 éve
crasm	04ac0607e9 python : add check-requirements.sh and GitHub workflow (#4585)	2 éve
Philip Taron	68eccbdc5b flake.nix : rewrite (#4605)	2 éve
Cuong Trinh Manh	97bbca6e85 cmake : fix ld warning duplicate libraries libllama.a (#4671)	2 éve
Justine Tunney	4af4801566 llava-cli : refactor to use sampling library (#4669)	2 éve
Justine Tunney	db49ff8ed7 server : replace sleep with condition variables (#4673)	2 éve
SakuraUmi	60f55e888c server : fix OpenAI server sampling w.r.t. penalty. (#4675)	2 éve
Karthik Sethuraman	b93edd22f5 server : allow to generate multimodal embeddings (#4681)	2 éve
andrijdavid	82d6eab224 main-cmake-pkg : fix build issue (#4665)	2 éve
Peter Sugihara	afd997ab60 llama.swiftui : fix infinite loop, ouput timings, buff UI (#4674)	2 éve
Georgi Gerganov	c8255f8a6b scripts : print list of sync commits	2 éve
Tamotsu Takahashi	441f51dca0 ci : build with CLBlast + ggml-opencl use GGML_API (whisper/1576)	2 éve
Georgi Gerganov	38b3de4658 sync : ggml	2 éve
bssrdf	afc8c19291 ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669)	2 éve
Georgi Gerganov	ca38b8d334 scripts : do not sync commits from this repo	2 éve
Justine Tunney	65e5f6dadb Fix OpenAI server sampling w.r.t. temp and seed (#4668)	2 éve
manikbhandari	ea5497df5d gpt2 : Add gpt2 architecture integration (#4555)	2 éve
Nam D. Tran	f6793491b5 llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)	2 éve
Daniel Bevenius	879b690a9e finetune : fix output formatting in print_params (#4653)	2 éve
Georgi Gerganov	b47879b0dd scripts : add sync-ggml-am.sh	2 éve
Georgi Gerganov	951010fa53 ggml : fix dot product for ARM (#4630)	2 éve

Újabb Korábbi

Commit történet Keresés

Commit történet