cturan/llama.cpp

Author	SHA1 Message	Date
Someone Serge	356ea17e0f flake.nix: expose checks	2 years ago
Someone Serge	a5c088d8c6 flake.nix: rocm not yet supported on aarch64, so hide the output	2 years ago
Someone Serge	1e3900ebac flake.nix: expose full scope in legacyPackages	2 years ago
Georgi Gerganov	e39106c055 ggml : add ggml_vdotq_s32 alias (#4715)	2 years ago
Georgi Gerganov	9fbda719de clip : refactor + bug fixes (#4696)	2 years ago
Johannes Gäßler	39d8bc71ed CUDA: fixed tensor cores not being used on RDNA3 (#4697)	2 years ago
automaticcat	24a447e20a ggml : add ggml_cpu_has_avx_vnni() (#4589)	2 years ago
Johannes Gäßler	a20f3c7465 CUDA: fix tensor core logic for Pascal and HIP (#4682)	2 years ago
Georgi Gerganov	0235b9b571 clip : use ggml_backend_buffer_is_host (#4205)	2 years ago
Steward Garcia	ce18d727a4 clip : enable gpu backend (#4205)	2 years ago
hydai	91bb39cec7 cuda: fix vmm oom issue on NVIDIA AGX Orin (#4687)	2 years ago
crasm	04ac0607e9 python : add check-requirements.sh and GitHub workflow (#4585)	2 years ago
Philip Taron	68eccbdc5b flake.nix : rewrite (#4605)	2 years ago
Cuong Trinh Manh	97bbca6e85 cmake : fix ld warning duplicate libraries libllama.a (#4671)	2 years ago
Justine Tunney	4af4801566 llava-cli : refactor to use sampling library (#4669)	2 years ago
Justine Tunney	db49ff8ed7 server : replace sleep with condition variables (#4673)	2 years ago
SakuraUmi	60f55e888c server : fix OpenAI server sampling w.r.t. penalty. (#4675)	2 years ago
Karthik Sethuraman	b93edd22f5 server : allow to generate multimodal embeddings (#4681)	2 years ago
andrijdavid	82d6eab224 main-cmake-pkg : fix build issue (#4665)	2 years ago
Peter Sugihara	afd997ab60 llama.swiftui : fix infinite loop, ouput timings, buff UI (#4674)	2 years ago
Georgi Gerganov	c8255f8a6b scripts : print list of sync commits	2 years ago
Tamotsu Takahashi	441f51dca0 ci : build with CLBlast + ggml-opencl use GGML_API (whisper/1576)	2 years ago
Georgi Gerganov	38b3de4658 sync : ggml	2 years ago
bssrdf	afc8c19291 ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669)	2 years ago
Georgi Gerganov	ca38b8d334 scripts : do not sync commits from this repo	2 years ago
Justine Tunney	65e5f6dadb Fix OpenAI server sampling w.r.t. temp and seed (#4668)	2 years ago
manikbhandari	ea5497df5d gpt2 : Add gpt2 architecture integration (#4555)	2 years ago
Nam D. Tran	f6793491b5 llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)	2 years ago
Daniel Bevenius	879b690a9e finetune : fix output formatting in print_params (#4653)	2 years ago
Georgi Gerganov	b47879b0dd scripts : add sync-ggml-am.sh	2 years ago
