cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	0235b9b571 clip : use ggml_backend_buffer_is_host (#4205)	2 years ago
Steward Garcia	ce18d727a4 clip : enable gpu backend (#4205)	2 years ago
hydai	91bb39cec7 cuda: fix vmm oom issue on NVIDIA AGX Orin (#4687)	2 years ago
crasm	04ac0607e9 python : add check-requirements.sh and GitHub workflow (#4585)	2 years ago
Philip Taron	68eccbdc5b flake.nix : rewrite (#4605)	2 years ago
Cuong Trinh Manh	97bbca6e85 cmake : fix ld warning duplicate libraries libllama.a (#4671)	2 years ago
Justine Tunney	4af4801566 llava-cli : refactor to use sampling library (#4669)	2 years ago
Justine Tunney	db49ff8ed7 server : replace sleep with condition variables (#4673)	2 years ago
SakuraUmi	60f55e888c server : fix OpenAI server sampling w.r.t. penalty. (#4675)	2 years ago
Karthik Sethuraman	b93edd22f5 server : allow to generate multimodal embeddings (#4681)	2 years ago
andrijdavid	82d6eab224 main-cmake-pkg : fix build issue (#4665)	2 years ago
Peter Sugihara	afd997ab60 llama.swiftui : fix infinite loop, ouput timings, buff UI (#4674)	2 years ago
Georgi Gerganov	c8255f8a6b scripts : print list of sync commits	2 years ago
Tamotsu Takahashi	441f51dca0 ci : build with CLBlast + ggml-opencl use GGML_API (whisper/1576)	2 years ago
Georgi Gerganov	38b3de4658 sync : ggml	2 years ago
bssrdf	afc8c19291 ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669)	2 years ago
Georgi Gerganov	ca38b8d334 scripts : do not sync commits from this repo	2 years ago
Justine Tunney	65e5f6dadb Fix OpenAI server sampling w.r.t. temp and seed (#4668)	2 years ago
manikbhandari	ea5497df5d gpt2 : Add gpt2 architecture integration (#4555)	2 years ago
Nam D. Tran	f6793491b5 llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)	2 years ago
Daniel Bevenius	879b690a9e finetune : fix output formatting in print_params (#4653)	2 years ago
Georgi Gerganov	b47879b0dd scripts : add sync-ggml-am.sh	2 years ago
Georgi Gerganov	951010fa53 ggml : fix dot product for ARM (#4630)	2 years ago
wonjun Jang	f56d6077d0 Add byte token type when tokenizer.model is not exists (#4641)	2 years ago
slaren	dc68f0054c cuda : fix vmm pool with multi GPU (#4620)	2 years ago
WillCorticesAI	de8e496437 Update comment for AdamW implementation reference. (#4604)	2 years ago
FantasyGmm	77465dad48 Fix new CUDA10 compilation errors (#4635)	2 years ago
Paul Tsochantaris	a206137f92 Adding Emeltal reference to UI list (#4629)	2 years ago
slaren	b9f47952ff simplify bug issue template (#4623)	2 years ago
Shintarou Okada	753be377b6 llama : add PLaMo model (#3557)	2 years ago

