cturan/llama.cpp

Аутор	SHA1 Порука	Датум
Georgi Gerganov	76484fbfd3 sync : ggml	пре 2 година
Johannes Gäßler	c71d608ce7 ggml: cache sin/cos for RoPE (#4908)	пре 2 година
Georgi Gerganov	4be5ef556d metal : remove old API (#4919)	пре 2 година
Georgi Gerganov	0ea069b87b server : fix prompt caching with system prompt (#4914)	пре 2 година
Georgi Gerganov	f172de03f1 llama : fix detokenization of non-special added-tokens (#4916)	пре 2 година
Georgi Gerganov	2d57de5255 metal : disable log for loaded kernels (#4794)	пре 2 година
David Friehs	df845cc982 llama : minimize size used for state save/load (#4820)	пре 2 година
Someone	6b48ed0893 workflows: unbreak nix-build-aarch64, and split it out (#4915)	пре 2 година
Yann Follet	722d33f34e main : add parameter --no-display-prompt (#4541)	пре 2 година
texmex76	c30b1ef39a gguf : fix potential infinite for-loop (#4600)	пре 2 година
Georgi Gerganov	b38b5e93ae metal : refactor kernel loading code (#4794)	пре 2 година
Johannes Gäßler	7dc78764e2 compare-llama-bench: tweak output format (#4910)	пре 2 година
Ziad Ben Hadj-Alouane	356327feb3 server : fix deadlock that occurs in multi-prompt scenarios (#4905)	пре 2 година
makomk	ee8243adaa server : fix crash with multimodal models without BOS token (#4904)	пре 2 година
Georgi Gerganov	15ebe59210 convert : update phi-2 to latest HF repo (#4903)	пре 2 година
Georgi Gerganov	de473f5f8e sync : ggml	пре 2 година
Georgi Gerganov	f238461236 ggml : fix 32-bit ARM compat for IQ2_XS (whisper/1758)	пре 2 година
slaren	fa5c1fb44a backend_sched : fix assignments	пре 2 година
Maximilian Winter	52ee4540c0 examples : add pydantic models to GBNF grammar generator (#4883)	пре 2 година
Johannes Gäßler	3fe81781e3 CUDA: faster q8_0 -> f16 dequantization (#4895)	пре 2 година
slaren	e7e4df031b llama : ggml-backend integration (#4766)	пре 2 година
Georgi Gerganov	584d674be6 llama : remove redundant assert for StableLM (#4901)	пре 2 година
Daniel Bevenius	930f907d3e export-lora : use LLAMA_FILE_MAGIC_GGLA (#4894)	пре 2 година
Zay	e790eef21c llama.swiftui : update models layout (#4826)	пре 2 година
Georgi Gerganov	5537d9d36b gitignore : imatrix	пре 2 година
Johannes Gäßler	1b280c9fff CUDA: fix softmax compile for old CUDA versions (#4862)	пре 2 година
Georgi Gerganov	3cabe80630 llama : fix typo "imp_embd" -> "inp_embd"	пре 2 година
howlger	4315a94366 common : streamline the formatting of help (#4890)	пре 2 година
Georgi Gerganov	2d00741e12 py : fix lint (#4889)	пре 2 година
Georgi Gerganov	f445c0e68c llama : fix llm_build_k_shift to use correct n_rot (#4889)	пре 2 година

Новије Старије

Историја ревизија Пронађи

Историја ревизија