cturan/llama.cpp

Autor	SHA1 Wiadomość	Data
Kawrakow	2b3a665d39 llama : use Q4_K for attn_v for Q2_K_S when n_gqa >= 4 (#4996)	2 lat temu
Kawrakow	334a835a1c ggml : importance matrix support for legacy quants (#4969)	2 lat temu
David Friehs	4483396751 llama : apply classifier-free guidance to logits directly (#4951)	2 lat temu
Kawrakow	2faaef3979 llama : check for 256 divisibility for IQ2_XS, IQ2_XXS (#4950)	2 lat temu
David Pflug	a836c8f534 llama : fix missing quotes (#4937)	2 lat temu
Georgi Gerganov	bb0c139247 llama : check LLAMA_TRACE env for extra logging (#4929)	2 lat temu
Georgi Gerganov	03c5267490 llama : use LLAMA_LOG_ macros for logging	2 lat temu
Kawrakow	a128c38de8 Fix ffn_down quantization mix for MoE models (#4927)	2 lat temu
Karthik Kumar Viswanathan	ac32902a87 llama : support WinXP build with MinGW 8.1.0 (#3419)	2 lat temu
Kawrakow	147b17ac94 2-bit quantizations (#4897)	2 lat temu
Kawrakow	807179ec58 Make Q3_K_S be the same as olf Q3_K_L for Mixtral-8x7B (#4906)	2 lat temu
Georgi Gerganov	4be5ef556d metal : remove old API (#4919)	2 lat temu
Georgi Gerganov	f172de03f1 llama : fix detokenization of non-special added-tokens (#4916)	2 lat temu
David Friehs	df845cc982 llama : minimize size used for state save/load (#4820)	2 lat temu
Georgi Gerganov	15ebe59210 convert : update phi-2 to latest HF repo (#4903)	2 lat temu
slaren	e7e4df031b llama : ggml-backend integration (#4766)	2 lat temu
Georgi Gerganov	584d674be6 llama : remove redundant assert for StableLM (#4901)	2 lat temu
Georgi Gerganov	3cabe80630 llama : fix typo "imp_embd" -> "inp_embd"	2 lat temu
Georgi Gerganov	f445c0e68c llama : fix llm_build_k_shift to use correct n_rot (#4889)	2 lat temu
Kawrakow	469e75d0a3 llama : restore intended k-quants mixes for MoE models (#4872)	2 lat temu
Kawrakow	49662cbed3 ggml : SOTA 2-bit quants (add IQ2_XS) (#4856)	2 lat temu
pudepiedj	43f76bf1c3 main : print total token count and tokens consumed so far (#4874)	2 lat temu
Brian	57d016ba2d llama : add additional suffixes for model params (#4834)	2 lat temu
Austin	329ff61569 llama : recognize 1B phi models (#4847)	2 lat temu
Kawrakow	dd5ae06405 SOTA 2-bit quants (#4773)	2 lat temu
Georgi Gerganov	b0034d93ce examples : add passkey test (#3856)	2 lat temu
Georgi Gerganov	9dede37d81 llama : remove unused vars (#4796)	2 lat temu
Georgi Gerganov	3c36213df8 llama : remove redundant GQA check (#4796)	2 lat temu
Georgi Gerganov	d117d4dc5d llama : print tensor meta for debugging	2 lat temu
Georgi Gerganov	540938f890 llama : llama_model_desc print number of experts	2 lat temu

Nowsze Starsze

Historia zmian Szukaj

Historia zmian