cturan/llama.cpp

Автор	SHA1 Опис	Дата
Douglas Hanley	2891c8aa9a Add support for BERT embedding models (#5423)	1 рік тому
snadampal	a07d0fee1f ggml : add mmla kernels for quantized GEMM (#4966)	1 рік тому
Paul Tsochantaris	e5ca3937c6 llama : do not cap thread count when MoE on CPU (#5419)	1 рік тому
slaren	41f308f58e llama : do not print "offloading layers" message in CPU-only builds (#5416)	1 рік тому
Johannes Gäßler	b7b74cef36 fix trailing whitespace (#5407)	1 рік тому
runfuture	4aa43fab56 llama : fix MiniCPM (#5392)	1 рік тому
Johannes Gäßler	26d4efd11e sampling: fix top_k <= 0 (#5388)	1 рік тому
0cc4m	ee1628bdfe Basic Vulkan Multi-GPU implementation (#5321)	1 рік тому
runfuture	316c7faf77 llama : add MiniCPM support (#5346)	1 рік тому
Kawrakow	89503dcb5f iq3_xxs: quards for the no-imatrix situation (#5334)	1 рік тому
Jared Van Bortel	1ec3332ade YaRN : store rope scaling type as int32_t in memory (#5285)	1 рік тому
Ian Bull	e1e721094d llama : fix memory leak in llama_batch_free (#5252)	1 рік тому
Guoteng	ce32060198 llama : support InternLM2 (#5184)	1 рік тому
Georgi Gerganov	d3bac7d584 llama : reorder build_orion() at correct place (#5118)	1 рік тому
Georgi Gerganov	5cb04dbc16 llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240)	1 рік тому
Yiming Cui	d62520eb2c Fix typos of IQ2_XXS and IQ3_XXS in llama.cpp (#5231)	1 рік тому
Jared Van Bortel	e8dc55d006 kompute : llama-bench support and ggml_cpu_has_kompute() (#5226)	1 рік тому
Kawrakow	f4d7e54974 SOTA 3-bit quants (#5196)	1 рік тому
Jared Van Bortel	6daa69ee81 kompute : fix fallback to CPU (#5201)	1 рік тому
Jared Van Bortel	fbf1ddec69 Nomic Vulkan backend (#4456)	1 рік тому
divinity76	2aed77eb06 fix typo "RLIMIT_MLOCK" (#5175)	1 рік тому
0cc4m	2307523d32 ggml : add Vulkan backend (#2059)	2 роки тому
Abhilash Majumder	0f648573dd ggml : add unified SYCL backend for Intel GPUs (#2690)	2 роки тому
Johannes Gäßler	9241c3a2ac Apply min_p to unsorted tokens (#5115)	2 роки тому
Johannes Gäßler	b2b2bf988c Tests for min_p, sampling queue (#5147)	2 роки тому
sharpHL	f2e69d28c0 llama : add support for Orion-14B (#5118)	2 роки тому
Kawrakow	1182cf4d4f Another bucket sort (#5109)	2 роки тому
l3utterfly	5eaf9964fc llama : dynamic temperature sampling (#4972)	2 роки тому
Kawrakow	faa3526a1e Fix Q3_K_XS for MoE models (#5113)	2 роки тому
slaren	1387ea2117 llama : pre-allocate input tensors in a separate buffer (#5100)	2 роки тому

Новіші Старіші

Історія комітів Пошук

Історія комітів