cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Howard Su	64cc19b4fe Fix the validation of main device (#1872)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	4bfcc855ab metal : parallel command buffer encoding (#1860)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	6b8312e797 Better error when using both LoRA + GPU layers (#1861)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	254a7a7a5f CUDA full GPU acceleration, KV cache in VRAM (#1827)	%!s(int64=2) %!d(string=hai) anos
0xspringtime	9254920265 baby-llama : fix operator!= (#1821)	%!s(int64=2) %!d(string=hai) anos
xaedes	e32089b2c2 train : improved training-from-scratch example (#1652)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	2347e45e7b llama : do a warm-up eval at start for better timings (#1824)	%!s(int64=2) %!d(string=hai) anos
Kerfuffle	74d4cfa343 Allow "quantizing" to f16 and f32 (#1787)	%!s(int64=2) %!d(string=hai) anos
Kawrakow	74a6d922f1 Metal implementation for all k_quants (#1807)	%!s(int64=2) %!d(string=hai) anos
slaren	e4caa8da59 ci : run when changing only the CUDA sources (#1800)	%!s(int64=2) %!d(string=hai) anos
Howard Su	58970a4c39 Leverage mmap for offloading tensors to GPU (#1597)	%!s(int64=2) %!d(string=hai) anos
Kawrakow	8c0a10e64d metal : fix failure to load model (#1817)	%!s(int64=2) %!d(string=hai) anos
Kerfuffle	fa84c4b3e8 Fix issue where interactive mode crashes when input exceeds ctx size (#1789)	%!s(int64=2) %!d(string=hai) anos
Kyle Liang	12b063f0ec Fixed WSL cuda's OOM error (#1594)	%!s(int64=2) %!d(string=hai) anos
Ryan Landay	31d2b5f4a4 Update SHA256SUMS with current hashes for models quantized using q4_0 (#1798)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	4de0334f5c cmake : fix Metal build (close #1791)	%!s(int64=2) %!d(string=hai) anos
Artyom Lebedev	3f1223155a k-quants : GCC12 compilation fix (#1792)	%!s(int64=2) %!d(string=hai) anos
Andrei	303f5809f1 metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782)	%!s(int64=2) %!d(string=hai) anos
Aisuko	059e99066d doc : fix wrong address of BLIS.md (#1772)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	17c10acfb4 ggml : force no_alloc == false when creating opt tensors (close #1699)	%!s(int64=2) %!d(string=hai) anos
Kawrakow	e9b66ee982 metal : add Q4_1 implementation (#1785)	%!s(int64=2) %!d(string=hai) anos
Kerfuffle	4f0154b0ba llama : support requantizing models instead of only allowing quantization from 16/32bit (#1691)	%!s(int64=2) %!d(string=hai) anos
Xingchen Song(宋星辰)	ef3171d162 ggml : workaround for missing _mm256_setr_m128i in GCC < 8 (#1638)	%!s(int64=2) %!d(string=hai) anos
rankaiyx	555275a693 make : add SSSE3 compilation use case (#1659)	%!s(int64=2) %!d(string=hai) anos
Robert Sung-wook Shin	98ed165574 OpenCL: Add release memory (#1741)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	ae9663f188 Windows nvcc workaround (#1753)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	b33dee282f metal : fix build "tanhf" -> "tanh"	%!s(int64=2) %!d(string=hai) anos
AT	92f44ff7f7 metal : add GELU implementation (#1770)	%!s(int64=2) %!d(string=hai) anos
Kawrakow	245fc3c37d metal : faster q4_0 (#1775)	%!s(int64=2) %!d(string=hai) anos
Kawrakow	72ff5282bf metal : add Q2_K implementation (#1762)	%!s(int64=2) %!d(string=hai) anos

Posterior Anterior

Commit History Buscar

Commit History