cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	178b1850eb k-quants : fix zero-weight guard in Q6_K (ref #3040)	2 years ago
Kerfuffle	ea2c85d5d2 convert-llama-ggml-to-gguf: Try to handle files older than GGJTv3 (#3023)	2 years ago
Cebtenzzre	9912b9efc8 build : add LLAMA_METAL_NDEBUG flag (#3033)	2 years ago
Cebtenzzre	9e2023156e make : use new flag variables for recent changes (#3019)	2 years ago
Cebtenzzre	de2fe892af examples : replace fprintf to stdout with printf (#3017)	2 years ago
Erik Scholz	c9c3220c48 convert: fix convert.py not working with int filename_stem (#3028)	2 years ago
Kawrakow	d59bd97065 Guard against all weights in a super-block being zero (#3010)	2 years ago
Georgi Gerganov	35938ee3b0 llama : update logic for number of threads when using BLAS	2 years ago
Georgi Gerganov	921772104b speculative : add grammar support (#2991)	2 years ago
Georgi Gerganov	2ba85c8609 py : minor	2 years ago
Georgi Gerganov	e36ecdccc8 build : on Mac OS enable Metal by default (#2901)	2 years ago
slaren	bd33e5ab92 ggml-opencl : store GPU buffer in ggml_tensor::extra (#2994)	2 years ago
Cebtenzzre	3103568144 llama-bench : make cpp file non-executable (#2999)	2 years ago
Leng Yue	5b8530d88c make : add speculative example (#3003)	2 years ago
Aarni Koskela	e4386f417f server : add a subtle loading animation to the edit box (#2466)	2 years ago
Jiahao Li	35195689cd 2x faster (rms) norm cuda kernels (3.7% e2e improvement) (#2985)	2 years ago
slaren	cf9b08485c ggml-alloc : use virtual memory for measurement (#2973)	2 years ago
Georgi Gerganov	47068e5170 speculative : PoC for speeding-up inference via speculative sampling (#2926)	2 years ago
Georgi Gerganov	8f429fa511 perplexity : fix ETA by warming up the model with an empty run	2 years ago
Kerfuffle	6519e9c99c gguf(python): Fix special vocab handling when id < 0 (#2984)	2 years ago
Georgi Gerganov	b7f2aa9e51 metal : restore 363f0bf and fix reduce in F16_F32 kernels (#2986)	2 years ago
Alon	73a12a6344 cov : disable comment in PRs (#2989)	2 years ago
opparco	3730134776 llama : fix bpe tokenize from byte (#2889)	2 years ago
Georgi Gerganov	d9151e6f57 metal : revert 6af0bab until we fix it	2 years ago
Alon	afc43d5f82 cov : add Code Coverage and codecov.io integration (#2928)	2 years ago
Wentai Zhang	6460f758db opencl : fix a bug in ggml_cl_pool_malloc() for ggml_cl_mul_mat_f32() (#2955)	2 years ago
Kawrakow	ca82cf7bac metal : more optimizations (#2959)	2 years ago
kchro3	6a31a3bd98 swift : add support for k-quants (#2983)	2 years ago
Kerfuffle	cff7b0bf07 convert.py : BPE fixes (#2938)	2 years ago
Ido S	340af42f09 docs : add `catai` to `README.md` (#2967)	2 years ago

Commit history