cturan/llama.cpp

Author	SHA1 Message	Date
R0CKSTAR	524afeec9d musa: workaround for Guilty Lockup in cleaning src0 (#10042)	1 year ago
Georgi Gerganov	8125e6cbfc server : don't overfill the batch during infill (#10018)	1 year ago
Georgi Gerganov	8841ce3f43 llama : switch KQ multiplication to F32 precision by default (#10015)	1 year ago
Georgi Gerganov	cc2983d375 sync : ggml	1 year ago
bssrdf	8c60a8a462 increase cuda_cpy block size (ggml/996)	1 year ago
Georgi Gerganov	9e4a2563ea scripts : fix amx sync [no ci]	1 year ago
Georgi Gerganov	668750357e metal : support permuted matrix multiplicaions (#10033)	1 year ago
wwoodsTM	ff252ea48e llama : add DRY sampler (#9702)	1 year ago
Michael Podvitskiy	d80fb71f8b llama: string_split fix (#10022)	1 year ago
Srihari-mcw	2f8bd2b901 llamafile : extend sgemm.cpp support for Q5_0 models (#10010)	1 year ago
Georgi Gerganov	bc5ba007b2 server : check that the prompt fits in the slot's context (#10030)	1 year ago
Xuan Son Nguyen	958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023)	1 year ago
Georgi Gerganov	40f2555797 ci : fix cmake flags for SYCL	1 year ago
Johannes Gäßler	167a515651 CUDA: fix insufficient buffer clearing for MMQ (#10032)	1 year ago
Johannes Gäßler	c39665f589 CUDA: fix MMQ for non-contiguous src0, add tests (#10021)	1 year ago
wwoodsTM	0a1c750c80 server : samplers accept the prompt correctly (#10019)	1 year ago
Georgi Gerganov	190a37d797 sync : ggml	1 year ago
Georgi Gerganov	2d3aba9ee8 llama.vim : bump generation time limit to 3s [no ci]	1 year ago
Johannes Gäßler	80273a306d CUDA: fix 1D im2col, add tests (ggml/993)	1 year ago
Daniel Bevenius	c19af0acb1 ggml : remove redundant set of contexts used field (ggml/978)	1 year ago
Michael Coppola	ac113a0fee llama.vim : add classic vim support (#9995)	1 year ago
Jun Hee Yoo	4c9388fb96 metal : add POOL2D and fix IM2COL (#9943)	1 year ago
github-actions[bot]	873279b159 flake.lock: Update	1 year ago
Xuan Son Nguyen	c8c07d658a llama : fix empty batch causing llama_batch_allocr to crash (#9966)	1 year ago
Daniel Bevenius	19d900a756 llama : rename batch to ubatch (#9950)	1 year ago
Molly Sophia	11d47057a5 Rwkv chat template fix (#10001)	1 year ago
Xuan Son Nguyen	c421ac072d lora : warn user if new token is added in the adapter (#9948)	1 year ago
Molly Sophia	4ff7fe1fb3 llama : add chat template for RWKV-World + fix EOT (#9968)	1 year ago
leo-pony	6b8447352d [CANN] Adapt to dynamically loadable backends mechanism (#9970)	1 year ago
Daniel Bevenius	674804a996 arg : fix typo in embeddings argument help [no ci] (#9994)	1 year ago
