cturan/llama.cpp

Автор	SHA1 Сообщение	Дата
Georgi Gerganov	07028f9d74 flake.lock: Update (#10063)	1 год назад
R0CKSTAR	524afeec9d musa: workaround for Guilty Lockup in cleaning src0 (#10042)	1 год назад
Georgi Gerganov	8125e6cbfc server : don't overfill the batch during infill (#10018)	1 год назад
Georgi Gerganov	8841ce3f43 llama : switch KQ multiplication to F32 precision by default (#10015)	1 год назад
Georgi Gerganov	cc2983d375 sync : ggml	1 год назад
bssrdf	8c60a8a462 increase cuda_cpy block size (ggml/996)	1 год назад
Georgi Gerganov	9e4a2563ea scripts : fix amx sync [no ci]	1 год назад
Georgi Gerganov	668750357e metal : support permuted matrix multiplicaions (#10033)	1 год назад
wwoodsTM	ff252ea48e llama : add DRY sampler (#9702)	1 год назад
Michael Podvitskiy	d80fb71f8b llama: string_split fix (#10022)	1 год назад
Srihari-mcw	2f8bd2b901 llamafile : extend sgemm.cpp support for Q5_0 models (#10010)	1 год назад
Georgi Gerganov	bc5ba007b2 server : check that the prompt fits in the slot's context (#10030)	1 год назад
Xuan Son Nguyen	958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023)	1 год назад
Georgi Gerganov	40f2555797 ci : fix cmake flags for SYCL	1 год назад
Johannes Gäßler	167a515651 CUDA: fix insufficient buffer clearing for MMQ (#10032)	1 год назад
Johannes Gäßler	c39665f589 CUDA: fix MMQ for non-contiguous src0, add tests (#10021)	1 год назад
wwoodsTM	0a1c750c80 server : samplers accept the prompt correctly (#10019)	1 год назад
Georgi Gerganov	190a37d797 sync : ggml	1 год назад
Georgi Gerganov	2d3aba9ee8 llama.vim : bump generation time limit to 3s [no ci]	1 год назад
Johannes Gäßler	80273a306d CUDA: fix 1D im2col, add tests (ggml/993)	1 год назад
Daniel Bevenius	c19af0acb1 ggml : remove redundant set of contexts used field (ggml/978)	1 год назад
Michael Coppola	ac113a0fee llama.vim : add classic vim support (#9995)	1 год назад
Jun Hee Yoo	4c9388fb96 metal : add POOL2D and fix IM2COL (#9943)	1 год назад
github-actions[bot]	873279b159 flake.lock: Update	1 год назад
Xuan Son Nguyen	c8c07d658a llama : fix empty batch causing llama_batch_allocr to crash (#9966)	1 год назад
Daniel Bevenius	19d900a756 llama : rename batch to ubatch (#9950)	1 год назад
Molly Sophia	11d47057a5 Rwkv chat template fix (#10001)	1 год назад
Xuan Son Nguyen	c421ac072d lora : warn user if new token is added in the adapter (#9948)	1 год назад
Molly Sophia	4ff7fe1fb3 llama : add chat template for RWKV-World + fix EOT (#9968)	1 год назад
leo-pony	6b8447352d [CANN] Adapt to dynamically loadable backends mechanism (#9970)	1 год назад

Новее Раньше

История коммитов Найти

История коммитов