cturan/llama.cpp

Autor	SHA1 Nachricht	Datum
Pierrick Hymbert	0c4d489e29 quantize: add imatrix and dataset metadata in GGUF (#6658)	vor 1 Jahr
slaren	017e6999b5 add basic tensor data validation function (#6884)	vor 1 Jahr
jiez	1966eb2615 quantize : add '--keep-split' to quantize model into shards (#6688)	vor 1 Jahr
Douglas Hanley	b4e4b8a935 llama : add llama_get_pooling_type function (#6862)	vor 1 Jahr
Johannes Gäßler	28103f4832 Server: fix seed for multiple slots (#6835)	vor 1 Jahr
Georgi Gerganov	40f74e4d73 llama : add option to render special/control tokens (#6807)	vor 1 Jahr
Pedro Cuenca	b97bc3966e llama : support Llama 3 HF conversion (#6745)	vor 1 Jahr
Olivier Chafik	cbaadc9294 grammars: 1.5x faster inference w/ complex grammars (vector reserves / reuses) (#6609)	vor 1 Jahr
Jared Van Bortel	1b67731e18 BERT tokenizer fixes (#6498)	vor 1 Jahr
Rick G	e3c337d87c llama : support negative ith in llama_get_ API (#6519)	vor 1 Jahr
Jan Boon	beea6e1b16 llama : save and restore kv cache for single seq id (#6341)	vor 1 Jahr
Clint Herron	9b84ae1806 examples : add GBNF validator program (#5948)	vor 1 Jahr
Jared Van Bortel	be55134a53 convert : refactor vocab selection logic (#6355)	vor 1 Jahr
compilade	557410b8f0 llama : greatly reduce output buffer memory usage (#6122)	vor 1 Jahr
Kawrakow	55c1b2a3bb IQ1_M: 1.75 bpw quantization (#6302)	vor 1 Jahr
Kawrakow	d25b1c31b0 quantize : be able to override metadata by key (#6321)	vor 1 Jahr
Kawrakow	1d0331c12a quantize: options for output and token embedding tensors qtype (#6239)	vor 1 Jahr
Pierrick Hymbert	dba1af6129 llama_model_loader: support multiple split/shard GGUFs (#6187)	vor 1 Jahr
Theia Vogel	877b4d0c62 llama : add support for control vectors (#5970)	vor 1 Jahr
Michael Podvitskiy	69ff61397d llama : support models without vocabulary (#5798)	vor 1 Jahr
slaren	f30ea47a87 llama : add pipeline parallelism support (#6017)	vor 1 Jahr
Georgi Gerganov	05b06210c9 llama : more consistent names of count variables (#5994)	vor 1 Jahr
Georgi Gerganov	ee35600b90 llama : fix F16/F32 downcast + improve names (#5980)	vor 1 Jahr
DAN™	bcebd7dbf6 llama : add support for GritLM (#5959)	vor 1 Jahr
compilade	c2101a2e90 llama : support Mamba Selective State Space Models (#5328)	vor 1 Jahr
Georgi Gerganov	29ae62d2ae llama : fix embeddings (#5796)	vor 1 Jahr
Douglas Hanley	475df1d6cf llama : allow for user specified embedding pooling type (#5849)	vor 1 Jahr
Michael Podvitskiy	4a6e2d6142 llama : add abort_callback to interrupt computation (#5409)	vor 1 Jahr
Pierrick Hymbert	3ab8b3a92e llama : cleanup unused mmq flags (#5772)	vor 1 Jahr
Marcus Dunn	d5ab29757e llama : constified `llama_set_state_data`'s `src` (#5774)	vor 1 Jahr

Neuer Älter

Commit Verlauf Finden

Commit Verlauf