cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Xuan-Son Nguyen	aa3b7a90b4 arg: add --cache-list argument to list cached models (#17073)	hai 2 meses
Gadflyii	3df2244df4 llama : add --no-host to disable host buffers (#16310)	hai 3 meses
Aaron Teo	624207e676 devops: add s390x & ppc64le CI (#15925)	hai 3 meses
Douglas Hanley	b5bd037832 llama : add support for qwen3 reranker (#15824)	hai 3 meses
Uilian Ries	152729f884 common : add missing chrono header for common.cpp (#16211)	hai 3 meses
Johannes Gäßler	e81b8e4b7f llama: use FA + max. GPU layers by default (#15434)	hai 4 meses
Sigbjørn Skjæret	84ab83cc0b model : jina-embeddings-v3 support (#13693)	hai 4 meses
Georgi Gerganov	9ebebef62f llama : remove KV cache defragmentation logic (#15473)	hai 4 meses
Jie Fu (傅杰)	2f3dbffb17 common : fix incorrect print of non-ascii characters in the logging (#15466)	hai 4 meses
Jonathan Graehl	5cdb27e091 finetune: SGD optimizer, more CLI args (#13873)	hai 5 meses
Diego Devesa	d6818d06a6 llama : allow other bufts when overriding to CPU, add --no-repack option (#14990)	hai 5 meses
compilade	90083283ec imatrix : use GGUF to store importance matrices (#9400)	hai 6 meses
Georgi Gerganov	225e7a1438 llama : add high-throughput mode (#14363)	hai 6 meses
Georgi Gerganov	6ffd4e9c44 server : pre-calculate EOG logit biases (#14721)	hai 6 meses
Ruikai Peng	dd6e6d0b6a vocab : prevent tokenizer overflow (#14301)	hai 7 meses
fanyang	456af35eb7 build : suppress gcc15 compile warnings (#14261)	hai 7 meses
Diego Devesa	6adc3c3ebc llama : add thread safety test (#14035)	hai 7 meses
Georgi Gerganov	d3e64b9f49 llama : rework embeddings logic (#14208)	hai 7 meses
bandoti	2e89f76b7a common: fix issue with regex_escape routine on windows (#14133)	hai 7 meses
Georgi Gerganov	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	hai 7 meses
Max Krasnyansky	053b1539c0 threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling (#12995)	hai 7 meses
Đinh Trọng Huy	e0e3aa231d llama : add support for BertForSequenceClassification reranker (#13858)	hai 7 meses
Percy Piper	c508256db2 rpc : Fix build on OpenBSD (#13541)	hai 7 meses
Georgi Gerganov	a4090d1174 llama : remove llama_kv_cache_view API + remove deprecated (#13653)	hai 8 meses
Georgi Gerganov	e298d2fbd0 kv-cache : add SWA support (#13194)	hai 8 meses
psocolovsky	1dfbf2cf3a common : add load_progress_callback (#13617)	hai 8 meses
Olivier Chafik	3198405e98 `common`: add partial regex support (#12808)	hai 8 meses
Johannes Gäßler	10d2af0eaa llama/ggml: add LLM training support (#10544)	hai 8 meses
David Huang	7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)	hai 8 meses
Georgi Gerganov	51fb96b1ff context : remove logits_all flag (#13284)	hai 8 meses

Posterior Anterior

Commit History Buscar

Commit History