cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	01612b7409 llama : reuse compute graphs (#14482)	6 months ago
Georgi Gerganov	ad57d3edd2 batch : fix uninitialized has_cpl flag (#14733)	6 months ago
Georgi Gerganov	225e7a1438 llama : add high-throughput mode (#14363)	6 months ago
Georgi Gerganov	67d1ef23c6 batch : add optional for sequential equal split (#14511)	6 months ago
Georgi Gerganov	c79184d2d1 batch : add n_used count (#14512)	6 months ago
Georgi Gerganov	4c9fdfbe15 ubatch : new splitting logic (#14217)	7 months ago
Georgi Gerganov	d3e64b9f49 llama : rework embeddings logic (#14208)	7 months ago
Georgi Gerganov	b9912ac570 batch : auto-gen positions + verify multi-sequence input (#14177)	7 months ago
Georgi Gerganov	80709b70a2 batch : add LLAMA_BATCH_DEBUG environment variable (#14172)	7 months ago
Georgi Gerganov	60c666347b batch : rework llama_batch_allocr (#14153)	7 months ago
Georgi Gerganov	f6e1a7aa87 context : simplify output counting logic during decode (#14142)	7 months ago
Georgi Gerganov	c3ee46fab4 batch : remove logits_all flag (#14141)	7 months ago
Georgi Gerganov	12d0188c0d kv-cache : refactor + add llama_memory_state_i (#13746)	7 months ago
Georgi Gerganov	c642bc014c kv-cache : separate recurrent vs non-recurrent impl (#12799)	8 months ago
Georgi Gerganov	e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)	10 months ago
Georgi Gerganov	f66f582927 llama : refactor `src/llama.cpp` (#10902)	1 year ago