cturan/llama.cpp @ e14e842e87103ec2a004770cec95a3f94f861bda

Georgi Gerganov 7956bb4d7f bench : cache the llama_context state at computed depth (#16944)		2 months ago
..
batched-bench	7fd205a8e8 scripts : add script to bench models (#16894)	2 months ago
cvector-generator	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
export-lora	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
gguf-split	1d660d2fae ci : use smaller model (#16168)	3 months ago
imatrix	fe6a9882ac Manually link -lbsd to resolve flock symbol on AIX (#16610)	2 months ago
llama-bench	7956bb4d7f bench : cache the llama_context state at computed depth (#16944)	2 months ago
main	2f61c0f5bf llama-cli: prevent spurious assistant token (#16202)	3 months ago
mtmd	9008027aa3 hparams : add n_embd_inp() to support extended embed (#16928)	2 months ago
perplexity	3ffd0fae47 perplexity : show more kl-divergence data (#16321)	3 months ago
quantize	1d660d2fae ci : use smaller model (#16168)	3 months ago
rpc	41386cf365 rpc : report actual free memory (#16616)	3 months ago
run	fe6a9882ac Manually link -lbsd to resolve flock symbol on AIX (#16610)	2 months ago
server	16bcc1259d kv-cache : pad the cache size to 256 for performance (#17046)	2 months ago
tokenize	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
tts	34fcc5a4ac model : Apertus model implementation (#15852)	3 months ago
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 months ago