cturan/llama.cpp @ 90083283ec254fa8d33897746dea229aee401b37

compilade 90083283ec imatrix : use GGUF to store importance matrices (#9400)		6 bulan lalu
..
batched-bench	225e7a1438 llama : add high-throughput mode (#14363)	6 bulan lalu
cvector-generator	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 bulan lalu
export-lora	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 bulan lalu
gguf-split	e9b6350e61 scripts : make the shell scripts cross-platform (#14341)	6 bulan lalu
imatrix	90083283ec imatrix : use GGUF to store importance matrices (#9400)	6 bulan lalu
llama-bench	fffcce535e llama-bench : add --no-warmup flag (#14224) (#14270)	7 bulan lalu
main	abf241045d main : honor --verbose-prompt on interactive prompts (#14350)	6 bulan lalu
mtmd	28657a8229 ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)	6 bulan lalu
perplexity	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 bulan lalu
quantize	90083283ec imatrix : use GGUF to store importance matrices (#9400)	6 bulan lalu
rpc	c508256db2 rpc : Fix build on OpenBSD (#13541)	7 bulan lalu
run	a457551332 cmake : do not search for curl libraries by ourselves (#14613)	6 bulan lalu
server	6ffd4e9c44 server : pre-calculate EOG logit biases (#14721)	6 bulan lalu
tokenize	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 bulan lalu
tts	53f925074d sync : vendor (#13901)	7 bulan lalu
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 bulan lalu