Lennart Austenfeld 18361c579c server: fix memory reservations in populate_token_probs (#18787) 1 semana atrás
..
batched-bench 147a521636 tool/ex/tests: consistently free ctx, then model (#18168) 1 mês atrás
cli 13f1e4a9ca llama : add adaptive-p sampler (#17927) 1 semana atrás
completion 13f1e4a9ca llama : add adaptive-p sampler (#17927) 1 semana atrás
cvector-generator 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 mês atrás
export-lora 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 meses atrás
fit-params 64848deb18 llama-fit-params: free memory target per device (#18679) 3 semanas atrás
gguf-split 6c2131773c cli: new CLI experience (#17824) 1 mês atrás
imatrix 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 mês atrás
llama-bench aa1dc3770a Setting mmap and direct_io to false as default in llama-bench.cpp (#18841) 1 semana atrás
mtmd c945aaaef2 mtmd : Fix ASR for LFM2.5-Audio-1.5B (#18876) 1 semana atrás
perplexity 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 mês atrás
quantize 33ded988ba quantize: prevent input/output file collision (#18451) 4 semanas atrás
rpc d2d626938a Install rpc-server when GGML_RPC is ON. (#17149) 2 meses atrás
server 18361c579c server: fix memory reservations in populate_token_probs (#18787) 1 semana atrás
tokenize 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 meses atrás
tts 516a4ca9b5 refactor : remove libcurl, use OpenSSL when available (#18828) 2 semanas atrás
CMakeLists.txt a180ba78c7 cmake: only build cli when server is enabled (#18670) 2 semanas atrás