Xuan-Son Nguyen 6df686bee6 server : refactor oai_parser_opt, move it to server_chat_params (#18937) 1 week ago
batched-bench 147a521636 tool/ex/tests: consistently free ctx, then model (#18168) 1 month ago
cli 6df686bee6 server : refactor oai_parser_opt, move it to server_chat_params (#18937) 1 week ago
completion 13f1e4a9ca llama : add adaptive-p sampler (#17927) 1 week ago
cvector-generator 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 month ago
export-lora 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 months ago
fit-params 64848deb18 llama-fit-params: free memory target per device (#18679) 2 weeks ago
gguf-split 6c2131773c cli: new CLI experience (#17824) 1 month ago
imatrix 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 month ago
llama-bench aa1dc3770a Setting mmap and direct_io to false as default in llama-bench.cpp (#18841) 1 week ago
mtmd c945aaaef2 mtmd : Fix ASR for LFM2.5-Audio-1.5B (#18876) 1 week ago
perplexity 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 month ago
quantize 33ded988ba quantize: prevent input/output file collision (#18451) 4 weeks ago
rpc d2d626938a Install rpc-server when GGML_RPC is ON. (#17149) 2 months ago
server 6df686bee6 server : refactor oai_parser_opt, move it to server_chat_params (#18937) 1 week ago
tokenize 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 months ago
tts 516a4ca9b5 refactor : remove libcurl, use OpenSSL when available (#18828) 2 weeks ago
CMakeLists.txt a180ba78c7 cmake: only build cli when server is enabled (#18670) 2 weeks ago