cturan/llama.cpp @ caf5681fcb47dfe9bafee94ef9aa8f669ac986c7

matteo caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196)		6 luni în urmă
..
batched-bench	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 luni în urmă
cvector-generator	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 luni în urmă
export-lora	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 luni în urmă
gguf-split	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 luni în urmă
imatrix	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 luni în urmă
llama-bench	fffcce535e llama-bench : add --no-warmup flag (#14224) (#14270)	7 luni în urmă
main	abf241045d main : honor --verbose-prompt on interactive prompts (#14350)	6 luni în urmă
mtmd	5d5c066de8 mtmd : fix Pixtral OOM with large images by capping image_size to 1024 (#14326)	6 luni în urmă
perplexity	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 luni în urmă
quantize	fa4a9f2a1c quantize : handle user-defined pruning of whole layers (blocks) (#13037)	6 luni în urmă
rpc	c508256db2 rpc : Fix build on OpenBSD (#13541)	7 luni în urmă
run	66aba7aca9 run : avoid double tokenization (#14327)	6 luni în urmă
server	caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196)	6 luni în urmă
tokenize	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 luni în urmă
tts	53f925074d sync : vendor (#13901)	7 luni în urmă
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 luni în urmă