cturan/llama.cpp @ b6814

Daniel Bevenius 56b4795842 model-conversion : add support for SentenceTransformers (#16387)		3 mesi fa
..
batched	267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694)	9 mesi fa
batched.swift	29f538ac63 examples : remove references to `make` in examples [no ci] (#15457)	4 mesi fa
convert-llama2c-to-ggml	a81283820a gguf: gguf_writer refactor (#15691)	4 mesi fa
deprecation-warning	f112d198cd Update deprecation-warning.cpp (#10619)	1 anno fa
diffusion	6d758839ff Add LLaDA-7b-MoE diffusion model (#16003)	4 mesi fa
embedding	b5bd037832 llama : add support for qwen3 reranker (#15824)	3 mesi fa
eval-callback	624207e676 devops: add s390x & ppc64le CI (#15925)	3 mesi fa
gen-docs	7cc2d2c889 ggml : move AMX to the CPU backend (#10570)	1 anno fa
gguf	53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030)	1 anno fa
gguf-hash	53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030)	1 anno fa
llama.android	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 mesi fa
llama.swiftui	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 mesi fa
lookahead	2f37014073 lookahead : add sample command to readme (#15447)	4 mesi fa
lookup	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 mesi fa
model-conversion	56b4795842 model-conversion : add support for SentenceTransformers (#16387)	3 mesi fa
parallel	2adf8d83ac parallel : add option for different RNG seeds (#14757)	6 mesi fa
passkey	29f538ac63 examples : remove references to `make` in examples [no ci] (#15457)	4 mesi fa
retrieval	29f538ac63 examples : remove references to `make` in examples [no ci] (#15457)	4 mesi fa
save-load-state	00131d6eaf tests : update for LLAMA_SET_ROWS=1 (#14961)	5 mesi fa
simple	1cbd80f8cf examples : support encoder-decoder models in the simple example (#16002)	4 mesi fa
simple-chat	d7f5f4e578 simple-chat : fix context-exceeded condition (#14494)	6 mesi fa
simple-cmake-pkg	68ff663a04 repo : update links to new url (#11886)	11 mesi fa
speculative	e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665)	4 mesi fa
speculative-simple	d8914fc47e common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191)	5 mesi fa
sycl	29f538ac63 examples : remove references to `make` in examples [no ci] (#15457)	4 mesi fa
training	5cdb27e091 finetune: SGD optimizer, more CLI args (#13873)	5 mesi fa
CMakeLists.txt	432cf4304c codeowners : update + cleanup (#16174)	3 mesi fa
convert_legacy_llama.py	a0ec17b32e metadata: Detailed Dataset Authorship Metadata (#8875)	1 anno fa
json_schema_pydantic_example.py	3fd62a6b1c py : type-check all Python scripts with Pyright (#8341)	1 anno fa
json_schema_to_grammar.py	7057faf64b json : support `enum` values within `allOf` (#15830)	4 mesi fa
llama.vim	9ebebef62f llama : remove KV cache defragmentation logic (#15473)	4 mesi fa
pydantic_models_to_grammar.py	090fca7a07 pydantic : replace uses of __annotations__ with get_type_hints (#8474)	1 anno fa
pydantic_models_to_grammar_examples.py	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 mesi fa
reason-act.sh	e9b6350e61 scripts : make the shell scripts cross-platform (#14341)	6 mesi fa
regex_to_grammar.py	e235b267a2 py : switch to snake_case (#8305)	1 anno fa
server-llama2-13B.sh	e9b6350e61 scripts : make the shell scripts cross-platform (#14341)	6 mesi fa
server_embd.py	a19b5cef16 llama : fix FA when KV cache is not used (i.e. embeddings) (#12825)	9 mesi fa
ts-type-to-grammar.sh	e9b6350e61 scripts : make the shell scripts cross-platform (#14341)	6 mesi fa