Daniel Bevenius 56b4795842 model-conversion : add support for SentenceTransformers (#16387) 3 mesi fa
..
batched 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) 9 mesi fa
batched.swift 29f538ac63 examples : remove references to `make` in examples [no ci] (#15457) 4 mesi fa
convert-llama2c-to-ggml a81283820a gguf: gguf_writer refactor (#15691) 4 mesi fa
deprecation-warning f112d198cd Update deprecation-warning.cpp (#10619) 1 anno fa
diffusion 6d758839ff Add LLaDA-7b-MoE diffusion model (#16003) 4 mesi fa
embedding b5bd037832 llama : add support for qwen3 reranker (#15824) 3 mesi fa
eval-callback 624207e676 devops: add s390x & ppc64le CI (#15925) 3 mesi fa
gen-docs 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 anno fa
gguf 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) 1 anno fa
gguf-hash 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) 1 anno fa
llama.android 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 mesi fa
llama.swiftui 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 mesi fa
lookahead 2f37014073 lookahead : add sample command to readme (#15447) 4 mesi fa
lookup 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 mesi fa
model-conversion 56b4795842 model-conversion : add support for SentenceTransformers (#16387) 3 mesi fa
parallel 2adf8d83ac parallel : add option for different RNG seeds (#14757) 6 mesi fa
passkey 29f538ac63 examples : remove references to `make` in examples [no ci] (#15457) 4 mesi fa
retrieval 29f538ac63 examples : remove references to `make` in examples [no ci] (#15457) 4 mesi fa
save-load-state 00131d6eaf tests : update for LLAMA_SET_ROWS=1 (#14961) 5 mesi fa
simple 1cbd80f8cf examples : support encoder-decoder models in the simple example (#16002) 4 mesi fa
simple-chat d7f5f4e578 simple-chat : fix context-exceeded condition (#14494) 6 mesi fa
simple-cmake-pkg 68ff663a04 repo : update links to new url (#11886) 11 mesi fa
speculative e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 4 mesi fa
speculative-simple d8914fc47e common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191) 5 mesi fa
sycl 29f538ac63 examples : remove references to `make` in examples [no ci] (#15457) 4 mesi fa
training 5cdb27e091 finetune: SGD optimizer, more CLI args (#13873) 5 mesi fa
CMakeLists.txt 432cf4304c codeowners : update + cleanup (#16174) 3 mesi fa
convert_legacy_llama.py a0ec17b32e metadata: Detailed Dataset Authorship Metadata (#8875) 1 anno fa
json_schema_pydantic_example.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 anno fa
json_schema_to_grammar.py 7057faf64b json : support `enum` values within `allOf` (#15830) 4 mesi fa
llama.vim 9ebebef62f llama : remove KV cache defragmentation logic (#15473) 4 mesi fa
pydantic_models_to_grammar.py 090fca7a07 pydantic : replace uses of __annotations__ with get_type_hints (#8474) 1 anno fa
pydantic_models_to_grammar_examples.py 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 mesi fa
reason-act.sh e9b6350e61 scripts : make the shell scripts cross-platform (#14341) 6 mesi fa
regex_to_grammar.py e235b267a2 py : switch to snake_case (#8305) 1 anno fa
server-llama2-13B.sh e9b6350e61 scripts : make the shell scripts cross-platform (#14341) 6 mesi fa
server_embd.py a19b5cef16 llama : fix FA when KV cache is not used (i.e. embeddings) (#12825) 9 mesi fa
ts-type-to-grammar.sh e9b6350e61 scripts : make the shell scripts cross-platform (#14341) 6 mesi fa