theraininsky a7b8ce2260 llama-bench : fix unexpected global variable initialize sequence issue (#11832) 11 mesi fa
..
batched afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
batched-bench afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
batched.swift f117d84b48 swift : fix llama-vocab api usage (#11645) 11 mesi fa
convert-llama2c-to-ggml afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
cvector-generator afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
deprecation-warning f112d198cd Update deprecation-warning.cpp (#10619) 1 anno fa
embedding afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
eval-callback afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
export-lora e28245f35f export-lora : fix tok_embd tensor (#11330) 1 anno fa
gbnf-validator 8b576b6c55 Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639) 11 mesi fa
gen-docs 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 anno fa
gguf 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) 1 anno fa
gguf-hash 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) 1 anno fa
gguf-split f11cfdfd7f ci : use -no-cnv in gguf-split tests (#11254) 1 anno fa
gritlm afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
imatrix bfd11a2344 Fix: Compile failure due to Microsoft STL breaking change (#11836) 11 mesi fa
infill afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
jeopardy 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 anno fa
llama-bench a7b8ce2260 llama-bench : fix unexpected global variable initialize sequence issue (#11832) 11 mesi fa
llama.android 3edfa7d375 llama.android: add field formatChat to control whether to parse special tokens when send message (#11270) 1 anno fa
llama.swiftui f117d84b48 swift : fix llama-vocab api usage (#11645) 11 mesi fa
llava 1ec208083c llava: add quantization for the visual projector LLAVA, Qwen2VL (#11644) 11 mesi fa
lookahead afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
lookup afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
main 27e8a23300 sampling: add Top-nσ sampler (#11223) 11 mesi fa
parallel afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
passkey afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
perplexity bfd11a2344 Fix: Compile failure due to Microsoft STL breaking change (#11836) 11 mesi fa
quantize f11cfdfd7f ci : use -no-cnv in gguf-split tests (#11254) 1 anno fa
quantize-stats afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
retrieval afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
rpc 86bf31cfe6 rpc-server : add support for the SYCL backend (#10934) 1 anno fa
run 19d3c8293b There's a better way of clearing lines (#11756) 11 mesi fa
save-load-state afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
server c1f958c038 server : (docs) Update wrong tool calling example (#11809) 11 mesi fa
simple afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
simple-chat 6171c9d258 Add Jinja template support (#11016) 1 anno fa
simple-cmake-pkg 19f65187cb cmake: add ggml find package (#11369) 11 mesi fa
speculative afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
speculative-simple afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
sycl faf67b3de4 [SYCL]set context default value to avoid memory issue, update guide (#9476) 1 anno fa
tokenize afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 anno fa
tts 6390a998bf tts : add guide tokens support (#11186) 1 anno fa
CMakeLists.txt 0bf2d10c55 tts : add OuteTTS support (#10784) 1 anno fa
Miku.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 anno fa
chat-13B.bat d9ad104440 Create chat-13B.bat (#592) 2 anni fa
chat-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 anno fa
chat-persistent.sh 8fc393f246 scripts : fix pattern and get n_tokens in one go (#10221) 1 anno fa
chat-vicuna.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 anno fa
chat.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 anno fa
convert_legacy_llama.py a0ec17b32e metadata: Detailed Dataset Authorship Metadata (#8875) 1 anno fa
json_schema_pydantic_example.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 anno fa
json_schema_to_grammar.py 66c2c93082 grammar : fix JSON Schema for string regex with top-level alt. (#9903) 1 anno fa
llama.vim 2d3aba9ee8 llama.vim : bump generation time limit to 3s [no ci] 1 anno fa
llm.vim ad9ddcff6e llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879) 2 anni fa
pydantic_models_to_grammar.py 090fca7a07 pydantic : replace uses of __annotations__ with get_type_hints (#8474) 1 anno fa
pydantic_models_to_grammar_examples.py 22f281aa16 examples : Rewrite pydantic_models_to_grammar_examples.py (#8493) 1 anno fa
reason-act.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 anno fa
regex_to_grammar.py e235b267a2 py : switch to snake_case (#8305) 1 anno fa
server-llama2-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 anno fa
server_embd.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 anno fa
ts-type-to-grammar.sh ab9a3240a9 JSON schema conversion: ⚡️ faster repetitions, min/maxLength for strings, cap number length (#6555) 1 anno fa