xiaofei a0f7016d17 rpc : fix cache directory initialization (#13188) vor 8 Monaten
..
batched 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
batched-bench 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
batched.swift e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
convert-llama2c-to-ggml afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) vor 1 Jahr
cvector-generator e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
deprecation-warning f112d198cd Update deprecation-warning.cpp (#10619) vor 1 Jahr
embedding 226251ed56 embeddings : fix batch sizes (#13076) vor 9 Monaten
eval-callback afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) vor 1 Jahr
export-lora 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
gen-docs 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) vor 1 Jahr
gguf 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) vor 1 Jahr
gguf-hash 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) vor 1 Jahr
gguf-split 23106f94ea gguf-split : --merge now respects --dry-run option (#12681) vor 9 Monaten
gritlm 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
imatrix e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
infill e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
jeopardy 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) vor 1 Jahr
llama-bench 5a63980117 llama-bench: fixed size of fields to correctly map to values (#13183) vor 8 Monaten
llama.android bd3f59f812 cmake : enable curl by default (#12761) vor 9 Monaten
llama.swiftui e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
llava 00e3e5a194 mtmd : add qwen2vl and qwen2.5vl (#13141) vor 8 Monaten
lookahead e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
lookup e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
main 6408210082 main : Fix Ctrl+D/newline handling (#12951) vor 9 Monaten
parallel a10b36c91a llama : refactor kv cache guard (#12695) vor 9 Monaten
passkey 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
perplexity 4ccea213bc hellaswag: display estimated score confidence interval (#12797) vor 9 Monaten
quantize 71e90e8813 quantize: Handle user-defined quantization levels for additional tensors (#12511) vor 9 Monaten
retrieval e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
rpc a0f7016d17 rpc : fix cache directory initialization (#13188) vor 8 Monaten
run b2034c2b55 contrib: support modelscope community (#12664) vor 9 Monaten
save-load-state e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
server e2e1ddb93a server : Prefilling assistant message in openai compatible API (#13174) vor 8 Monaten
simple afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) vor 1 Jahr
simple-chat e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) vor 10 Monaten
simple-cmake-pkg 68ff663a04 repo : update links to new url (#11886) vor 11 Monaten
speculative 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
speculative-simple 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
sycl 81c7e64fc2 dsiable curl lib check, this action is missed by commit bd3f59f81289b920bcc597a208c14f55e39ed37e (#12761) (#12937) vor 9 Monaten
tokenize afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) vor 1 Jahr
tts 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) vor 9 Monaten
CMakeLists.txt 13b4548877 cmake : do not include ./src as public for libllama (#13062) vor 9 Monaten
Miku.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) vor 1 Jahr
chat-13B.bat d9ad104440 Create chat-13B.bat (#592) vor 2 Jahren
chat-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) vor 1 Jahr
chat-persistent.sh 8fc393f246 scripts : fix pattern and get n_tokens in one go (#10221) vor 1 Jahr
chat-vicuna.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) vor 1 Jahr
chat.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) vor 1 Jahr
convert_legacy_llama.py a0ec17b32e metadata: Detailed Dataset Authorship Metadata (#8875) vor 1 Jahr
json_schema_pydantic_example.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) vor 1 Jahr
json_schema_to_grammar.py d5fe4e81bd grammar : handle maxItems == 0 in JSON schema (#13117) vor 9 Monaten
llama.vim 68ff663a04 repo : update links to new url (#11886) vor 11 Monaten
llm.vim ad9ddcff6e llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879) vor 2 Jahren
pydantic_models_to_grammar.py 090fca7a07 pydantic : replace uses of __annotations__ with get_type_hints (#8474) vor 1 Jahr
pydantic_models_to_grammar_examples.py 68ff663a04 repo : update links to new url (#11886) vor 11 Monaten
reason-act.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) vor 1 Jahr
regex_to_grammar.py e235b267a2 py : switch to snake_case (#8305) vor 1 Jahr
server-llama2-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) vor 1 Jahr
server_embd.py a19b5cef16 llama : fix FA when KV cache is not used (i.e. embeddings) (#12825) vor 9 Monaten
ts-type-to-grammar.sh ab9a3240a9 JSON schema conversion: ⚡️ faster repetitions, min/maxLength for strings, cap number length (#6555) vor 1 Jahr