Georgi Gerganov f4b2dcdf49 readme : fix typo [no ci] 1 year ago
..
baby-llama 42c76d1358 Threadpool: take 2 (#8672) 1 year ago
batched 6262d13e0b common : reimplement logging (#9418) 1 year ago
batched-bench 6262d13e0b common : reimplement logging (#9418) 1 year ago
batched.swift 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) 1 year ago
convert-llama2c-to-ggml 6102037bbb vocab : refactor tokenizer to reduce init overhead (#9449) 1 year ago
cvector-generator cad341d889 metal : reduce command encoding overhead (#9698) 1 year ago
deprecation-warning be6d7c0791 examples : remove `finetune` and `train-text-from-scratch` (#8669) 1 year ago
embedding f4d2b8846a llama : add reranking support (#9510) 1 year ago
eval-callback 6262d13e0b common : reimplement logging (#9418) 1 year ago
export-lora 6262d13e0b common : reimplement logging (#9418) 1 year ago
gbnf-validator df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
gen-docs afbbfaa537 server : add more env vars, improve gen-docs (#9635) 1 year ago
gguf 07283b1a90 gguf : handle null name during init (#8587) 1 year ago
gguf-hash 1666f92dcd gguf-hash : update clib.json to point to original xxhash repo (#8491) 1 year ago
gguf-split 76b37d1541 gguf-split : improve --split and --merge logic (#9619) 1 year ago
gritlm 6262d13e0b common : reimplement logging (#9418) 1 year ago
imatrix eca0fab44e imatrix : disable prompt escape by default (#9543) 1 year ago
infill cea1486ecf log : add CONT level for continuing previous log entry (#9610) 1 year ago
jeopardy 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
llama-bench 7be099fa81 llama-bench: correct argument parsing error message (#9524) 1 year ago
llama.android 5fb5e24811 llama : minor sampling refactor (2) (#9386) 1 year ago
llama.swiftui 5fb5e24811 llama : minor sampling refactor (2) (#9386) 1 year ago
llava cad341d889 metal : reduce command encoding overhead (#9698) 1 year ago
lookahead 6262d13e0b common : reimplement logging (#9418) 1 year ago
lookup 6262d13e0b common : reimplement logging (#9418) 1 year ago
main f4b2dcdf49 readme : fix typo [no ci] 1 year ago
main-cmake-pkg 07a3fc0608 Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258) 1 year ago
parallel 6262d13e0b common : reimplement logging (#9418) 1 year ago
passkey 6262d13e0b common : reimplement logging (#9418) 1 year ago
perplexity 37f8c7b4c9 perplexity : remove extra new lines after chunks (#9596) 1 year ago
quantize 63351143b2 quantize : improve type name parsing (#9570) 1 year ago
quantize-stats df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
retrieval 6262d13e0b common : reimplement logging (#9418) 1 year ago
rpc 841713e1e4 rpc : enable vulkan (#9714) 1 year ago
save-load-state bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) 1 year ago
server 8c475b97b8 rerank : use [SEP] token instead of [BOS] (#9737) 1 year ago
simple 6262d13e0b common : reimplement logging (#9418) 1 year ago
speculative b0f27361f3 sampling : avoid expensive softmax during greedy sampling (#9605) 1 year ago
sycl faf67b3de4 [SYCL]set context default value to avoid memory issue, update guide (#9476) 1 year ago
tokenize 6262d13e0b common : reimplement logging (#9418) 1 year ago
CMakeLists.txt 148844fe97 examples : remove benchmark (#9704) 1 year ago
Miku.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
base-translate.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
chat-13B.bat d9ad104440 Create chat-13B.bat (#592) 2 years ago
chat-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
chat-persistent.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
chat-vicuna.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
chat.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
convert_legacy_llama.py 672a6f1018 convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499) 1 year ago
json_schema_pydantic_example.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 year ago
json_schema_to_grammar.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 year ago
llama.vim 125d03a503 llama.vim : added api key support (#5090) 2 years ago
llm.vim ad9ddcff6e llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879) 2 years ago
pydantic_models_to_grammar.py 090fca7a07 pydantic : replace uses of __annotations__ with get_type_hints (#8474) 1 year ago
pydantic_models_to_grammar_examples.py 22f281aa16 examples : Rewrite pydantic_models_to_grammar_examples.py (#8493) 1 year ago
reason-act.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
regex_to_grammar.py e235b267a2 py : switch to snake_case (#8305) 1 year ago
server-llama2-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
server_embd.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 year ago
ts-type-to-grammar.sh ab9a3240a9 JSON schema conversion: ⚡️ faster repetitions, min/maxLength for strings, cap number length (#6555) 1 year ago