krystiancha 05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852) 1 年之前
..
batched 644fd71b44 sampling : refactor + optimize penalties sampler (#10803) 1 年之前
batched-bench 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
batched.swift 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) 1 年之前
convert-llama2c-to-ggml 8648c52101 make : deprecate (#10514) 1 年之前
cvector-generator 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
deprecation-warning f112d198cd Update deprecation-warning.cpp (#10619) 1 年之前
embedding 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
eval-callback 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
export-lora 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
gbnf-validator 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
gen-docs 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
gguf 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
gguf-hash 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
gguf-split cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 年之前
gritlm 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
imatrix 8648c52101 make : deprecate (#10514) 1 年之前
infill 82bca2257b readme : add option, update default value, fix formatting (#10271) 1 年之前
jeopardy 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 年之前
llama-bench cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 年之前
llama.android cda0e4b648 llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745) 1 年之前
llama.swiftui 43ed389a3f llama : use cmake for swift build (#10525) 1 年之前
llava 4ddd199f6f llava : Allow locally downloaded models for QwenVL (#10833) 1 年之前
lookahead 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
lookup 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
main 644fd71b44 sampling : refactor + optimize penalties sampler (#10803) 1 年之前
main-cmake-pkg 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
parallel 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
passkey 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
perplexity 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
quantize 1a31d0dc00 Update README.md (#10772) 1 年之前
quantize-stats 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
retrieval cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 年之前
rpc 9f40989351 ggml : move CPU backend to a separate file (#10144) 1 年之前
run c27ac678dd Opt class for positional argument handling (#10508) 1 年之前
save-load-state 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
server 05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852) 1 年之前
simple 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
simple-chat 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
speculative 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
speculative-simple 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 年之前
sycl faf67b3de4 [SYCL]set context default value to avoid memory issue, update guide (#9476) 1 年之前
tokenize cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 年之前
CMakeLists.txt adffa6ffd5 common : improve -ctv -ctk CLI arguments (#10806) 1 年之前
Miku.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 年之前
chat-13B.bat d9ad104440 Create chat-13B.bat (#592) 2 年之前
chat-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 年之前
chat-persistent.sh 8fc393f246 scripts : fix pattern and get n_tokens in one go (#10221) 1 年之前
chat-vicuna.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 年之前
chat.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 年之前
convert_legacy_llama.py a0ec17b32e metadata: Detailed Dataset Authorship Metadata (#8875) 1 年之前
json_schema_pydantic_example.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 年之前
json_schema_to_grammar.py 66c2c93082 grammar : fix JSON Schema for string regex with top-level alt. (#9903) 1 年之前
llama.vim 2d3aba9ee8 llama.vim : bump generation time limit to 3s [no ci] 1 年之前
llm.vim ad9ddcff6e llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879) 2 年之前
pydantic_models_to_grammar.py 090fca7a07 pydantic : replace uses of __annotations__ with get_type_hints (#8474) 1 年之前
pydantic_models_to_grammar_examples.py 22f281aa16 examples : Rewrite pydantic_models_to_grammar_examples.py (#8493) 1 年之前
reason-act.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 年之前
regex_to_grammar.py e235b267a2 py : switch to snake_case (#8305) 1 年之前
server-llama2-13B.sh 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 年之前
server_embd.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 年之前
ts-type-to-grammar.sh ab9a3240a9 JSON schema conversion: ⚡️ faster repetitions, min/maxLength for strings, cap number length (#6555) 1 年之前