| .. |
|
baby-llama
|
42c76d1358
Threadpool: take 2 (#8672)
|
1 год назад |
|
batched
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
batched-bench
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
batched.swift
|
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
|
1 год назад |
|
convert-llama2c-to-ggml
|
6102037bbb
vocab : refactor tokenizer to reduce init overhead (#9449)
|
1 год назад |
|
cvector-generator
|
cad341d889
metal : reduce command encoding overhead (#9698)
|
1 год назад |
|
deprecation-warning
|
be6d7c0791
examples : remove `finetune` and `train-text-from-scratch` (#8669)
|
1 год назад |
|
embedding
|
f4d2b8846a
llama : add reranking support (#9510)
|
1 год назад |
|
eval-callback
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
export-lora
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
gbnf-validator
|
df270ef745
llama : refactor sampling v2 (#9294)
|
1 год назад |
|
gen-docs
|
afbbfaa537
server : add more env vars, improve gen-docs (#9635)
|
1 год назад |
|
gguf
|
07283b1a90
gguf : handle null name during init (#8587)
|
1 год назад |
|
gguf-hash
|
1666f92dcd
gguf-hash : update clib.json to point to original xxhash repo (#8491)
|
1 год назад |
|
gguf-split
|
76b37d1541
gguf-split : improve --split and --merge logic (#9619)
|
1 год назад |
|
gritlm
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
imatrix
|
eca0fab44e
imatrix : disable prompt escape by default (#9543)
|
1 год назад |
|
infill
|
cea1486ecf
log : add CONT level for continuing previous log entry (#9610)
|
1 год назад |
|
jeopardy
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
llama-bench
|
7be099fa81
llama-bench: correct argument parsing error message (#9524)
|
1 год назад |
|
llama.android
|
5fb5e24811
llama : minor sampling refactor (2) (#9386)
|
1 год назад |
|
llama.swiftui
|
5fb5e24811
llama : minor sampling refactor (2) (#9386)
|
1 год назад |
|
llava
|
cad341d889
metal : reduce command encoding overhead (#9698)
|
1 год назад |
|
lookahead
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
lookup
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
main
|
cea1486ecf
log : add CONT level for continuing previous log entry (#9610)
|
1 год назад |
|
main-cmake-pkg
|
07a3fc0608
Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258)
|
1 год назад |
|
parallel
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
passkey
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
perplexity
|
37f8c7b4c9
perplexity : remove extra new lines after chunks (#9596)
|
1 год назад |
|
quantize
|
63351143b2
quantize : improve type name parsing (#9570)
|
1 год назад |
|
quantize-stats
|
df270ef745
llama : refactor sampling v2 (#9294)
|
1 год назад |
|
retrieval
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
rpc
|
841713e1e4
rpc : enable vulkan (#9714)
|
1 год назад |
|
save-load-state
|
bfe76d4a17
common : move arg parser code to `arg.cpp` (#9388)
|
1 год назад |
|
server
|
08a43d05b6
py : update transfomers version (#9694)
|
1 год назад |
|
simple
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
speculative
|
b0f27361f3
sampling : avoid expensive softmax during greedy sampling (#9605)
|
1 год назад |
|
sycl
|
faf67b3de4
[SYCL]set context default value to avoid memory issue, update guide (#9476)
|
1 год назад |
|
tokenize
|
6262d13e0b
common : reimplement logging (#9418)
|
1 год назад |
|
CMakeLists.txt
|
148844fe97
examples : remove benchmark (#9704)
|
1 год назад |
|
Miku.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
base-translate.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
chat-13B.bat
|
d9ad104440
Create chat-13B.bat (#592)
|
2 лет назад |
|
chat-13B.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
chat-persistent.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
chat-vicuna.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
chat.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
convert_legacy_llama.py
|
672a6f1018
convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499)
|
1 год назад |
|
json_schema_pydantic_example.py
|
3fd62a6b1c
py : type-check all Python scripts with Pyright (#8341)
|
1 год назад |
|
json_schema_to_grammar.py
|
3fd62a6b1c
py : type-check all Python scripts with Pyright (#8341)
|
1 год назад |
|
llama.vim
|
125d03a503
llama.vim : added api key support (#5090)
|
2 лет назад |
|
llm.vim
|
ad9ddcff6e
llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879)
|
2 лет назад |
|
pydantic_models_to_grammar.py
|
090fca7a07
pydantic : replace uses of __annotations__ with get_type_hints (#8474)
|
1 год назад |
|
pydantic_models_to_grammar_examples.py
|
22f281aa16
examples : Rewrite pydantic_models_to_grammar_examples.py (#8493)
|
1 год назад |
|
reason-act.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
regex_to_grammar.py
|
e235b267a2
py : switch to snake_case (#8305)
|
1 год назад |
|
server-llama2-13B.sh
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 год назад |
|
server_embd.py
|
3fd62a6b1c
py : type-check all Python scripts with Pyright (#8341)
|
1 год назад |
|
ts-type-to-grammar.sh
|
ab9a3240a9
JSON schema conversion: ⚡️ faster repetitions, min/maxLength for strings, cap number length (#6555)
|
1 год назад |