Vinesh Janarthanan
|
441b72b91f
main : option to disable context shift (#9484)
|
1 year ago |
Georgi Gerganov
|
6262d13e0b
common : reimplement logging (#9418)
|
1 year ago |
Georgi Gerganov
|
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
|
1 year ago |
Xuan Son Nguyen
|
bfe76d4a17
common : move arg parser code to `arg.cpp` (#9388)
|
1 year ago |
Xuan Son Nguyen
|
3f7ccfd649
common : bring back missing args, add env var duplication check (#9375)
|
1 year ago |
Xuan Son Nguyen
|
1b9ae5189c
common : refactor arg parser (#9308)
|
1 year ago |
Georgi Gerganov
|
df270ef745
llama : refactor sampling v2 (#9294)
|
1 year ago |
Aarni Koskela
|
815b1fb20a
batched-bench : add `--output-format jsonl` option (#9293)
|
1 year ago |
Faisal Zaghloul
|
42c76d1358
Threadpool: take 2 (#8672)
|
1 year ago |
Xuan Son Nguyen
|
fc54ef0d1c
server : support reading arguments from environment variables (#9105)
|
1 year ago |
Zhenwei Jin
|
4af8420afb
common : remove duplicate function llama_should_add_bos_token (#8778)
|
1 year ago |
Georgi Gerganov
|
45a55b91aa
llama : better replace_all (cont) (#8926)
|
1 year ago |
Xuan Son Nguyen
|
1e6f6554aa
server : add lora hotswap endpoint (WIP) (#8857)
|
1 year ago |
Liu Jia
|
0a4ce78681
common : Changed tuple to struct (TODO fix) (#8823)
|
1 year ago |
Xuan Son Nguyen
|
de280085e7
examples : Fix `llama-export-lora` example (#8607)
|
1 year ago |
Derrick T. Woolworth
|
86e7299ef5
added support for Authorization Bearer tokens when downloading model (#8307)
|
1 year ago |
jaime-m-p
|
213701b51a
Detokenizer fixes (#8039)
|
1 year ago |
Douglas Hanley
|
d12f781074
llama : streamline embeddings from "non-embedding" models (#8087)
|
1 year ago |
Clint Herron
|
07a3fc0608
Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258)
|
1 year ago |
Xuan Son Nguyen
|
9ef0780062
Fix new line issue with chat template, disable template when in-prefix/suffix is set (#8203)
|
1 year ago |
Sigbjørn Skjæret
|
38373cfbab
Add SPM infill support (#8016)
|
1 year ago |
Xuan Son Nguyen
|
16791b8f0b
Add chatml fallback for cpp `llama_chat_apply_template` (#8160)
|
1 year ago |
Xuan Son Nguyen
|
49c03c79cd
cvector: better prompt handling, add "mean vector" method (#8069)
|
1 year ago |
Xuan Son Nguyen
|
48e6b92cc3
Add chat template support for llama-cli (#8068)
|
1 year ago |
Yann Follet
|
646ef4a9cf
embedding : more cli arguments (#7458)
|
1 year ago |
Frank Mai
|
b96f9afb0d
chore: clean useless beam search param (#7985)
|
1 year ago |
Xuan Son Nguyen
|
0c7b3595b9
Add `cvector-generator` example (#7514)
|
1 year ago |
Olivier Chafik
|
d4d915d351
url: save -mu downloads to new cache location (#7826)
|
1 year ago |
sasha0552
|
7a16ce7db2
server : smart slot selection using Longest Common Prefix (#7728)
|
1 year ago |
Georgi Gerganov
|
ee459f40f6
server : fix --threads-http arg (#7801)
|
1 year ago |