MaggotHATE
|
fbc98b748e
sampling : add XTC sampler (#9742)
|
1 year ago |
Georgi Gerganov
|
11ac9800af
llama : improve infill support and special token detection (#9798)
|
1 year ago |
Diego Devesa
|
7eee341bee
common : use common_ prefix for common library functions (#9805)
|
1 year ago |
Georgi Gerganov
|
8c475b97b8
rerank : use [SEP] token instead of [BOS] (#9737)
|
1 year ago |
matiaslin
|
faac0bae26
common : ensure llama_batch size does not exceed max size (#9668)
|
1 year ago |
Georgi Gerganov
|
f4d2b8846a
llama : add reranking support (#9510)
|
1 year ago |
Georgi Gerganov
|
6262d13e0b
common : reimplement logging (#9418)
|
1 year ago |
Georgi Gerganov
|
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
|
1 year ago |
Ahmad Tameem
|
2b00fa7997
riscv : modify Makefile and add a RISCV_VECT to print log info (#9442)
|
1 year ago |
Farbod Bijary
|
67155ab7f5
feat: Implements retrying logic for downloading models using --model-url flag (#9255)
|
1 year ago |
Xuan Son Nguyen
|
6cd4e03444
arg : bring back missing ifdef (#9411)
|
1 year ago |
Xuan Son Nguyen
|
bfe76d4a17
common : move arg parser code to `arg.cpp` (#9388)
|
1 year ago |
Xuan Son Nguyen
|
3f7ccfd649
common : bring back missing args, add env var duplication check (#9375)
|
1 year ago |
slaren
|
a249843d89
common : restore --n-gpu-layers (#9371)
|
1 year ago |
Xuan Son Nguyen
|
00b02bb249
imatrix : fix arg parser for imatrix (#9366)
|
1 year ago |
Georgi Gerganov
|
faf69d4237
llama : sanitize invalid tokens (#9357)
|
1 year ago |
Xuan Son Nguyen
|
1b9ae5189c
common : refactor arg parser (#9308)
|
1 year ago |
Georgi Gerganov
|
df270ef745
llama : refactor sampling v2 (#9294)
|
1 year ago |
Aarni Koskela
|
815b1fb20a
batched-bench : add `--output-format jsonl` option (#9293)
|
1 year ago |
Radoslav Gerganov
|
82e3b03c11
rpc : make RPC servers come first in the device list (#9296)
|
1 year ago |
Faisal Zaghloul
|
42c76d1358
Threadpool: take 2 (#8672)
|
1 year ago |
Xuan Son Nguyen
|
a77feb5d71
server : add some missing env variables (#9116)
|
1 year ago |
Justine Tunney
|
436787f170
llama : fix time complexity of string replacement (#9163)
|
1 year ago |
Herman Semenov
|
93bc3839f9
common: fixed not working find argument --n-gpu-layers-draft (#9175)
|
1 year ago |
Xuan Son Nguyen
|
fc54ef0d1c
server : support reading arguments from environment variables (#9105)
|
1 year ago |
Liu Jia
|
fb487bb567
common : add support for cpu_get_num_physical_cores() on Windows (#8771)
|
1 year ago |
Zhenwei Jin
|
4af8420afb
common : remove duplicate function llama_should_add_bos_token (#8778)
|
1 year ago |
fairydreaming
|
7c3f55c100
Add support for encoder-only T5 models (#8900)
|
1 year ago |
Georgi Gerganov
|
45a55b91aa
llama : better replace_all (cont) (#8926)
|
1 year ago |
Xuan Son Nguyen
|
1e6f6554aa
server : add lora hotswap endpoint (WIP) (#8857)
|
1 year ago |