Commit History

Autor SHA1 Mensaxe Data
  Diego Devesa 7eee341bee common : use common_ prefix for common library functions (#9805) hai 1 ano
  Georgi Gerganov 8c475b97b8 rerank : use [SEP] token instead of [BOS] (#9737) hai 1 ano
  matiaslin faac0bae26 common : ensure llama_batch size does not exceed max size (#9668) hai 1 ano
  Georgi Gerganov f4d2b8846a llama : add reranking support (#9510) hai 1 ano
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) hai 1 ano
  Georgi Gerganov 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) hai 1 ano
  Ahmad Tameem 2b00fa7997 riscv : modify Makefile and add a RISCV_VECT to print log info (#9442) hai 1 ano
  Farbod Bijary 67155ab7f5 feat: Implements retrying logic for downloading models using --model-url flag (#9255) hai 1 ano
  Xuan Son Nguyen 6cd4e03444 arg : bring back missing ifdef (#9411) hai 1 ano
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) hai 1 ano
  Xuan Son Nguyen 3f7ccfd649 common : bring back missing args, add env var duplication check (#9375) hai 1 ano
  slaren a249843d89 common : restore --n-gpu-layers (#9371) hai 1 ano
  Xuan Son Nguyen 00b02bb249 imatrix : fix arg parser for imatrix (#9366) hai 1 ano
  Georgi Gerganov faf69d4237 llama : sanitize invalid tokens (#9357) hai 1 ano
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) hai 1 ano
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) hai 1 ano
  Aarni Koskela 815b1fb20a batched-bench : add `--output-format jsonl` option (#9293) hai 1 ano
  Radoslav Gerganov 82e3b03c11 rpc : make RPC servers come first in the device list (#9296) hai 1 ano
  Faisal Zaghloul 42c76d1358 Threadpool: take 2 (#8672) hai 1 ano
  Xuan Son Nguyen a77feb5d71 server : add some missing env variables (#9116) hai 1 ano
  Justine Tunney 436787f170 llama : fix time complexity of string replacement (#9163) hai 1 ano
  Herman Semenov 93bc3839f9 common: fixed not working find argument --n-gpu-layers-draft (#9175) hai 1 ano
  Xuan Son Nguyen fc54ef0d1c server : support reading arguments from environment variables (#9105) hai 1 ano
  Liu Jia fb487bb567 common : add support for cpu_get_num_physical_cores() on Windows (#8771) hai 1 ano
  Zhenwei Jin 4af8420afb common : remove duplicate function llama_should_add_bos_token (#8778) hai 1 ano
  fairydreaming 7c3f55c100 Add support for encoder-only T5 models (#8900) hai 1 ano
  Georgi Gerganov 45a55b91aa llama : better replace_all (cont) (#8926) hai 1 ano
  Xuan Son Nguyen 1e6f6554aa server : add lora hotswap endpoint (WIP) (#8857) hai 1 ano
  Liu Jia 0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823) hai 1 ano
  Igor Okulist afbbcf3c04 server : update llama-server embedding flag documentation (#8779) hai 1 ano