Commit History

Author SHA1 Message Date
  Douglas Hanley b5bd037832 llama : add support for qwen3 reranker (#15824) 3 months ago
  Georgi Gerganov 00131d6eaf tests : update for LLAMA_SET_ROWS=1 (#14961) 5 months ago
  Georgi Gerganov 225e7a1438 llama : add high-throughput mode (#14363) 6 months ago
  Sigbjørn Skjæret 88fc854b4b llama : improve sep token handling (#14272) 7 months ago
  Georgi Gerganov 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
  Sigbjørn Skjæret d17a809ef0 llama : support multiple classifier outputs and labels (#13940) 7 months ago
  Georgi Gerganov 79c137f776 examples : allow extracting embeddings from decoder contexts (#13797) 7 months ago
  Georgi Gerganov 6562e5a4d6 context : allow cache-less context for embeddings (#13108) 8 months ago
  Georgi Gerganov 226251ed56 embeddings : fix batch sizes (#13076) 8 months ago
  Georgi Gerganov e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) 10 months ago
  mgroeber9110 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago
  Diego Devesa 7eee341bee common : use common_ prefix for common library functions (#9805) 1 year ago
  Georgi Gerganov f4d2b8846a llama : add reranking support (#9510) 1 year ago
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) 1 year ago
  Georgi Gerganov 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) 1 year ago
  slaren 49006c67b4 llama : move random seed generation to the samplers (#9398) 1 year ago
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) 1 year ago
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) 1 year ago
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
  fairydreaming 7c3f55c100 Add support for encoder-only T5 models (#8900) 1 year ago
  Liu Jia 0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823) 1 year ago
  Yann Follet 646ef4a9cf embedding : more cli arguments (#7458) 1 year ago
  Douglas Hanley 80ea089d77 llama : allow pooled embeddings on any model (#7477) 1 year ago
  Georgi Gerganov 1442677f92 common : refactor cli arg parsing (#7675) 1 year ago
  Georgi Gerganov 6ff13987ad common : normalize naming style (#7462) 1 year ago
  dm4 ea3b0590ee embedding : free the batch after execution (#7297) 1 year ago
  Joan Fontanals b83cc3f5b3 llama : add Jina Embeddings architecture (#6826) 1 year ago
  Jared Van Bortel 1b67731e18 BERT tokenizer fixes (#6498) 1 year ago