Historique des commits

Auteur SHA1 Message Date
  Gilad S. 2194200278 fix: allocating CPU buffer with size `0` (#9917) il y a 1 an
  Gilad S. 73afe681aa fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875) il y a 1 an
  Daniel Bevenius 9e04102448 llama : suppress conversion from 'size_t' to 'int' (#9046) il y a 1 an
  Daniel Bevenius dbf18e4de9 llava : fix typo in error message [no ci] (#9884) il y a 1 an
  Joe Eli McIlvain 66c2c93082 grammar : fix JSON Schema for string regex with top-level alt. (#9903) il y a 1 an
  Molly Sophia 10433e8b45 llama : add tensor name for "result_norm" (#9907) il y a 1 an
  Alexey Parfenov 1f66b699c4 server : fix the disappearance of the end of the text (#9867) il y a 1 an
  Georgi Gerganov 0e41b300ed sync : ggml il y a 1 an
  Daniel Bevenius cd60b88bf7 ggml-alloc : remove buffer_id from leaf_alloc (ggml/987) il y a 1 an
  leo-pony becfd387f6 [CANN] Fix cann compilation error (#9891) il y a 1 an
  Georgi Gerganov 755a9b2bf0 llama : add infill sampler (#9896) il y a 1 an
  Georgi Gerganov 223c25a72f server : improve infill context reuse (#9894) il y a 1 an
  MaggotHATE fbc98b748e sampling : add XTC sampler (#9742) il y a 1 an
  Georgi Gerganov dcdd535302 server : update preact (#9895) il y a 1 an
  Michał Tuszyński 4c42f93b22 readme : update bindings list (#9889) il y a 1 an
  VoidIsVoid a89f75e1b7 server : handle "logprobs" field with false value (#9871) il y a 1 an
  agray3 13dca2a54a Vectorize load instructions in dmmv f16 CUDA kernel (#9816) il y a 1 an
  Georgi Gerganov d4c19c0f5c server : accept extra_context for the infill endpoint (#9874) il y a 1 an
  Georgi Gerganov c7181bd294 server : reuse cached context chunks (#9866) il y a 1 an
  Georgi Gerganov 92be9f1216 flake.lock: Update (#9870) il y a 1 an
  Georgi Gerganov edc265661c server : add option to time limit the generation phase (#9865) il y a 1 an
  Georgi Gerganov 1bde94dd02 server : remove self-extend features (#9860) il y a 1 an
  Georgi Gerganov 95c76e8e92 server : remove legacy system_prompt feature (#9857) il y a 1 an
  Georgi Gerganov 11ac9800af llama : improve infill support and special token detection (#9798) il y a 1 an
  R0CKSTAR 943d20b411 musa : update doc (#9856) il y a 1 an
  Diego Devesa 96776405a1 ggml : move more prints to the ggml log system (#9839) il y a 1 an
  Diego Devesa 7eee341bee common : use common_ prefix for common library functions (#9805) il y a 1 an
  Diego Devesa 0e9f760eb1 rpc : add backend registry / device interfaces (#9812) il y a 1 an
  R0CKSTAR cf8e0a3bb9 musa: add docker image support (#9685) il y a 1 an
  Diego Devesa c7499c557c examples : do not use common library in simple example (#9803) il y a 1 an