Commit History

Author SHA1 Message Date
  Johannes Gäßler 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) 1 year ago
  Diego Devesa 017cc5f446 ggml-backend : only offload from host buffers (fix) (#11124) 1 year ago
  Diego Devesa a3d50bc022 ggml-backend : only offload from host buffers (#11120) 1 year ago
  Radoslav Gerganov a4dd490069 rpc : code cleanup (#11107) 1 year ago
  Akarshan Biswas c0d6f790d0 SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (#11087) 1 year ago
  Eric Curtin dc7cef9f37 llama-run : fix context size (#11094) 1 year ago
  Georgi Gerganov ecebbd292d llama : remove unused headers (#11109) 1 year ago
  Xuan Son Nguyen 96be8c3264 github : add cmd line field to bug report (#11090) 1 year ago
  Georgi Gerganov e6e7c75d94 server : fix extra BOS in infill endpoint (#11106) 1 year ago
  Xuan Son Nguyen 09186fabbe llama : remove check flash_attn with lora (#11104) 1 year ago
  Asghar Ghorbani 96a1dc27c3 llama : prevent system info string accumulation across calls (#11101) 1 year ago
  Daniel Bevenius 6369f867a4 llama : rename missed batch params/vars to ubatch (#10059) 1 year ago
  Georgi Gerganov 47182dd03f llama : update llama_model API names (#11063) 1 year ago
  Georgi Gerganov 3e6e7a6bc2 tokenize : escape the prompt (#11058) 1 year ago
  Georgi Gerganov ae2f606bb5 mmap : fix fileno macro clash (#11076) 1 year ago
  Georgi Gerganov 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) 1 year ago
  Georgi Gerganov 5047dd3546 llama : use _impl suffix instead of _internal (#11060) 1 year ago
  Johannes Gäßler 46e3556e01 CUDA: add BF16 support (#11093) 1 year ago
  0cc4m b56f079e28 Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (#11074) 1 year ago
  fairydreaming 9394bbd484 llama : Add support for DeepSeek V3 (#11049) 1 year ago
  matt23654 f922a9c542 [GGML][RPC] Support for models with non-512-aligned tensors over RPC. (#11047) 1 year ago
  DAN™ 46be942214 llama : add support for the cohere2 model architecture (#10900) 1 year ago
  Georgi Gerganov 78c6785175 sync : ggml 1 year ago
  Georgi Gerganov 5e3b08d606 ggml : do not install metal source when embed library (ggml/1054) 1 year ago
  Daniel Bevenius db68c93b57 ggml : improve inputs log sched_print_assignments (ggml/1053) 1 year ago
  Gilad S. c31fc8b966 fix: Vulkan shader gen binary path (#11037) 1 year ago
  Molly Sophia 4b0c638b9a common : disable KV cache shifting automatically for unsupported models (#11053) 1 year ago
  Georgi Gerganov e7da954ecc metal : avoid uint (#11019) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago
  Pierrick Hymbert 2f0ee84b9b server: bench: minor fixes (#10765) 1 year ago