Commit history

Author SHA1 Message Date
  Ycros 39e73ae0d6 common : Add a warning when we can't match samplers from a string or char. (#13330) 8 months ago
  R0CKSTAR 1f73301b63 cuda : remove nrows_x in mul_mat_q_process_tile (#13325) 8 months ago
  Georgi Gerganov 4773d7a02f examples : remove infill (#13283) 8 months ago
  piDack 6c7fd67b64 llama : support tie embedding for chatglm models (#13328) 8 months ago
  Johannes Gäßler 141a908a59 CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF (#13135) 8 months ago
  Xuan-Son Nguyen 32916a4907 clip : refactor graph builder (#13321) 8 months ago
  DocShotgun ffc727203a sampling : make top_n_sigma no-op at <=0 or a single candidate (#13345) 8 months ago
  oobabooga 91a86a6f35 sampling : don't consider -infinity values in top_n_sigma (#13344) 8 months ago
  Diego Devesa f4ed10b69c cmake : remove arm64 msvc presets (#13342) 8 months ago
  Akarshan Biswas 1e333d5bba SYCL: Disable reorder optimize by default and stop setting tensor extras when optimize is disabled (#13254) 8 months ago
  Xuan-Son Nguyen 2f54e348ad llama : fix build_ffn without gate (#13336) 8 months ago
  Johannes Gäßler 2356fb1d53 CUDA: fix bad asserts for partial offload (#13337) 8 months ago
  Sigbjørn Skjæret 764b85627b convert : qwen2/3moe : set yarn metadata if present (#13331) 8 months ago
  Johannes Gäßler 15a28ec8c7 CUDA: fix --split-mode row for MMQ (#13323) 8 months ago
  compilade a7366faa5b gguf-py : avoid requiring pyside6 for other scripts (#13036) 8 months ago
  Johannes Gäßler 9070365020 CUDA: fix logic for clearing padding with -ngl 0 (#13320) 8 months ago
  oobabooga 233461f812 sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (#13264) 8 months ago
  igardev b34c859146 server : Webui - change setText command from parent window to also send the message. (#13309) 8 months ago
  Xuan-Son Nguyen 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 months ago
  Xuan-Son Nguyen 5215b91e93 clip : fix confused naming ffn_up and ffn_down (#13290) 8 months ago
  Sigbjørn Skjæret ae803bfc3d convert : bailingmoe : set yarn metadata if present (#13312) 8 months ago
  Akarshan Biswas 66645a5285 SYCL: Disable mul_mat kernels for noncontiguous tensor b (#13308) 8 months ago
  Xuan-Son Nguyen 27aa259532 mtmd : add C public API (#13184) 8 months ago
  Diego Devesa 9fdfcdaedd rpc : use backend registry, support dl backends (#13304) 8 months ago
  Aaron Teo 6eb7d25c70 ggml : activate s390x simd for Q3_K (#13301) 8 months ago
  Diego Devesa 86bd60d3fe llava/mtmd : fixes to fully support dl backends (#13303) 8 months ago
  Diego Devesa 9f2da5871f llama : build windows releases with dl backends (#13220) 8 months ago
  Johannes Gäßler 93c4e23905 CUDA: fix race condition in MMQ stream-k fixup (#13299) 8 months ago
  Johannes Gäßler 8afbd96818 CUDA: fix race condition in MMQ ids_dst (#13294) 8 months ago
  Jeff Bolz 8ae5ebcf85 vulkan: Additional type support for unary, binary, and copy (#13266) 8 months ago