Histórico de Commits

Autor SHA1 Mensagem Data
  Johannes Gäßler 6eecde3cc8 HIP: fix flash_attn_stream_k_fixup warning (#11604) há 11 meses atrás
  uvos 396856b400 CUDA/HIP: add support for selectable warp size to mmv (#11519) há 11 meses atrás
  uvos 4d0598e144 HIP: add GGML_CUDA_CC_IS_* for amd familys as increasing cc archtectures for amd gpus are not supersets of eatch other (#11601) há 11 meses atrás
  Olivier Chafik 90f9b88afb nit: more informative crash when grammar sampler fails (#11593) há 11 meses atrás
  Johannes Gäßler 864a0b67a6 CUDA: use mma PTX instructions for FlashAttention (#11583) há 11 meses atrás
  Eric Curtin 84ec8a58f7 Name colors (#11573) há 11 meses atrás
  Olivier Chafik bfcce4d693 `tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) (#11585) há 11 meses atrás
  Olivier Chafik 69804487e0 Fix exotic ci env that lacks ostringstream::str (#11581) há 11 meses atrás
  Michał Moskal ff227703d6 sampling : support for llguidance grammars (#10224) há 11 meses atrás
  piDack 0cec062a63 llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) há 11 meses atrás
  Olivier Chafik 53debe6f3c ci: use sccache on windows HIP jobs (#11553) há 11 meses atrás
  Olivier Chafik cfd74c86db `sync`: minja (https://github.com/google/minja/commit/418a2364b56dc9be4ed9a1a2b0fb16fb53a7a22e) (#11574) há 11 meses atrás
  Eric Curtin ecef206ccb Implement s3:// protocol (#11511) há 11 meses atrás
  Olivier Chafik 5bbc7362cb ci: simplify cmake build commands (#11548) há 11 meses atrás
  Olivier Chafik aa6fb13213 `ci`: use sccache on windows instead of ccache (#11545) há 11 meses atrás
  Olivier Chafik a83f528688 `tool-call`: fix llama 3.x and functionary 3.2, play nice w/ pydantic_ai package, update readme (#11539) há 11 meses atrás
  Olivier Chafik b1bcd309fc fix stop regression (#11543) há 11 meses atrás
  Olivier Chafik 5783575c9d Fix chatml fallback for unsupported builtin templates (when --jinja not enabled) (#11533) há 11 meses atrás
  Olivier Chafik 4a2b196d03 server : fix --jinja when there's no tools or schema (typo was forcing JSON) (#11531) há 11 meses atrás
  Steve Grubb 1bd3047a93 common: Add missing va_end (#11529) há 11 meses atrás
  Daniel Bevenius a2df2787b3 server : update help metrics processing/deferred (#11512) há 11 meses atrás
  Olivier Chafik 553f1e46e9 `ci`: ccache for all github worfklows (#11516) há 11 meses atrás
  Olivier Chafik 8b576b6c55 Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639) há 11 meses atrás
  uvos 27d135c970 HIP: require at least HIP 5.5 há 11 meses atrás
  uvos 6af1ca48cb HIP: Prepare reduction operators for wave 64 há 11 meses atrás
  uvos c300e68ef4 CUDA/HIP: add warp_size to cuda_device_info há 11 meses atrás
  Olivier Chafik 3d804dec76 sync: minja (#11499) há 11 meses atrás
  mgroeber9110 ffd0821c57 vocab : correctly identify LF token for GPT-2 style BPE tokenizer (#11496) há 11 meses atrás
  Daniel Bevenius 4314e56c4f server : use lambda instead of std::bind (#11507) há 11 meses atrás
  Isaac McFadyen 496e5bf46b server : (docs) added response format for /apply-template [no ci] (#11503) há 11 meses atrás