커밋 기록

작성자 SHA1 메시지 날짜
  Georgi Gerganov 611aa914ef metal : optimize MoE for large batches (#13388) 8 달 전
  Johannes Gäßler 0cf6725e9f CUDA: FA support for Deepseek (Ampere or newer) (#13306) 8 달 전
  Diego Devesa 27ebfcacba llama : do not crash if there is no CPU backend (#13395) 8 달 전
  Johannes Gäßler 5c86c9ed3e CUDA: fix crash on large batch size for MoE models (#13384) 8 달 전
  Bartowski efb8b47eda imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389) 8 달 전
  R0CKSTAR 0527771dd8 llama-run: add support for downloading models from ModelScope (#13370) 8 달 전
  Xuan-Son Nguyen 2189fd3b63 mtmd : fix batch_view for m-rope (#13397) 8 달 전
  Xuan-Son Nguyen 3f96aeff39 llama : one-off chat template fix for Mistral-Small-2503 (#13398) 8 달 전
  Radoslav Gerganov b486ba05bf rpc : add rpc_msg_set_tensor_hash_req (#13353) 8 달 전
  Jeff Bolz 02115dcd9a vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326) 8 달 전
  Xuan-Son Nguyen d9c4accaff server : (webui) rename has_multimodal --> modalities (#13393) 8 달 전
  Diego Devesa 15e03282bb ci : limit write permission to only the release step + fixes (#13392) 8 달 전
  Matt Clayton f05a6d71a0 mtmd : Expose helper_decode_image_chunk (#13366) 8 달 전
  Xuan-Son Nguyen ee01d71e58 server : (webui) fix a very small misalignment (#13387) 8 달 전
  Xuan-Son Nguyen 8c83449cb7 server : (webui) revamp the input area, plus many small UI improvements (#13365) 8 달 전
  Sigbjørn Skjæret 1a844be132 convert : support rope_scaling type and rope_type (#13349) 8 달 전
  welix 0ccc121354 mtmd : fix the calculation of n_tokens for smolvlm (#13381) 8 달 전
  Georgi Gerganov 6562e5a4d6 context : allow cache-less context for embeddings (#13108) 8 달 전
  Georgi Gerganov 51fb96b1ff context : remove logits_all flag (#13284) 8 달 전
  Diego Devesa 70a6991edf ci : move release workflow to a separate file (#13362) 8 달 전
  Diego Devesa f061021206 llama : print size and type of overridden tensors (#13364) 8 달 전
  Alberto Cabrera Pérez 8733e0cf6e sycl: addressing non-contiguous src1 mul_mats (nc and batched) (#13343) 8 달 전
  Diego Devesa 814f795e06 docker : disable arm64 and intel images (#13356) 8 달 전
  Georgi Gerganov d879433824 sync : ggml 8 달 전
  Daniel Bevenius 13b0a04597 whisper: remove MSVC warnings pragmas (whisper/3090) 8 달 전
  Jared Tweed bba9d945c1 cmake : removed stdc++fs (whisper/3097) 8 달 전
  Sigbjørn Skjæret bc4e1128f7 llama : deci : support ffn-free with attention (#13296) 8 달 전
  Ycros 39e73ae0d6 common : Add a warning when we can't match samplers from a string or char. (#13330) 8 달 전
  R0CKSTAR 1f73301b63 cuda : remove nrows_x in mul_mat_q_process_tile (#13325) 8 달 전
  Georgi Gerganov 4773d7a02f examples : remove infill (#13283) 8 달 전